Date Topic Reading
1/13/16 Course overview
1/20/16 Database Provenance 

Section 1.1-1.2  of Why, How, Where, by Cheney, Chiticariu and Tan.   Foundations and Trends in Databases 1:4: 379-392 (2009).

1/27/16 Workflow Provenance 

Provenance in Scientific Workflow Systems, by Davidson, Cohen-Boulakia, Eyal, Ludasher, McPhillips, Bowers, Anand, Freire. IEEE Data Engineering Bulletin 30(4): 44-50 (2007).

Section 1-2.2 of Labeling Workflow Views with Fine-Grained Dependencies, by Bao, Davidson and Milo. PVLDB 5(11): 1208-1219 (2012).

1/29/16 Implementing Provenance Note that in lieu of meeting on 1/25 we will meet on Friday 1/29, 1-2:15pm, Moore 102 (DSL) Conference Room for a special talk by Zack Ives.
2/1/16 Background material:  Datalog

Ch. 24 (Deductive Databases) of Database Management Systems by Ramakrishnan and Gehrke

2/3/16 Background material:  RDF and SPARQL RDF and SPARQL
2/8/16 Data Citation Why data citation is a computational challenge, by Buneman, Davidson and Frew.  Submitted to CACM.
2/12/16 Data Citation

See also:  A Methodology for Citing Linked Open Data Subsets by Gianmaria Silvello.  D-Lib Magazine 21:1/2 (Jan/Feb 2015).   

Note that in lieu of meeting on Wednesday 2/10 we will meet on Friday 2/12, 1-2:15pm, Moore 102 (DSL) Conference Room for a special talk by Gianmaria Silvello.

2/17/16 Provenance graphs, querying provenance, and PROV

Querying Data Provenance, by Karvounarakis, Ives and Tannen.  SIGMOD (2010).

PROV Primer  and Toolbox

2/19/16 Search-driven data integration

Collaborative learning for search-driven data integration, by Yan et al.

2/22 Class cancelled Make progress on project
2/24/16 Causality Causality in Databases, by Meliou et al.
2/29/16 Class cancelled Make progress on project
3/2/16 Class cancelled Make progress on project
3/14/16 NoWorkflow

noWorkflow:  Capturing and Analyzing Provenance of Scripts, by Murta et al.

3/16/16 Data Citation Scalable data citation in dynamic, large databases, by Proll and Rauber.
3/21/16 Archiving Archiving Scientific Data, by Buneman, Khanna, Tajima and Tan.
3/25/16 Reducing provenance

Approximated Summarization of Database Provenance, by Ainy et al.

Note that in lieu of meeting on Wednesday 3/23 we will meet on Friday 3/25, 1-2:15pm, Moore 102 (DSL) Conference Room.

3/28/16 Reducing provenance

Selective Provenance for Datalog Programs Using Top-k Queries, by Deutch, Gilad, Moskovitch.  1484-1487 (2015).

3/30/16

 In lieu of class, we will have individualized meetings to discuss progress on project.

4/4/16 Provenance and privacy On provenance and privacy, by Davidson et al.
4/8/16

Class cancelled, we will meet on Friday 4/15, 1-2:15pm, Moore 102 (DSL) Conference Room for a talk by Tannen.

4/11/16 Rewriting using views
(related to Data Citation)
Query Reformulation with Constraints, by Deutsch, Popa and Tannen.
4/13/16 Data Achiving A Versioning and Evolution Framework for RDF Knowledge Bases, by Auer and Herre
4/15 Provenance for Matrices Fine-grained provenance for linear algebra operators,by Yan, Tannen, and Ives.  Under submission.
4/18/16 Rewriting using views, cont. Query Reformulation with Constraints, by Deutsch, Popa and Tannen.
4/20/16

Levine 307, 2:10-3:10  (optional) talk by Yale Patt, Franklin Award winner:  The END of X, the BEGINNING of Y and what they mean for future microprocessors.

See http://www.cis.upenn.edu/~cjtaylor/Patt16/symposium.html for complete schedule.

4/25/16 Storing provenance A hybrid approach for efficient provenance storage, by Xie, et al.

4/27/16

Project presentations.