A survey of data provenance in e-science

SIGMOD Rec., Vol. 34, No. 3. (September 2005), pp. 31-36.

Abstract

Data management is growing in complexity as large-scale applications take advantage of the loosely coupled resources brought together by grid middleware and by abundant storage capacity. Metadata describing the data products used in and generated by these applications is essential to disambiguate the data and enable reuse. Data provenance, one kind of metadata, pertains to the derivation history of a data product starting from its original sources. In this paper we create a taxonomy of data provenance characteristics and apply it to current research efforts in e-science, focusing primarily on scientific workflow approaches. The main aspect of our taxonomy categorizes provenance systems based on why they record provenance, what they describe, how they represent and store provenance, and ways to disseminate it. The survey culminates with an identification of open research problems in the field.

This article has been bookmarked 9 times, initially on 2006-05-31.

2009-06-10 User johnwilkes, 1 note

Nice overview of many of the ways in which provenance is gathered, used, and tracked. Takes a fairly broad view of what provenance includes.

2009-06-10 00:12:28
2009-05-28 User shankark
2008-04-08 User elsantosneto
2008-02-05 User apadmana
2007-04-18 User ragibhasan
2006-11-14 User simmie
2006-08-09 User simmhan
2006-07-15 User render
2006-05-31 User neilernst