Back to Search Start Over

Golden Trail: Retrieving the Data History that Matters from a Comprehensive Provenance Repository

Authors :
Paolo Missier
Bertram Ludäscher
Saumen Dey
Michael Wang
Tim McPhillips
Shawn Bowers
Michael Agun
Ilkay Altintas
Source :
International Journal of Digital Curation, Vol 7, Iss 1, Pp 139-150 (2012)
Publication Year :
2012
Publisher :
University of Edinburgh, 2012.

Abstract

Experimental science can be thought of as the exploration of a large research space, in search of a few valuable results. While it is this “Golden Data” that gets published, the history of the exploration is often as valuable to the scientists as some of its outcomes. We envision an e-research infrastructure that is capable of systematically and automatically recording such history – an assumption that holds today for a number of workflow management systems routinely used in e-science. In keeping with our gold rush metaphor, the provenance of a valuable result is a “Golden Trail”. Logically, this represents a detailed account of how the Golden Data was arrived at, and technically it is a sub-graph in the much larger graph of provenance traces that collectively tell the story of the entire research (or of some of it).In this paper we describe a model and architecture for a repository dedicated to storing provenance traces and selectively retrieving Golden Trails from it. As traces from multiple experiments over long periods of time are accommodated, the trails may be sub-graphs of one trace, or they may be the logical representation of a virtual experiment obtained by joining together traces that share common data.The project has been carried out within the Provenance Working Group of the Data Observation Network for Earth (DataONE) NSF project. Ultimately, our longer-term plan is to integrate the provenance repository into the data preservation architecture currently being developed by DataONE.

Details

Language :
English
ISSN :
17468256
Volume :
7
Issue :
1
Database :
Directory of Open Access Journals
Journal :
International Journal of Digital Curation
Publication Type :
Academic Journal
Accession number :
edsdoj.b86dd67437cc465cba04622f97fdcdd8
Document Type :
article
Full Text :
https://doi.org/10.2218/ijdc.v7i1.221