Document Type


Publication Date



Provenance metadata describes the 'lineage' or history of an entity and necessary information to verify the quality of data, validate experiment protocols, and associate trust value with scientific results. eScience projects generate data and the associated provenance metadata in a distributed environment (such as myGrid) and on a very large scale that often precludes manual analysis. Given this scenario, provenance information should be, (a) interoperable across projects, research groups, and application domains, and (b) support analysis over large datasets using reasoning to discover implicit information. In this paper, we introduce an ontology-driven framework for eScience provenance management underpinned by an 'upper-level' ontology called provenir defined in OWL-DL. This framework is implemented in a modular fashion by extending provenir ontology to create a suite of domain-specific provenance ontologies that facilitate interoperability and enable reasoning. We demonstrate the application of this framework in two eScience projects domains through creation of, (a) Parasite Experiment ontology to model provenance in parasite research, and (b) Trident ontology to model provenance in the Neptune oceanography project.

Additional Files

Provenir_ontology_CommonProvenanceModel.pdf (1463 kB)
PowerPoint Slides of Presentation