Kno.e.sis Publications

A Unified Framework fro Managing Provenance Information in Translational Research

Satya S. Sahoo, Wright State University - Main Campus
Vinh Nguyen, Wright State University - Main CampusFollow
Olivier Bodenreider
Priti Parikh, Wright State University - Main CampusFollow
Todd Minning
Amit P. Sheth, Wright State University - Main CampusFollow

Document Type

Article

Publication Date

2011

Abstract

Background

A critical aspect of the NIH Translational Research roadmap, which seeks to accelerate the delivery of "bench-side" discoveries to patient's "bedside," is the management of the provenance metadata that keeps track of the origin and history of data resources as they traverse the path from the bench to the bedside and back. A comprehensive provenance framework is essential for researchers to verify the quality of data, reproduce scientific results published in peer-reviewed literature, validate scientific process, and associate trust value with data and results. Traditional approaches to provenance management have focused on only partial sections of the translational research life cycle and they do not incorporate "domain semantics", which is essential to support domain-specific querying and analysis by scientists.

Results

We identify a common set of challenges in managing provenance information across the pre-publication and post-publication phases of data in the translational research lifecycle. We define the semantic provenance framework (SPF), underpinned by the Provenir upper-level provenance ontology, to address these challenges in the four stages of provenance metadata:

(a) Provenance collection - during data generation

(b) Provenance representation - to support interoperability, reasoning, and incorporate domain semantics

(c) Provenance storage and propagation - to allow efficient storage and seamless propagation of provenance as the data is transferred across applications

(d) Provenance query - to support queries with increasing complexity over large data size and also support knowledge discovery applications

We apply the SPF to two exemplar translational research projects, namely the Semantic Problem Solving Environment for Trypanosoma cruzi (T.cruzi SPSE) and the Biomedical Knowledge Repository (BKR) project, to demonstrate its effectiveness.

Conclusions

The SPF provides a unified framework to effectively manage provenance of translational research data during pre and post-publication phases. This framework is underpinned by an upper-level provenance ontology called Provenir that is extended to create domain-specific provenance ontologies to facilitate provenance interoperability, seamless propagation of provenance, automated querying, and analysis.

Repository Citation

Sahoo, S. S., Nguyen, V., Bodenreider, O., Parikh, P., Minning, T., & Sheth, A. P. (2011). A Unified Framework fro Managing Provenance Information in Translational Research. BMC Bioinformatics, 12, 461.
https://corescholar.libraries.wright.edu/knoesis/605

DOI

10.1186/1471-2105-12-461

Download

Included in

Bioinformatics Commons, Communication Technology and New Media Commons, Databases and Information Systems Commons, OS and Networks Commons, Science and Technology Studies Commons

COinS

Kno.e.sis Publications

A Unified Framework fro Managing Provenance Information in Translational Research

Document Type

Publication Date

Abstract

Background

Results

Conclusions

Repository Citation

DOI

Included in

Search

Browse

About

SelectedWorks Sites

Kno.e.sis Publications

A Unified Framework fro Managing Provenance Information in Translational Research

Authors

Document Type

Publication Date

Abstract

Background

Results

Conclusions

Repository Citation

DOI

Included in

Share

Search

Browse

About

SelectedWorks Sites