Kno.e.sis Publications

Characterization of 1H NMR Spectroscopic Data and the Generation of Synthetic Validation Sets

Document Type

Article

Publication Date

2009

Abstract

Motivation: Common contemporary practice within the nuclear magnetic resonance (NMR) metabolomics community is to evaluate and validate novel algorithms on empirical data or simplified simulated data. Empirical data captures the complex characteristics of experimental data, but the optimal or most correct analysis is unknown a priori; therefore, researchers are forced to rely on indirect performance metrics, which are of limited value. In order to achieve fair and complete analysis of competing techniques more exacting metrics are required. Thus, metabolomics researchers often evaluate their algorithms on simplified simulated data with a known answer. Unfortunately, the conclusions obtained on simulated data are only of value if the data sets are complex enough for results to generalize to true experimental data. Ideally, synthetic data should be indistinguishable from empirical data, yet retain a known best analysis.

Results: We have developed a technique for creating realistic synthetic metabolomics validation sets based on NMR spectroscopic data. The validation sets are developed by characterizing the salient distributions in sets of empirical spectroscopic data. Using this technique, several validation sets are constructed with a variety of characteristics present in ‘real’ data. A case study is then presented to compare the relative accuracy of several alignment algorithms using the increased precision afforded by these synthetic data sets.

Availability: These data sets are available for download at http://birg.cs.wright.edu/nmr_synthetic_data_sets. Contact: travis.doom@wright.edu

Supplementary information: Supplementary data are available at Bioinformatics online.

Comments

Repository Citation

Anderson, P. E., Raymer, M. L., Kelly, B. J., Reo, N. V., DelRaso, N. J., & Doom, T. E. (2009). Characterization of 1H NMR Spectroscopic Data and the Generation of Synthetic Validation Sets. Bioinformatics, 25 (22), 2992-3000.
https://corescholar.libraries.wright.edu/knoesis/82

DOI

10.1093/bioinformatics/btp540

Download

Request Accessible Version

Included in

Bioinformatics Commons, Communication Technology and New Media Commons, Databases and Information Systems Commons, OS and Networks Commons, Science and Technology Studies Commons

COinS

Kno.e.sis Publications

Characterization of 1H NMR Spectroscopic Data and the Generation of Synthetic Validation Sets

Document Type

Publication Date

Abstract

Comments

Repository Citation

DOI

Included in

Search

Browse

About

Kno.e.sis Publications

Characterization of 1H NMR Spectroscopic Data and the Generation of Synthetic Validation Sets

Authors

Document Type

Publication Date

Abstract

Comments

Repository Citation

DOI

Included in

Share

Search

Browse

About