Classification of samples from NMR-based metabolomics using principal components analysis and partial least squares with uncertainty estimation

Werickson Fortunato de Carvalho Rocha; David Sheen; Dan Bearden

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

PUBLICATIONS

Classification of samples from NMR-based metabolomics using principal components analysis and partial least squares with uncertainty estimation

Published

July 25, 2018

Author(s)

Werickson Fortunato de Carvalho Rocha, David Sheen, Dan Bearden

Abstract

Recent progress in metabolomics has been aided by the development of analysis techniques such as gas and liquid chromatography coupled with mass spectrometry (GC-MS and LC-MS) and nuclear magnetic resonance (NMR) spectroscopy. The vast quantities of data produced by these techniques has resulted in an increase in the use of machine algorithms that can aid in the interpretation of this data, such as principal components analysis (PCA) and partial least squares (PLS). Techniques such as these can be applied to biomarker discovery, interlaboratory comparison, and clinical diagnoses. However, there is a lingering question whether the results of these studies can be applied to broader sets of clinical data, usually taken from different data sources. In this work, we address this question by creating a metabolomics workflow that combines a previously-published consensus analysis procedure (10.1016/j.chemolab.2016.12.010) with PCA and PLS models using uncertainty analysis based on bootstrapping. This workflow is applied to NMR data that come from an interlaboratory comparison study using synthetic and biologically- obtained metabolite mixtures. The consensus analysis identifies trusted laboratories, whose data are used to create classification models that are more reliable than without. With uncertainty analysis, the reliability of the classification can be rigorously quantified, both for data from the original set and from new data that the model is analyzing.

Citation

Analytical and Bioanalytical Chemistry

Pub Type

Journals

Download Paper

https://doi.org/10.1007/s00216-018-1240-2

Local Download

Keywords

Metabolomics, reliability, bootstrap, uncertainty estimation, chemometrics, biomarker discovery

Uncertainty quantification and Analytical chemistry

Citation

Fortunato de Carvalho Rocha, W. , Sheen, D. and Bearden, D. (2018), Classification of samples from NMR-based metabolomics using principal components analysis and partial least squares with uncertainty estimation, Analytical and Bioanalytical Chemistry, [online], https://doi.org/10.1007/s00216-018-1240-2, https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=923049 (Accessed July 25, 2026)

Additional citation formats

Issues

If you have any questions about this publication or are having problems accessing it, please contact [email protected].

Created July 24, 2018, Updated October 12, 2021

Was this page helpful?