Disparate Metabolomics Data Reassembler: A Novel Algorithm for Agglomerating Incongruent LC-MS Metabolomics Datasets

Tytus D. Mak; Maryam Goudarzi; Evagelia C. Laiakis; Stephen E. Stein

Published

March 2, 2020

Abstract

In the past decade, LC-MS-based metabolomics has grown from an obscure specialty into a major -omics platform for studying metabolic processes and characterizing biomolecules. The field as a whole remains fractured, however, because the nature of the instrumentation and of the information it produces essentially creates incompatible islands of datasets. This lack of data coherence prevents the field from accumulating the critical mass of data that has enabled other -omics platforms to make impactful discoveries and meaningful advances. We have therefore developed a novel algorithm, the Disparate Metabolomics Data Reassembler (DIMEDR), which attempts to bridge the inconsistencies between incongruent LC-MS metabolomics datasets of the same biological sample type. A single primary dataset is postprocessed by traditional means of peak identification, alignment, and grouping. DIMEDR then uses this primary dataset as a progenitor template by which data from subsequent disparate datasets are reassembled and integrated into a unified framework that maximizes spectral feature similarity across all samples. This is accomplished by a novel procedure for universal retention time correction and comparison: ubiquitous features are identified in the primary dataset and subsequently used as endogenous internal standards during integration. As a demonstration, two human and two mouse urine metabolomics datasets from four unrelated studies acquired over 4 years were unified via DIMEDR, which enabled meaningful analysis across otherwise incomparable and unrelated datasets.
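
The retention time correction idea in the abstract can be illustrated with a minimal Python sketch. This is not the DIMEDR implementation, which the abstract does not detail. It assumes that "ubiquitous" means detected in every sample of the primary dataset, that anchors are matched across datasets by m/z within a ppm tolerance and a retention time window, and that the time axes are related by piecewise-linear interpolation between anchors. All function names, parameters, and tolerance values here are hypothetical.

```python
import numpy as np

def ubiquitous_features(intensity, min_fraction=1.0):
    """Indices of features detected (intensity > 0) in at least
    min_fraction of the samples. Rows are samples, columns are features.
    Assumption: 'ubiquitous' is approximated by detection frequency."""
    detected = (intensity > 0).mean(axis=0)
    return np.flatnonzero(detected >= min_fraction)

def match_anchors(anchor_mz, anchor_rt, other_mz, other_rt,
                  ppm=10.0, rt_window=60.0):
    """Pair each primary anchor with the secondary feature that lies within
    the m/z and RT tolerances and is closest in RT.
    Returns an (n, 2) array of (secondary_rt, primary_rt) pairs,
    sorted by secondary_rt with duplicate secondary_rt values removed."""
    pairs = []
    for mz, rt in zip(anchor_mz, anchor_rt):
        ok = (np.abs(other_mz - mz) / mz * 1e6 <= ppm) & \
             (np.abs(other_rt - rt) <= rt_window)
        candidates = np.flatnonzero(ok)
        if candidates.size:
            best = candidates[np.argmin(np.abs(other_rt[candidates] - rt))]
            pairs.append((other_rt[best], rt))
    pairs = np.array(pairs, dtype=float).reshape(-1, 2)
    # np.interp needs increasing x values, so keep one pair per secondary RT.
    _, first = np.unique(pairs[:, 0], return_index=True)
    return pairs[first]

def correct_rt(other_rt, anchor_pairs):
    """Map secondary retention times onto the primary time axis by
    piecewise-linear interpolation between matched anchors.
    Values outside the anchor range are clamped to the end anchors,
    which a real implementation would need to handle more carefully."""
    if len(anchor_pairs) < 2:
        raise ValueError("need at least two matched anchors")
    return np.interp(other_rt, anchor_pairs[:, 0], anchor_pairs[:, 1])

# Toy data (made up for illustration): 3 primary samples x 4 features.
primary_intensity = np.array([[5.0, 3.0, 0.0, 2.0],
                              [4.0, 6.0, 1.0, 3.0],
                              [7.0, 2.0, 2.0, 1.0]])
primary_mz = np.array([100.0, 200.0, 300.0, 400.0])
primary_rt = np.array([60.0, 120.0, 180.0, 240.0])     # seconds
secondary_mz = np.array([100.0005, 200.001, 250.0, 400.002])
secondary_rt = np.array([70.0, 135.0, 150.0, 260.0])   # seconds

anchors = ubiquitous_features(primary_intensity)
pairs = match_anchors(primary_mz[anchors], primary_rt[anchors],
                      secondary_mz, secondary_rt)
corrected = correct_rt(secondary_rt, pairs)
```

In this toy case the third primary feature is missing from one sample, so it is excluded from the anchor set, and the unmatched secondary feature at m/z 250 is placed on the primary time axis by interpolating between its neighboring anchors.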

Citation

Analytical Chemistry

Pub Type

Journals

Keywords

Metabolomics, harmonization, informatics, mass spectrometry, liquid chromatography

Statistical analysis, Molecular characterization, Experiment design, Biotechnology and Analytical chemistry

Citation

Mak, T., Goudarzi, M., Laiakis, E. and Stein, S. (2020), Disparate Metabolomics Data Reassembler: A Novel Algorithm for Agglomerating Incongruent LC-MS Metabolomics Datasets, Analytical Chemistry, [online], https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=928910 (Accessed July 30, 2026)

Created March 2, 2020, Updated June 3, 2020
