Cargando…

Mixture model normalization for non-targeted gas chromatography/mass spectrometry metabolomics data

BACKGROUND: Metabolomics offers a unique integrative perspective for health research, reflecting genetic and environmental contributions to disease-related phenotypes. Identifying robust associations in population-based or large-scale clinical studies demands large numbers of subjects and therefore...

Descripción completa

Detalles Bibliográficos
Autores principales: Reisetter, Anna C., Muehlbauer, Michael J., Bain, James R., Nodzenski, Michael, Stevens, Robert D., Ilkayeva, Olga, Metzger, Boyd E., Newgard, Christopher B., Lowe, William L., Scholtens, Denise M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5290663/
https://www.ncbi.nlm.nih.gov/pubmed/28153035
http://dx.doi.org/10.1186/s12859-017-1501-7
_version_ 1782504679267106816
author Reisetter, Anna C.
Muehlbauer, Michael J.
Bain, James R.
Nodzenski, Michael
Stevens, Robert D.
Ilkayeva, Olga
Metzger, Boyd E.
Newgard, Christopher B.
Lowe, William L.
Scholtens, Denise M.
author_facet Reisetter, Anna C.
Muehlbauer, Michael J.
Bain, James R.
Nodzenski, Michael
Stevens, Robert D.
Ilkayeva, Olga
Metzger, Boyd E.
Newgard, Christopher B.
Lowe, William L.
Scholtens, Denise M.
author_sort Reisetter, Anna C.
collection PubMed
description BACKGROUND: Metabolomics offers a unique integrative perspective for health research, reflecting genetic and environmental contributions to disease-related phenotypes. Identifying robust associations in population-based or large-scale clinical studies demands large numbers of subjects and therefore sample batching for gas-chromatography/mass spectrometry (GC/MS) non-targeted assays. When run over weeks or months, technical noise due to batch and run-order threatens data interpretability. Application of existing normalization methods to metabolomics is challenged by unsatisfied modeling assumptions and, notably, failure to address batch-specific truncation of low abundance compounds. RESULTS: To curtail technical noise and make GC/MS metabolomics data amenable to analyses describing biologically relevant variability, we propose mixture model normalization (mixnorm) that accommodates truncated data and estimates per-metabolite batch and run-order effects using quality control samples. Mixnorm outperforms other approaches across many metrics, including improved correlation of non-targeted and targeted measurements and superior performance when metabolite detectability varies according to batch. For some metrics, particularly when truncation is less frequent for a metabolite, mean centering and median scaling demonstrate comparable performance to mixnorm. CONCLUSIONS: When quality control samples are systematically included in batches, mixnorm is uniquely suited to normalizing non-targeted GC/MS metabolomics data due to explicit accommodation of batch effects, run order and varying thresholds of detectability. Especially in large-scale studies, normalization is crucial for drawing accurate conclusions from non-targeted GC/MS metabolomics data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1501-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5290663
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-52906632017-02-07 Mixture model normalization for non-targeted gas chromatography/mass spectrometry metabolomics data Reisetter, Anna C. Muehlbauer, Michael J. Bain, James R. Nodzenski, Michael Stevens, Robert D. Ilkayeva, Olga Metzger, Boyd E. Newgard, Christopher B. Lowe, William L. Scholtens, Denise M. BMC Bioinformatics Methodology Article BACKGROUND: Metabolomics offers a unique integrative perspective for health research, reflecting genetic and environmental contributions to disease-related phenotypes. Identifying robust associations in population-based or large-scale clinical studies demands large numbers of subjects and therefore sample batching for gas-chromatography/mass spectrometry (GC/MS) non-targeted assays. When run over weeks or months, technical noise due to batch and run-order threatens data interpretability. Application of existing normalization methods to metabolomics is challenged by unsatisfied modeling assumptions and, notably, failure to address batch-specific truncation of low abundance compounds. RESULTS: To curtail technical noise and make GC/MS metabolomics data amenable to analyses describing biologically relevant variability, we propose mixture model normalization (mixnorm) that accommodates truncated data and estimates per-metabolite batch and run-order effects using quality control samples. Mixnorm outperforms other approaches across many metrics, including improved correlation of non-targeted and targeted measurements and superior performance when metabolite detectability varies according to batch. For some metrics, particularly when truncation is less frequent for a metabolite, mean centering and median scaling demonstrate comparable performance to mixnorm. CONCLUSIONS: When quality control samples are systematically included in batches, mixnorm is uniquely suited to normalizing non-targeted GC/MS metabolomics data due to explicit accommodation of batch effects, run order and varying thresholds of detectability. Especially in large-scale studies, normalization is crucial for drawing accurate conclusions from non-targeted GC/MS metabolomics data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1501-7) contains supplementary material, which is available to authorized users. BioMed Central 2017-02-02 /pmc/articles/PMC5290663/ /pubmed/28153035 http://dx.doi.org/10.1186/s12859-017-1501-7 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Reisetter, Anna C.
Muehlbauer, Michael J.
Bain, James R.
Nodzenski, Michael
Stevens, Robert D.
Ilkayeva, Olga
Metzger, Boyd E.
Newgard, Christopher B.
Lowe, William L.
Scholtens, Denise M.
Mixture model normalization for non-targeted gas chromatography/mass spectrometry metabolomics data
title Mixture model normalization for non-targeted gas chromatography/mass spectrometry metabolomics data
title_full Mixture model normalization for non-targeted gas chromatography/mass spectrometry metabolomics data
title_fullStr Mixture model normalization for non-targeted gas chromatography/mass spectrometry metabolomics data
title_full_unstemmed Mixture model normalization for non-targeted gas chromatography/mass spectrometry metabolomics data
title_short Mixture model normalization for non-targeted gas chromatography/mass spectrometry metabolomics data
title_sort mixture model normalization for non-targeted gas chromatography/mass spectrometry metabolomics data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5290663/
https://www.ncbi.nlm.nih.gov/pubmed/28153035
http://dx.doi.org/10.1186/s12859-017-1501-7
work_keys_str_mv AT reisetterannac mixturemodelnormalizationfornontargetedgaschromatographymassspectrometrymetabolomicsdata
AT muehlbauermichaelj mixturemodelnormalizationfornontargetedgaschromatographymassspectrometrymetabolomicsdata
AT bainjamesr mixturemodelnormalizationfornontargetedgaschromatographymassspectrometrymetabolomicsdata
AT nodzenskimichael mixturemodelnormalizationfornontargetedgaschromatographymassspectrometrymetabolomicsdata
AT stevensrobertd mixturemodelnormalizationfornontargetedgaschromatographymassspectrometrymetabolomicsdata
AT ilkayevaolga mixturemodelnormalizationfornontargetedgaschromatographymassspectrometrymetabolomicsdata
AT metzgerboyde mixturemodelnormalizationfornontargetedgaschromatographymassspectrometrymetabolomicsdata
AT newgardchristopherb mixturemodelnormalizationfornontargetedgaschromatographymassspectrometrymetabolomicsdata
AT lowewilliaml mixturemodelnormalizationfornontargetedgaschromatographymassspectrometrymetabolomicsdata
AT scholtensdenisem mixturemodelnormalizationfornontargetedgaschromatographymassspectrometrymetabolomicsdata