Cargando…
Mixture model normalization for non-targeted gas chromatography/mass spectrometry metabolomics data
BACKGROUND: Metabolomics offers a unique integrative perspective for health research, reflecting genetic and environmental contributions to disease-related phenotypes. Identifying robust associations in population-based or large-scale clinical studies demands large numbers of subjects and therefore...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5290663/ https://www.ncbi.nlm.nih.gov/pubmed/28153035 http://dx.doi.org/10.1186/s12859-017-1501-7 |
_version_ | 1782504679267106816 |
---|---|
author | Reisetter, Anna C. Muehlbauer, Michael J. Bain, James R. Nodzenski, Michael Stevens, Robert D. Ilkayeva, Olga Metzger, Boyd E. Newgard, Christopher B. Lowe, William L. Scholtens, Denise M. |
author_facet | Reisetter, Anna C. Muehlbauer, Michael J. Bain, James R. Nodzenski, Michael Stevens, Robert D. Ilkayeva, Olga Metzger, Boyd E. Newgard, Christopher B. Lowe, William L. Scholtens, Denise M. |
author_sort | Reisetter, Anna C. |
collection | PubMed |
description | BACKGROUND: Metabolomics offers a unique integrative perspective for health research, reflecting genetic and environmental contributions to disease-related phenotypes. Identifying robust associations in population-based or large-scale clinical studies demands large numbers of subjects and therefore sample batching for gas-chromatography/mass spectrometry (GC/MS) non-targeted assays. When run over weeks or months, technical noise due to batch and run-order threatens data interpretability. Application of existing normalization methods to metabolomics is challenged by unsatisfied modeling assumptions and, notably, failure to address batch-specific truncation of low abundance compounds. RESULTS: To curtail technical noise and make GC/MS metabolomics data amenable to analyses describing biologically relevant variability, we propose mixture model normalization (mixnorm) that accommodates truncated data and estimates per-metabolite batch and run-order effects using quality control samples. Mixnorm outperforms other approaches across many metrics, including improved correlation of non-targeted and targeted measurements and superior performance when metabolite detectability varies according to batch. For some metrics, particularly when truncation is less frequent for a metabolite, mean centering and median scaling demonstrate comparable performance to mixnorm. CONCLUSIONS: When quality control samples are systematically included in batches, mixnorm is uniquely suited to normalizing non-targeted GC/MS metabolomics data due to explicit accommodation of batch effects, run order and varying thresholds of detectability. Especially in large-scale studies, normalization is crucial for drawing accurate conclusions from non-targeted GC/MS metabolomics data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1501-7) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5290663 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-52906632017-02-07 Mixture model normalization for non-targeted gas chromatography/mass spectrometry metabolomics data Reisetter, Anna C. Muehlbauer, Michael J. Bain, James R. Nodzenski, Michael Stevens, Robert D. Ilkayeva, Olga Metzger, Boyd E. Newgard, Christopher B. Lowe, William L. Scholtens, Denise M. BMC Bioinformatics Methodology Article BACKGROUND: Metabolomics offers a unique integrative perspective for health research, reflecting genetic and environmental contributions to disease-related phenotypes. Identifying robust associations in population-based or large-scale clinical studies demands large numbers of subjects and therefore sample batching for gas-chromatography/mass spectrometry (GC/MS) non-targeted assays. When run over weeks or months, technical noise due to batch and run-order threatens data interpretability. Application of existing normalization methods to metabolomics is challenged by unsatisfied modeling assumptions and, notably, failure to address batch-specific truncation of low abundance compounds. RESULTS: To curtail technical noise and make GC/MS metabolomics data amenable to analyses describing biologically relevant variability, we propose mixture model normalization (mixnorm) that accommodates truncated data and estimates per-metabolite batch and run-order effects using quality control samples. Mixnorm outperforms other approaches across many metrics, including improved correlation of non-targeted and targeted measurements and superior performance when metabolite detectability varies according to batch. For some metrics, particularly when truncation is less frequent for a metabolite, mean centering and median scaling demonstrate comparable performance to mixnorm. CONCLUSIONS: When quality control samples are systematically included in batches, mixnorm is uniquely suited to normalizing non-targeted GC/MS metabolomics data due to explicit accommodation of batch effects, run order and varying thresholds of detectability. Especially in large-scale studies, normalization is crucial for drawing accurate conclusions from non-targeted GC/MS metabolomics data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1501-7) contains supplementary material, which is available to authorized users. BioMed Central 2017-02-02 /pmc/articles/PMC5290663/ /pubmed/28153035 http://dx.doi.org/10.1186/s12859-017-1501-7 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Reisetter, Anna C. Muehlbauer, Michael J. Bain, James R. Nodzenski, Michael Stevens, Robert D. Ilkayeva, Olga Metzger, Boyd E. Newgard, Christopher B. Lowe, William L. Scholtens, Denise M. Mixture model normalization for non-targeted gas chromatography/mass spectrometry metabolomics data |
title | Mixture model normalization for non-targeted gas chromatography/mass spectrometry metabolomics data |
title_full | Mixture model normalization for non-targeted gas chromatography/mass spectrometry metabolomics data |
title_fullStr | Mixture model normalization for non-targeted gas chromatography/mass spectrometry metabolomics data |
title_full_unstemmed | Mixture model normalization for non-targeted gas chromatography/mass spectrometry metabolomics data |
title_short | Mixture model normalization for non-targeted gas chromatography/mass spectrometry metabolomics data |
title_sort | mixture model normalization for non-targeted gas chromatography/mass spectrometry metabolomics data |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5290663/ https://www.ncbi.nlm.nih.gov/pubmed/28153035 http://dx.doi.org/10.1186/s12859-017-1501-7 |
work_keys_str_mv | AT reisetterannac mixturemodelnormalizationfornontargetedgaschromatographymassspectrometrymetabolomicsdata AT muehlbauermichaelj mixturemodelnormalizationfornontargetedgaschromatographymassspectrometrymetabolomicsdata AT bainjamesr mixturemodelnormalizationfornontargetedgaschromatographymassspectrometrymetabolomicsdata AT nodzenskimichael mixturemodelnormalizationfornontargetedgaschromatographymassspectrometrymetabolomicsdata AT stevensrobertd mixturemodelnormalizationfornontargetedgaschromatographymassspectrometrymetabolomicsdata AT ilkayevaolga mixturemodelnormalizationfornontargetedgaschromatographymassspectrometrymetabolomicsdata AT metzgerboyde mixturemodelnormalizationfornontargetedgaschromatographymassspectrometrymetabolomicsdata AT newgardchristopherb mixturemodelnormalizationfornontargetedgaschromatographymassspectrometrymetabolomicsdata AT lowewilliaml mixturemodelnormalizationfornontargetedgaschromatographymassspectrometrymetabolomicsdata AT scholtensdenisem mixturemodelnormalizationfornontargetedgaschromatographymassspectrometrymetabolomicsdata |