Cargando…

A correction method for systematic error in (1)H-NMR time-course data validated through stochastic cell culture simulation

BACKGROUND: The growing ubiquity of metabolomic techniques has facilitated high frequency time-course data collection for an increasing number of applications. While the concentration trends of individual metabolites can be modeled with common curve fitting techniques, a more accurate representation...

Descripción completa

Detalles Bibliográficos
Autores principales: Sokolenko, Stanislav, Aucoin, Marc G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4558828/
https://www.ncbi.nlm.nih.gov/pubmed/26335002
http://dx.doi.org/10.1186/s12918-015-0197-4
_version_ 1782388676485971968
author Sokolenko, Stanislav
Aucoin, Marc G.
author_facet Sokolenko, Stanislav
Aucoin, Marc G.
author_sort Sokolenko, Stanislav
collection PubMed
description BACKGROUND: The growing ubiquity of metabolomic techniques has facilitated high frequency time-course data collection for an increasing number of applications. While the concentration trends of individual metabolites can be modeled with common curve fitting techniques, a more accurate representation of the data needs to consider effects that act on more than one metabolite in a given sample. To this end, we present a simple algorithm that uses nonparametric smoothing carried out on all observed metabolites at once to identify and correct systematic error from dilution effects. In addition, we develop a simulation of metabolite concentration time-course trends to supplement available data and explore algorithm performance. Although we focus on nuclear magnetic resonance (NMR) analysis in the context of cell culture, a number of possible extensions are discussed. RESULTS: Realistic metabolic data was successfully simulated using a 4-step process. Starting with a set of metabolite concentration time-courses from a metabolomic experiment, each time-course was classified as either increasing, decreasing, concave, or approximately constant. Trend shapes were simulated from generic functions corresponding to each classification. The resulting shapes were then scaled to simulated compound concentrations. Finally, the scaled trends were perturbed using a combination of random and systematic errors. To detect systematic errors, a nonparametric fit was applied to each trend and percent deviations calculated at every timepoint. Systematic errors could be identified at time-points where the median percent deviation exceeded a threshold value, determined by the choice of smoothing model and the number of observed trends. Regardless of model, increasing the number of observations over a time-course resulted in more accurate error estimates, although the improvement was not particularly large between 10 and 20 samples per trend. The presented algorithm was able to identify systematic errors as small as 2.5 % under a wide range of conditions. CONCLUSION: Both the simulation framework and error correction method represent examples of time-course analysis that can be applied to further developments in (1)H-NMR methodology and the more general application of quantitative metabolomics. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12918-015-0197-4) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4558828
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-45588282015-09-04 A correction method for systematic error in (1)H-NMR time-course data validated through stochastic cell culture simulation Sokolenko, Stanislav Aucoin, Marc G. BMC Syst Biol Methodology Article BACKGROUND: The growing ubiquity of metabolomic techniques has facilitated high frequency time-course data collection for an increasing number of applications. While the concentration trends of individual metabolites can be modeled with common curve fitting techniques, a more accurate representation of the data needs to consider effects that act on more than one metabolite in a given sample. To this end, we present a simple algorithm that uses nonparametric smoothing carried out on all observed metabolites at once to identify and correct systematic error from dilution effects. In addition, we develop a simulation of metabolite concentration time-course trends to supplement available data and explore algorithm performance. Although we focus on nuclear magnetic resonance (NMR) analysis in the context of cell culture, a number of possible extensions are discussed. RESULTS: Realistic metabolic data was successfully simulated using a 4-step process. Starting with a set of metabolite concentration time-courses from a metabolomic experiment, each time-course was classified as either increasing, decreasing, concave, or approximately constant. Trend shapes were simulated from generic functions corresponding to each classification. The resulting shapes were then scaled to simulated compound concentrations. Finally, the scaled trends were perturbed using a combination of random and systematic errors. To detect systematic errors, a nonparametric fit was applied to each trend and percent deviations calculated at every timepoint. Systematic errors could be identified at time-points where the median percent deviation exceeded a threshold value, determined by the choice of smoothing model and the number of observed trends. Regardless of model, increasing the number of observations over a time-course resulted in more accurate error estimates, although the improvement was not particularly large between 10 and 20 samples per trend. The presented algorithm was able to identify systematic errors as small as 2.5 % under a wide range of conditions. CONCLUSION: Both the simulation framework and error correction method represent examples of time-course analysis that can be applied to further developments in (1)H-NMR methodology and the more general application of quantitative metabolomics. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12918-015-0197-4) contains supplementary material, which is available to authorized users. BioMed Central 2015-09-04 /pmc/articles/PMC4558828/ /pubmed/26335002 http://dx.doi.org/10.1186/s12918-015-0197-4 Text en © Sokolenko and Aucoin. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Sokolenko, Stanislav
Aucoin, Marc G.
A correction method for systematic error in (1)H-NMR time-course data validated through stochastic cell culture simulation
title A correction method for systematic error in (1)H-NMR time-course data validated through stochastic cell culture simulation
title_full A correction method for systematic error in (1)H-NMR time-course data validated through stochastic cell culture simulation
title_fullStr A correction method for systematic error in (1)H-NMR time-course data validated through stochastic cell culture simulation
title_full_unstemmed A correction method for systematic error in (1)H-NMR time-course data validated through stochastic cell culture simulation
title_short A correction method for systematic error in (1)H-NMR time-course data validated through stochastic cell culture simulation
title_sort correction method for systematic error in (1)h-nmr time-course data validated through stochastic cell culture simulation
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4558828/
https://www.ncbi.nlm.nih.gov/pubmed/26335002
http://dx.doi.org/10.1186/s12918-015-0197-4
work_keys_str_mv AT sokolenkostanislav acorrectionmethodforsystematicerrorin1hnmrtimecoursedatavalidatedthroughstochasticcellculturesimulation
AT aucoinmarcg acorrectionmethodforsystematicerrorin1hnmrtimecoursedatavalidatedthroughstochasticcellculturesimulation
AT sokolenkostanislav correctionmethodforsystematicerrorin1hnmrtimecoursedatavalidatedthroughstochasticcellculturesimulation
AT aucoinmarcg correctionmethodforsystematicerrorin1hnmrtimecoursedatavalidatedthroughstochasticcellculturesimulation