Cargando…
A correction method for systematic error in (1)H-NMR time-course data validated through stochastic cell culture simulation
BACKGROUND: The growing ubiquity of metabolomic techniques has facilitated high frequency time-course data collection for an increasing number of applications. While the concentration trends of individual metabolites can be modeled with common curve fitting techniques, a more accurate representation...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4558828/ https://www.ncbi.nlm.nih.gov/pubmed/26335002 http://dx.doi.org/10.1186/s12918-015-0197-4 |
_version_ | 1782388676485971968 |
---|---|
author | Sokolenko, Stanislav Aucoin, Marc G. |
author_facet | Sokolenko, Stanislav Aucoin, Marc G. |
author_sort | Sokolenko, Stanislav |
collection | PubMed |
description | BACKGROUND: The growing ubiquity of metabolomic techniques has facilitated high frequency time-course data collection for an increasing number of applications. While the concentration trends of individual metabolites can be modeled with common curve fitting techniques, a more accurate representation of the data needs to consider effects that act on more than one metabolite in a given sample. To this end, we present a simple algorithm that uses nonparametric smoothing carried out on all observed metabolites at once to identify and correct systematic error from dilution effects. In addition, we develop a simulation of metabolite concentration time-course trends to supplement available data and explore algorithm performance. Although we focus on nuclear magnetic resonance (NMR) analysis in the context of cell culture, a number of possible extensions are discussed. RESULTS: Realistic metabolic data was successfully simulated using a 4-step process. Starting with a set of metabolite concentration time-courses from a metabolomic experiment, each time-course was classified as either increasing, decreasing, concave, or approximately constant. Trend shapes were simulated from generic functions corresponding to each classification. The resulting shapes were then scaled to simulated compound concentrations. Finally, the scaled trends were perturbed using a combination of random and systematic errors. To detect systematic errors, a nonparametric fit was applied to each trend and percent deviations calculated at every timepoint. Systematic errors could be identified at time-points where the median percent deviation exceeded a threshold value, determined by the choice of smoothing model and the number of observed trends. Regardless of model, increasing the number of observations over a time-course resulted in more accurate error estimates, although the improvement was not particularly large between 10 and 20 samples per trend. The presented algorithm was able to identify systematic errors as small as 2.5 % under a wide range of conditions. CONCLUSION: Both the simulation framework and error correction method represent examples of time-course analysis that can be applied to further developments in (1)H-NMR methodology and the more general application of quantitative metabolomics. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12918-015-0197-4) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4558828 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-45588282015-09-04 A correction method for systematic error in (1)H-NMR time-course data validated through stochastic cell culture simulation Sokolenko, Stanislav Aucoin, Marc G. BMC Syst Biol Methodology Article BACKGROUND: The growing ubiquity of metabolomic techniques has facilitated high frequency time-course data collection for an increasing number of applications. While the concentration trends of individual metabolites can be modeled with common curve fitting techniques, a more accurate representation of the data needs to consider effects that act on more than one metabolite in a given sample. To this end, we present a simple algorithm that uses nonparametric smoothing carried out on all observed metabolites at once to identify and correct systematic error from dilution effects. In addition, we develop a simulation of metabolite concentration time-course trends to supplement available data and explore algorithm performance. Although we focus on nuclear magnetic resonance (NMR) analysis in the context of cell culture, a number of possible extensions are discussed. RESULTS: Realistic metabolic data was successfully simulated using a 4-step process. Starting with a set of metabolite concentration time-courses from a metabolomic experiment, each time-course was classified as either increasing, decreasing, concave, or approximately constant. Trend shapes were simulated from generic functions corresponding to each classification. The resulting shapes were then scaled to simulated compound concentrations. Finally, the scaled trends were perturbed using a combination of random and systematic errors. To detect systematic errors, a nonparametric fit was applied to each trend and percent deviations calculated at every timepoint. Systematic errors could be identified at time-points where the median percent deviation exceeded a threshold value, determined by the choice of smoothing model and the number of observed trends. Regardless of model, increasing the number of observations over a time-course resulted in more accurate error estimates, although the improvement was not particularly large between 10 and 20 samples per trend. The presented algorithm was able to identify systematic errors as small as 2.5 % under a wide range of conditions. CONCLUSION: Both the simulation framework and error correction method represent examples of time-course analysis that can be applied to further developments in (1)H-NMR methodology and the more general application of quantitative metabolomics. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12918-015-0197-4) contains supplementary material, which is available to authorized users. BioMed Central 2015-09-04 /pmc/articles/PMC4558828/ /pubmed/26335002 http://dx.doi.org/10.1186/s12918-015-0197-4 Text en © Sokolenko and Aucoin. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Sokolenko, Stanislav Aucoin, Marc G. A correction method for systematic error in (1)H-NMR time-course data validated through stochastic cell culture simulation |
title | A correction method for systematic error in (1)H-NMR time-course data validated through stochastic cell culture simulation |
title_full | A correction method for systematic error in (1)H-NMR time-course data validated through stochastic cell culture simulation |
title_fullStr | A correction method for systematic error in (1)H-NMR time-course data validated through stochastic cell culture simulation |
title_full_unstemmed | A correction method for systematic error in (1)H-NMR time-course data validated through stochastic cell culture simulation |
title_short | A correction method for systematic error in (1)H-NMR time-course data validated through stochastic cell culture simulation |
title_sort | correction method for systematic error in (1)h-nmr time-course data validated through stochastic cell culture simulation |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4558828/ https://www.ncbi.nlm.nih.gov/pubmed/26335002 http://dx.doi.org/10.1186/s12918-015-0197-4 |
work_keys_str_mv | AT sokolenkostanislav acorrectionmethodforsystematicerrorin1hnmrtimecoursedatavalidatedthroughstochasticcellculturesimulation AT aucoinmarcg acorrectionmethodforsystematicerrorin1hnmrtimecoursedatavalidatedthroughstochasticcellculturesimulation AT sokolenkostanislav correctionmethodforsystematicerrorin1hnmrtimecoursedatavalidatedthroughstochasticcellculturesimulation AT aucoinmarcg correctionmethodforsystematicerrorin1hnmrtimecoursedatavalidatedthroughstochasticcellculturesimulation |