Cargando…
Statistical Inference Methods for Sparse Biological Time Series Data
BACKGROUND: Comparing metabolic profiles under different biological perturbations has become a powerful approach to investigating the functioning of cells. The profiles can be taken as single snapshots of a system, but more information is gained if they are measured longitudinally over time. The res...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3114728/ https://www.ncbi.nlm.nih.gov/pubmed/21518445 http://dx.doi.org/10.1186/1752-0509-5-57 |
_version_ | 1782206100101136384 |
---|---|
author | Ndukum, Juliet Fonseca, Luís L Santos, Helena Voit, Eberhard O Datta, Susmita |
author_facet | Ndukum, Juliet Fonseca, Luís L Santos, Helena Voit, Eberhard O Datta, Susmita |
author_sort | Ndukum, Juliet |
collection | PubMed |
description | BACKGROUND: Comparing metabolic profiles under different biological perturbations has become a powerful approach to investigating the functioning of cells. The profiles can be taken as single snapshots of a system, but more information is gained if they are measured longitudinally over time. The results are short time series consisting of relatively sparse data that cannot be analyzed effectively with standard time series techniques, such as autocorrelation and frequency domain methods. In this work, we study longitudinal time series profiles of glucose consumption in the yeast Saccharomyces cerevisiae under different temperatures and preconditioning regimens, which we obtained with methods of in vivo nuclear magnetic resonance (NMR) spectroscopy. For the statistical analysis we first fit several nonlinear mixed effect regression models to the longitudinal profiles and then used an ANOVA likelihood ratio method in order to test for significant differences between the profiles. RESULTS: The proposed methods are capable of distinguishing metabolic time trends resulting from different treatments and associate significance levels to these differences. Among several nonlinear mixed-effects regression models tested, a three-parameter logistic function represents the data with highest accuracy. ANOVA and likelihood ratio tests suggest that there are significant differences between the glucose consumption rate profiles for cells that had been--or had not been--preconditioned by heat during growth. Furthermore, pair-wise t-tests reveal significant differences in the longitudinal profiles for glucose consumption rates between optimal conditions and heat stress, optimal and recovery conditions, and heat stress and recovery conditions (p-values <0.0001). CONCLUSION: We have developed a nonlinear mixed effects model that is appropriate for the analysis of sparse metabolic and physiological time profiles. The model permits sound statistical inference procedures, based on ANOVA likelihood ratio tests, for testing the significance of differences between short time course data under different biological perturbations. |
format | Online Article Text |
id | pubmed-3114728 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-31147282011-06-15 Statistical Inference Methods for Sparse Biological Time Series Data Ndukum, Juliet Fonseca, Luís L Santos, Helena Voit, Eberhard O Datta, Susmita BMC Syst Biol Methodology Article BACKGROUND: Comparing metabolic profiles under different biological perturbations has become a powerful approach to investigating the functioning of cells. The profiles can be taken as single snapshots of a system, but more information is gained if they are measured longitudinally over time. The results are short time series consisting of relatively sparse data that cannot be analyzed effectively with standard time series techniques, such as autocorrelation and frequency domain methods. In this work, we study longitudinal time series profiles of glucose consumption in the yeast Saccharomyces cerevisiae under different temperatures and preconditioning regimens, which we obtained with methods of in vivo nuclear magnetic resonance (NMR) spectroscopy. For the statistical analysis we first fit several nonlinear mixed effect regression models to the longitudinal profiles and then used an ANOVA likelihood ratio method in order to test for significant differences between the profiles. RESULTS: The proposed methods are capable of distinguishing metabolic time trends resulting from different treatments and associate significance levels to these differences. Among several nonlinear mixed-effects regression models tested, a three-parameter logistic function represents the data with highest accuracy. ANOVA and likelihood ratio tests suggest that there are significant differences between the glucose consumption rate profiles for cells that had been--or had not been--preconditioned by heat during growth. Furthermore, pair-wise t-tests reveal significant differences in the longitudinal profiles for glucose consumption rates between optimal conditions and heat stress, optimal and recovery conditions, and heat stress and recovery conditions (p-values <0.0001). CONCLUSION: We have developed a nonlinear mixed effects model that is appropriate for the analysis of sparse metabolic and physiological time profiles. The model permits sound statistical inference procedures, based on ANOVA likelihood ratio tests, for testing the significance of differences between short time course data under different biological perturbations. BioMed Central 2011-04-25 /pmc/articles/PMC3114728/ /pubmed/21518445 http://dx.doi.org/10.1186/1752-0509-5-57 Text en Copyright ©2011 Ndukum et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Ndukum, Juliet Fonseca, Luís L Santos, Helena Voit, Eberhard O Datta, Susmita Statistical Inference Methods for Sparse Biological Time Series Data |
title | Statistical Inference Methods for Sparse Biological Time Series Data |
title_full | Statistical Inference Methods for Sparse Biological Time Series Data |
title_fullStr | Statistical Inference Methods for Sparse Biological Time Series Data |
title_full_unstemmed | Statistical Inference Methods for Sparse Biological Time Series Data |
title_short | Statistical Inference Methods for Sparse Biological Time Series Data |
title_sort | statistical inference methods for sparse biological time series data |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3114728/ https://www.ncbi.nlm.nih.gov/pubmed/21518445 http://dx.doi.org/10.1186/1752-0509-5-57 |
work_keys_str_mv | AT ndukumjuliet statisticalinferencemethodsforsparsebiologicaltimeseriesdata AT fonsecaluisl statisticalinferencemethodsforsparsebiologicaltimeseriesdata AT santoshelena statisticalinferencemethodsforsparsebiologicaltimeseriesdata AT voiteberhardo statisticalinferencemethodsforsparsebiologicaltimeseriesdata AT dattasusmita statisticalinferencemethodsforsparsebiologicaltimeseriesdata |