Cargando…

Statistical Inference Methods for Sparse Biological Time Series Data

BACKGROUND: Comparing metabolic profiles under different biological perturbations has become a powerful approach to investigating the functioning of cells. The profiles can be taken as single snapshots of a system, but more information is gained if they are measured longitudinally over time. The res...

Descripción completa

Detalles Bibliográficos
Autores principales: Ndukum, Juliet, Fonseca, Luís L, Santos, Helena, Voit, Eberhard O, Datta, Susmita
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3114728/
https://www.ncbi.nlm.nih.gov/pubmed/21518445
http://dx.doi.org/10.1186/1752-0509-5-57
_version_ 1782206100101136384
author Ndukum, Juliet
Fonseca, Luís L
Santos, Helena
Voit, Eberhard O
Datta, Susmita
author_facet Ndukum, Juliet
Fonseca, Luís L
Santos, Helena
Voit, Eberhard O
Datta, Susmita
author_sort Ndukum, Juliet
collection PubMed
description BACKGROUND: Comparing metabolic profiles under different biological perturbations has become a powerful approach to investigating the functioning of cells. The profiles can be taken as single snapshots of a system, but more information is gained if they are measured longitudinally over time. The results are short time series consisting of relatively sparse data that cannot be analyzed effectively with standard time series techniques, such as autocorrelation and frequency domain methods. In this work, we study longitudinal time series profiles of glucose consumption in the yeast Saccharomyces cerevisiae under different temperatures and preconditioning regimens, which we obtained with methods of in vivo nuclear magnetic resonance (NMR) spectroscopy. For the statistical analysis we first fit several nonlinear mixed effect regression models to the longitudinal profiles and then used an ANOVA likelihood ratio method in order to test for significant differences between the profiles. RESULTS: The proposed methods are capable of distinguishing metabolic time trends resulting from different treatments and associate significance levels to these differences. Among several nonlinear mixed-effects regression models tested, a three-parameter logistic function represents the data with highest accuracy. ANOVA and likelihood ratio tests suggest that there are significant differences between the glucose consumption rate profiles for cells that had been--or had not been--preconditioned by heat during growth. Furthermore, pair-wise t-tests reveal significant differences in the longitudinal profiles for glucose consumption rates between optimal conditions and heat stress, optimal and recovery conditions, and heat stress and recovery conditions (p-values <0.0001). CONCLUSION: We have developed a nonlinear mixed effects model that is appropriate for the analysis of sparse metabolic and physiological time profiles. The model permits sound statistical inference procedures, based on ANOVA likelihood ratio tests, for testing the significance of differences between short time course data under different biological perturbations.
format Online
Article
Text
id pubmed-3114728
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31147282011-06-15 Statistical Inference Methods for Sparse Biological Time Series Data Ndukum, Juliet Fonseca, Luís L Santos, Helena Voit, Eberhard O Datta, Susmita BMC Syst Biol Methodology Article BACKGROUND: Comparing metabolic profiles under different biological perturbations has become a powerful approach to investigating the functioning of cells. The profiles can be taken as single snapshots of a system, but more information is gained if they are measured longitudinally over time. The results are short time series consisting of relatively sparse data that cannot be analyzed effectively with standard time series techniques, such as autocorrelation and frequency domain methods. In this work, we study longitudinal time series profiles of glucose consumption in the yeast Saccharomyces cerevisiae under different temperatures and preconditioning regimens, which we obtained with methods of in vivo nuclear magnetic resonance (NMR) spectroscopy. For the statistical analysis we first fit several nonlinear mixed effect regression models to the longitudinal profiles and then used an ANOVA likelihood ratio method in order to test for significant differences between the profiles. RESULTS: The proposed methods are capable of distinguishing metabolic time trends resulting from different treatments and associate significance levels to these differences. Among several nonlinear mixed-effects regression models tested, a three-parameter logistic function represents the data with highest accuracy. ANOVA and likelihood ratio tests suggest that there are significant differences between the glucose consumption rate profiles for cells that had been--or had not been--preconditioned by heat during growth. Furthermore, pair-wise t-tests reveal significant differences in the longitudinal profiles for glucose consumption rates between optimal conditions and heat stress, optimal and recovery conditions, and heat stress and recovery conditions (p-values <0.0001). CONCLUSION: We have developed a nonlinear mixed effects model that is appropriate for the analysis of sparse metabolic and physiological time profiles. The model permits sound statistical inference procedures, based on ANOVA likelihood ratio tests, for testing the significance of differences between short time course data under different biological perturbations. BioMed Central 2011-04-25 /pmc/articles/PMC3114728/ /pubmed/21518445 http://dx.doi.org/10.1186/1752-0509-5-57 Text en Copyright ©2011 Ndukum et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Ndukum, Juliet
Fonseca, Luís L
Santos, Helena
Voit, Eberhard O
Datta, Susmita
Statistical Inference Methods for Sparse Biological Time Series Data
title Statistical Inference Methods for Sparse Biological Time Series Data
title_full Statistical Inference Methods for Sparse Biological Time Series Data
title_fullStr Statistical Inference Methods for Sparse Biological Time Series Data
title_full_unstemmed Statistical Inference Methods for Sparse Biological Time Series Data
title_short Statistical Inference Methods for Sparse Biological Time Series Data
title_sort statistical inference methods for sparse biological time series data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3114728/
https://www.ncbi.nlm.nih.gov/pubmed/21518445
http://dx.doi.org/10.1186/1752-0509-5-57
work_keys_str_mv AT ndukumjuliet statisticalinferencemethodsforsparsebiologicaltimeseriesdata
AT fonsecaluisl statisticalinferencemethodsforsparsebiologicaltimeseriesdata
AT santoshelena statisticalinferencemethodsforsparsebiologicaltimeseriesdata
AT voiteberhardo statisticalinferencemethodsforsparsebiologicaltimeseriesdata
AT dattasusmita statisticalinferencemethodsforsparsebiologicaltimeseriesdata