Cargando…

Importance of replication in analyzing time-series gene expression data: Corticosteroid dynamics and circadian patterns in rat liver

BACKGROUND: Microarray technology is a powerful and widely accepted experimental technique in molecular biology that allows studying genome wide transcriptional responses. However, experimental data usually contain potential sources of uncertainty and thus many experiments are now designed with repe...

Descripción completa

Detalles Bibliográficos
Autores principales: Nguyen, Tung T, Almon, Richard R, DuBois, Debra C, Jusko, William J, Androulakis, Ioannis P
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2889936/
https://www.ncbi.nlm.nih.gov/pubmed/20500897
http://dx.doi.org/10.1186/1471-2105-11-279
_version_ 1782182739575832576
author Nguyen, Tung T
Almon, Richard R
DuBois, Debra C
Jusko, William J
Androulakis, Ioannis P
author_facet Nguyen, Tung T
Almon, Richard R
DuBois, Debra C
Jusko, William J
Androulakis, Ioannis P
author_sort Nguyen, Tung T
collection PubMed
description BACKGROUND: Microarray technology is a powerful and widely accepted experimental technique in molecular biology that allows studying genome wide transcriptional responses. However, experimental data usually contain potential sources of uncertainty and thus many experiments are now designed with repeated measurements to better assess such inherent variability. Many computational methods have been proposed to account for the variability in replicates. As yet, there is no model to output expression profiles accounting for replicate information so that a variety of computational models that take the expression profiles as the input data can explore this information without any modification. RESULTS: We propose a methodology which integrates replicate variability into expression profiles, to generate so-called 'true' expression profiles. The study addresses two issues: (i) develop a statistical model that can estimate 'true' expression profiles which are more robust than the average profile, and (ii) extend our previous micro-clustering which was designed specifically for clustering time-series expression data. The model utilizes a previously proposed error model and the concept of 'relative difference'. The clustering effectiveness is demonstrated through synthetic data where several methods are compared. We subsequently analyze in vivo rat data to elucidate circadian transcriptional dynamics as well as liver-specific corticosteroid induced changes in gene expression. CONCLUSIONS: We have proposed a model which integrates the error information from repeated measurements into the expression profiles. Through numerous synthetic and real time-series data, we demonstrated the ability of the approach to improve the clustering performance and assist in the identification and selection of informative expression motifs.
format Text
id pubmed-2889936
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28899362010-06-23 Importance of replication in analyzing time-series gene expression data: Corticosteroid dynamics and circadian patterns in rat liver Nguyen, Tung T Almon, Richard R DuBois, Debra C Jusko, William J Androulakis, Ioannis P BMC Bioinformatics Methodology article BACKGROUND: Microarray technology is a powerful and widely accepted experimental technique in molecular biology that allows studying genome wide transcriptional responses. However, experimental data usually contain potential sources of uncertainty and thus many experiments are now designed with repeated measurements to better assess such inherent variability. Many computational methods have been proposed to account for the variability in replicates. As yet, there is no model to output expression profiles accounting for replicate information so that a variety of computational models that take the expression profiles as the input data can explore this information without any modification. RESULTS: We propose a methodology which integrates replicate variability into expression profiles, to generate so-called 'true' expression profiles. The study addresses two issues: (i) develop a statistical model that can estimate 'true' expression profiles which are more robust than the average profile, and (ii) extend our previous micro-clustering which was designed specifically for clustering time-series expression data. The model utilizes a previously proposed error model and the concept of 'relative difference'. The clustering effectiveness is demonstrated through synthetic data where several methods are compared. We subsequently analyze in vivo rat data to elucidate circadian transcriptional dynamics as well as liver-specific corticosteroid induced changes in gene expression. CONCLUSIONS: We have proposed a model which integrates the error information from repeated measurements into the expression profiles. Through numerous synthetic and real time-series data, we demonstrated the ability of the approach to improve the clustering performance and assist in the identification and selection of informative expression motifs. BioMed Central 2010-05-26 /pmc/articles/PMC2889936/ /pubmed/20500897 http://dx.doi.org/10.1186/1471-2105-11-279 Text en Copyright ©2010 Nguyen et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology article
Nguyen, Tung T
Almon, Richard R
DuBois, Debra C
Jusko, William J
Androulakis, Ioannis P
Importance of replication in analyzing time-series gene expression data: Corticosteroid dynamics and circadian patterns in rat liver
title Importance of replication in analyzing time-series gene expression data: Corticosteroid dynamics and circadian patterns in rat liver
title_full Importance of replication in analyzing time-series gene expression data: Corticosteroid dynamics and circadian patterns in rat liver
title_fullStr Importance of replication in analyzing time-series gene expression data: Corticosteroid dynamics and circadian patterns in rat liver
title_full_unstemmed Importance of replication in analyzing time-series gene expression data: Corticosteroid dynamics and circadian patterns in rat liver
title_short Importance of replication in analyzing time-series gene expression data: Corticosteroid dynamics and circadian patterns in rat liver
title_sort importance of replication in analyzing time-series gene expression data: corticosteroid dynamics and circadian patterns in rat liver
topic Methodology article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2889936/
https://www.ncbi.nlm.nih.gov/pubmed/20500897
http://dx.doi.org/10.1186/1471-2105-11-279
work_keys_str_mv AT nguyentungt importanceofreplicationinanalyzingtimeseriesgeneexpressiondatacorticosteroiddynamicsandcircadianpatternsinratliver
AT almonrichardr importanceofreplicationinanalyzingtimeseriesgeneexpressiondatacorticosteroiddynamicsandcircadianpatternsinratliver
AT duboisdebrac importanceofreplicationinanalyzingtimeseriesgeneexpressiondatacorticosteroiddynamicsandcircadianpatternsinratliver
AT juskowilliamj importanceofreplicationinanalyzingtimeseriesgeneexpressiondatacorticosteroiddynamicsandcircadianpatternsinratliver
AT androulakisioannisp importanceofreplicationinanalyzingtimeseriesgeneexpressiondatacorticosteroiddynamicsandcircadianpatternsinratliver