Cargando…
Importance of replication in analyzing time-series gene expression data: Corticosteroid dynamics and circadian patterns in rat liver
BACKGROUND: Microarray technology is a powerful and widely accepted experimental technique in molecular biology that allows studying genome wide transcriptional responses. However, experimental data usually contain potential sources of uncertainty and thus many experiments are now designed with repe...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2889936/ https://www.ncbi.nlm.nih.gov/pubmed/20500897 http://dx.doi.org/10.1186/1471-2105-11-279 |
_version_ | 1782182739575832576 |
---|---|
author | Nguyen, Tung T Almon, Richard R DuBois, Debra C Jusko, William J Androulakis, Ioannis P |
author_facet | Nguyen, Tung T Almon, Richard R DuBois, Debra C Jusko, William J Androulakis, Ioannis P |
author_sort | Nguyen, Tung T |
collection | PubMed |
description | BACKGROUND: Microarray technology is a powerful and widely accepted experimental technique in molecular biology that allows studying genome wide transcriptional responses. However, experimental data usually contain potential sources of uncertainty and thus many experiments are now designed with repeated measurements to better assess such inherent variability. Many computational methods have been proposed to account for the variability in replicates. As yet, there is no model to output expression profiles accounting for replicate information so that a variety of computational models that take the expression profiles as the input data can explore this information without any modification. RESULTS: We propose a methodology which integrates replicate variability into expression profiles, to generate so-called 'true' expression profiles. The study addresses two issues: (i) develop a statistical model that can estimate 'true' expression profiles which are more robust than the average profile, and (ii) extend our previous micro-clustering which was designed specifically for clustering time-series expression data. The model utilizes a previously proposed error model and the concept of 'relative difference'. The clustering effectiveness is demonstrated through synthetic data where several methods are compared. We subsequently analyze in vivo rat data to elucidate circadian transcriptional dynamics as well as liver-specific corticosteroid induced changes in gene expression. CONCLUSIONS: We have proposed a model which integrates the error information from repeated measurements into the expression profiles. Through numerous synthetic and real time-series data, we demonstrated the ability of the approach to improve the clustering performance and assist in the identification and selection of informative expression motifs. |
format | Text |
id | pubmed-2889936 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-28899362010-06-23 Importance of replication in analyzing time-series gene expression data: Corticosteroid dynamics and circadian patterns in rat liver Nguyen, Tung T Almon, Richard R DuBois, Debra C Jusko, William J Androulakis, Ioannis P BMC Bioinformatics Methodology article BACKGROUND: Microarray technology is a powerful and widely accepted experimental technique in molecular biology that allows studying genome wide transcriptional responses. However, experimental data usually contain potential sources of uncertainty and thus many experiments are now designed with repeated measurements to better assess such inherent variability. Many computational methods have been proposed to account for the variability in replicates. As yet, there is no model to output expression profiles accounting for replicate information so that a variety of computational models that take the expression profiles as the input data can explore this information without any modification. RESULTS: We propose a methodology which integrates replicate variability into expression profiles, to generate so-called 'true' expression profiles. The study addresses two issues: (i) develop a statistical model that can estimate 'true' expression profiles which are more robust than the average profile, and (ii) extend our previous micro-clustering which was designed specifically for clustering time-series expression data. The model utilizes a previously proposed error model and the concept of 'relative difference'. The clustering effectiveness is demonstrated through synthetic data where several methods are compared. We subsequently analyze in vivo rat data to elucidate circadian transcriptional dynamics as well as liver-specific corticosteroid induced changes in gene expression. CONCLUSIONS: We have proposed a model which integrates the error information from repeated measurements into the expression profiles. Through numerous synthetic and real time-series data, we demonstrated the ability of the approach to improve the clustering performance and assist in the identification and selection of informative expression motifs. BioMed Central 2010-05-26 /pmc/articles/PMC2889936/ /pubmed/20500897 http://dx.doi.org/10.1186/1471-2105-11-279 Text en Copyright ©2010 Nguyen et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology article Nguyen, Tung T Almon, Richard R DuBois, Debra C Jusko, William J Androulakis, Ioannis P Importance of replication in analyzing time-series gene expression data: Corticosteroid dynamics and circadian patterns in rat liver |
title | Importance of replication in analyzing time-series gene expression data: Corticosteroid dynamics and circadian patterns in rat liver |
title_full | Importance of replication in analyzing time-series gene expression data: Corticosteroid dynamics and circadian patterns in rat liver |
title_fullStr | Importance of replication in analyzing time-series gene expression data: Corticosteroid dynamics and circadian patterns in rat liver |
title_full_unstemmed | Importance of replication in analyzing time-series gene expression data: Corticosteroid dynamics and circadian patterns in rat liver |
title_short | Importance of replication in analyzing time-series gene expression data: Corticosteroid dynamics and circadian patterns in rat liver |
title_sort | importance of replication in analyzing time-series gene expression data: corticosteroid dynamics and circadian patterns in rat liver |
topic | Methodology article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2889936/ https://www.ncbi.nlm.nih.gov/pubmed/20500897 http://dx.doi.org/10.1186/1471-2105-11-279 |
work_keys_str_mv | AT nguyentungt importanceofreplicationinanalyzingtimeseriesgeneexpressiondatacorticosteroiddynamicsandcircadianpatternsinratliver AT almonrichardr importanceofreplicationinanalyzingtimeseriesgeneexpressiondatacorticosteroiddynamicsandcircadianpatternsinratliver AT duboisdebrac importanceofreplicationinanalyzingtimeseriesgeneexpressiondatacorticosteroiddynamicsandcircadianpatternsinratliver AT juskowilliamj importanceofreplicationinanalyzingtimeseriesgeneexpressiondatacorticosteroiddynamicsandcircadianpatternsinratliver AT androulakisioannisp importanceofreplicationinanalyzingtimeseriesgeneexpressiondatacorticosteroiddynamicsandcircadianpatternsinratliver |