Cargando…

Significance analysis of microarray transcript levels in time series experiments

BACKGROUND: Microarray time series studies are essential to understand the dynamics of molecular events. In order to limit the analysis to those genes that change expression over time, a first necessary step is to select differentially expressed transcripts. A variety of methods have been proposed t...

Descripción completa

Detalles Bibliográficos
Autores principales: Di Camillo, Barbara, Toffolo, Gianna, Nair, Sreekumaran K, Greenlund, Laura J, Cobelli, Claudio
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1885839/
https://www.ncbi.nlm.nih.gov/pubmed/17430554
http://dx.doi.org/10.1186/1471-2105-8-S1-S10
_version_ 1782133653275410432
author Di Camillo, Barbara
Toffolo, Gianna
Nair, Sreekumaran K
Greenlund, Laura J
Cobelli, Claudio
author_facet Di Camillo, Barbara
Toffolo, Gianna
Nair, Sreekumaran K
Greenlund, Laura J
Cobelli, Claudio
author_sort Di Camillo, Barbara
collection PubMed
description BACKGROUND: Microarray time series studies are essential to understand the dynamics of molecular events. In order to limit the analysis to those genes that change expression over time, a first necessary step is to select differentially expressed transcripts. A variety of methods have been proposed to this purpose; however, these methods are seldom applicable in practice since they require a large number of replicates, often available only for a limited number of samples. In this data-poor context, we evaluate the performance of three selection methods, using synthetic data, over a range of experimental conditions. Application to real data is also discussed. RESULTS: Three methods are considered, to assess differentially expressed genes in data-poor conditions. Method 1 uses a threshold on individual samples based on a model of the experimental error. Method 2 calculates the area of the region bounded by the time series expression profiles, and considers the gene differentially expressed if the area exceeds a threshold based on a model of the experimental error. These two methods are compared to Method 3, recently proposed in the literature, which exploits splines fit to compare time series profiles. Application of the three methods to synthetic data indicates that Method 2 outperforms the other two both in Precision and Recall when short time series are analyzed, while Method 3 outperforms the other two for long time series. CONCLUSION: These results help to address the choice of the algorithm to be used in data-poor time series expression study, depending on the length of the time series.
format Text
id pubmed-1885839
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-18858392007-06-05 Significance analysis of microarray transcript levels in time series experiments Di Camillo, Barbara Toffolo, Gianna Nair, Sreekumaran K Greenlund, Laura J Cobelli, Claudio BMC Bioinformatics Research BACKGROUND: Microarray time series studies are essential to understand the dynamics of molecular events. In order to limit the analysis to those genes that change expression over time, a first necessary step is to select differentially expressed transcripts. A variety of methods have been proposed to this purpose; however, these methods are seldom applicable in practice since they require a large number of replicates, often available only for a limited number of samples. In this data-poor context, we evaluate the performance of three selection methods, using synthetic data, over a range of experimental conditions. Application to real data is also discussed. RESULTS: Three methods are considered, to assess differentially expressed genes in data-poor conditions. Method 1 uses a threshold on individual samples based on a model of the experimental error. Method 2 calculates the area of the region bounded by the time series expression profiles, and considers the gene differentially expressed if the area exceeds a threshold based on a model of the experimental error. These two methods are compared to Method 3, recently proposed in the literature, which exploits splines fit to compare time series profiles. Application of the three methods to synthetic data indicates that Method 2 outperforms the other two both in Precision and Recall when short time series are analyzed, while Method 3 outperforms the other two for long time series. CONCLUSION: These results help to address the choice of the algorithm to be used in data-poor time series expression study, depending on the length of the time series. BioMed Central 2007-03-08 /pmc/articles/PMC1885839/ /pubmed/17430554 http://dx.doi.org/10.1186/1471-2105-8-S1-S10 Text en Copyright © 2007 Di Camillo et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Di Camillo, Barbara
Toffolo, Gianna
Nair, Sreekumaran K
Greenlund, Laura J
Cobelli, Claudio
Significance analysis of microarray transcript levels in time series experiments
title Significance analysis of microarray transcript levels in time series experiments
title_full Significance analysis of microarray transcript levels in time series experiments
title_fullStr Significance analysis of microarray transcript levels in time series experiments
title_full_unstemmed Significance analysis of microarray transcript levels in time series experiments
title_short Significance analysis of microarray transcript levels in time series experiments
title_sort significance analysis of microarray transcript levels in time series experiments
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1885839/
https://www.ncbi.nlm.nih.gov/pubmed/17430554
http://dx.doi.org/10.1186/1471-2105-8-S1-S10
work_keys_str_mv AT dicamillobarbara significanceanalysisofmicroarraytranscriptlevelsintimeseriesexperiments
AT toffologianna significanceanalysisofmicroarraytranscriptlevelsintimeseriesexperiments
AT nairsreekumarank significanceanalysisofmicroarraytranscriptlevelsintimeseriesexperiments
AT greenlundlauraj significanceanalysisofmicroarraytranscriptlevelsintimeseriesexperiments
AT cobelliclaudio significanceanalysisofmicroarraytranscriptlevelsintimeseriesexperiments