Cargando…
Improving the accuracy of expression data analysis in time course experiments using resampling
BACKGROUND: As time series experiments in higher eukaryotes usually obtain data from different individuals collected at the different time points, a time series sample itself is not equivalent to a true biological replicate but is, rather, a combination of several biological replicates. The analysis...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4220062/ https://www.ncbi.nlm.nih.gov/pubmed/25344112 http://dx.doi.org/10.1186/s12859-014-0352-8 |
_version_ | 1782342687362383872 |
---|---|
author | Walter, Wencke Striberny, Bernd Gaquerel, Emmanuel Baldwin, Ian T Kim, Sang-Gyu Heiland, Ines |
author_facet | Walter, Wencke Striberny, Bernd Gaquerel, Emmanuel Baldwin, Ian T Kim, Sang-Gyu Heiland, Ines |
author_sort | Walter, Wencke |
collection | PubMed |
description | BACKGROUND: As time series experiments in higher eukaryotes usually obtain data from different individuals collected at the different time points, a time series sample itself is not equivalent to a true biological replicate but is, rather, a combination of several biological replicates. The analysis of expression data derived from a time series sample is therefore often performed with a low number of replicates due to budget limitations or limitations in sample availability. In addition, most algorithms developed to identify specific patterns in time series dataset do not consider biological variation in samples collected at the same conditions. RESULTS: Using artificial time course datasets, we show that resampling considerably improves the accuracy of transcripts identified as rhythmic. In particular, the number of false positives can be greatly reduced while at the same time the number of true positives can be maintained in the range of other methods currently used to determine rhythmically expressed genes. CONCLUSIONS: The resampling approach described here therefore increases the accuracy of time series expression data analysis and furthermore emphasizes the importance of biological replicates in identifying oscillating genes. Resampling can be used for any time series expression dataset as long as the samples are acquired from independent individuals at each time point. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-014-0352-8) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4220062 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-42200622014-11-07 Improving the accuracy of expression data analysis in time course experiments using resampling Walter, Wencke Striberny, Bernd Gaquerel, Emmanuel Baldwin, Ian T Kim, Sang-Gyu Heiland, Ines BMC Bioinformatics Methodology Article BACKGROUND: As time series experiments in higher eukaryotes usually obtain data from different individuals collected at the different time points, a time series sample itself is not equivalent to a true biological replicate but is, rather, a combination of several biological replicates. The analysis of expression data derived from a time series sample is therefore often performed with a low number of replicates due to budget limitations or limitations in sample availability. In addition, most algorithms developed to identify specific patterns in time series dataset do not consider biological variation in samples collected at the same conditions. RESULTS: Using artificial time course datasets, we show that resampling considerably improves the accuracy of transcripts identified as rhythmic. In particular, the number of false positives can be greatly reduced while at the same time the number of true positives can be maintained in the range of other methods currently used to determine rhythmically expressed genes. CONCLUSIONS: The resampling approach described here therefore increases the accuracy of time series expression data analysis and furthermore emphasizes the importance of biological replicates in identifying oscillating genes. Resampling can be used for any time series expression dataset as long as the samples are acquired from independent individuals at each time point. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-014-0352-8) contains supplementary material, which is available to authorized users. BioMed Central 2014-10-25 /pmc/articles/PMC4220062/ /pubmed/25344112 http://dx.doi.org/10.1186/s12859-014-0352-8 Text en © Walter et al.; licensee BioMed Central Ltd. 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Walter, Wencke Striberny, Bernd Gaquerel, Emmanuel Baldwin, Ian T Kim, Sang-Gyu Heiland, Ines Improving the accuracy of expression data analysis in time course experiments using resampling |
title | Improving the accuracy of expression data analysis in time course experiments using resampling |
title_full | Improving the accuracy of expression data analysis in time course experiments using resampling |
title_fullStr | Improving the accuracy of expression data analysis in time course experiments using resampling |
title_full_unstemmed | Improving the accuracy of expression data analysis in time course experiments using resampling |
title_short | Improving the accuracy of expression data analysis in time course experiments using resampling |
title_sort | improving the accuracy of expression data analysis in time course experiments using resampling |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4220062/ https://www.ncbi.nlm.nih.gov/pubmed/25344112 http://dx.doi.org/10.1186/s12859-014-0352-8 |
work_keys_str_mv | AT walterwencke improvingtheaccuracyofexpressiondataanalysisintimecourseexperimentsusingresampling AT stribernybernd improvingtheaccuracyofexpressiondataanalysisintimecourseexperimentsusingresampling AT gaquerelemmanuel improvingtheaccuracyofexpressiondataanalysisintimecourseexperimentsusingresampling AT baldwiniant improvingtheaccuracyofexpressiondataanalysisintimecourseexperimentsusingresampling AT kimsanggyu improvingtheaccuracyofexpressiondataanalysisintimecourseexperimentsusingresampling AT heilandines improvingtheaccuracyofexpressiondataanalysisintimecourseexperimentsusingresampling |