Cargando…

Improving the accuracy of expression data analysis in time course experiments using resampling

BACKGROUND: As time series experiments in higher eukaryotes usually obtain data from different individuals collected at the different time points, a time series sample itself is not equivalent to a true biological replicate but is, rather, a combination of several biological replicates. The analysis...

Descripción completa

Detalles Bibliográficos
Autores principales: Walter, Wencke, Striberny, Bernd, Gaquerel, Emmanuel, Baldwin, Ian T, Kim, Sang-Gyu, Heiland, Ines
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4220062/
https://www.ncbi.nlm.nih.gov/pubmed/25344112
http://dx.doi.org/10.1186/s12859-014-0352-8
_version_ 1782342687362383872
author Walter, Wencke
Striberny, Bernd
Gaquerel, Emmanuel
Baldwin, Ian T
Kim, Sang-Gyu
Heiland, Ines
author_facet Walter, Wencke
Striberny, Bernd
Gaquerel, Emmanuel
Baldwin, Ian T
Kim, Sang-Gyu
Heiland, Ines
author_sort Walter, Wencke
collection PubMed
description BACKGROUND: As time series experiments in higher eukaryotes usually obtain data from different individuals collected at the different time points, a time series sample itself is not equivalent to a true biological replicate but is, rather, a combination of several biological replicates. The analysis of expression data derived from a time series sample is therefore often performed with a low number of replicates due to budget limitations or limitations in sample availability. In addition, most algorithms developed to identify specific patterns in time series dataset do not consider biological variation in samples collected at the same conditions. RESULTS: Using artificial time course datasets, we show that resampling considerably improves the accuracy of transcripts identified as rhythmic. In particular, the number of false positives can be greatly reduced while at the same time the number of true positives can be maintained in the range of other methods currently used to determine rhythmically expressed genes. CONCLUSIONS: The resampling approach described here therefore increases the accuracy of time series expression data analysis and furthermore emphasizes the importance of biological replicates in identifying oscillating genes. Resampling can be used for any time series expression dataset as long as the samples are acquired from independent individuals at each time point. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-014-0352-8) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4220062
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42200622014-11-07 Improving the accuracy of expression data analysis in time course experiments using resampling Walter, Wencke Striberny, Bernd Gaquerel, Emmanuel Baldwin, Ian T Kim, Sang-Gyu Heiland, Ines BMC Bioinformatics Methodology Article BACKGROUND: As time series experiments in higher eukaryotes usually obtain data from different individuals collected at the different time points, a time series sample itself is not equivalent to a true biological replicate but is, rather, a combination of several biological replicates. The analysis of expression data derived from a time series sample is therefore often performed with a low number of replicates due to budget limitations or limitations in sample availability. In addition, most algorithms developed to identify specific patterns in time series dataset do not consider biological variation in samples collected at the same conditions. RESULTS: Using artificial time course datasets, we show that resampling considerably improves the accuracy of transcripts identified as rhythmic. In particular, the number of false positives can be greatly reduced while at the same time the number of true positives can be maintained in the range of other methods currently used to determine rhythmically expressed genes. CONCLUSIONS: The resampling approach described here therefore increases the accuracy of time series expression data analysis and furthermore emphasizes the importance of biological replicates in identifying oscillating genes. Resampling can be used for any time series expression dataset as long as the samples are acquired from independent individuals at each time point. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-014-0352-8) contains supplementary material, which is available to authorized users. BioMed Central 2014-10-25 /pmc/articles/PMC4220062/ /pubmed/25344112 http://dx.doi.org/10.1186/s12859-014-0352-8 Text en © Walter et al.; licensee BioMed Central Ltd. 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Walter, Wencke
Striberny, Bernd
Gaquerel, Emmanuel
Baldwin, Ian T
Kim, Sang-Gyu
Heiland, Ines
Improving the accuracy of expression data analysis in time course experiments using resampling
title Improving the accuracy of expression data analysis in time course experiments using resampling
title_full Improving the accuracy of expression data analysis in time course experiments using resampling
title_fullStr Improving the accuracy of expression data analysis in time course experiments using resampling
title_full_unstemmed Improving the accuracy of expression data analysis in time course experiments using resampling
title_short Improving the accuracy of expression data analysis in time course experiments using resampling
title_sort improving the accuracy of expression data analysis in time course experiments using resampling
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4220062/
https://www.ncbi.nlm.nih.gov/pubmed/25344112
http://dx.doi.org/10.1186/s12859-014-0352-8
work_keys_str_mv AT walterwencke improvingtheaccuracyofexpressiondataanalysisintimecourseexperimentsusingresampling
AT stribernybernd improvingtheaccuracyofexpressiondataanalysisintimecourseexperimentsusingresampling
AT gaquerelemmanuel improvingtheaccuracyofexpressiondataanalysisintimecourseexperimentsusingresampling
AT baldwiniant improvingtheaccuracyofexpressiondataanalysisintimecourseexperimentsusingresampling
AT kimsanggyu improvingtheaccuracyofexpressiondataanalysisintimecourseexperimentsusingresampling
AT heilandines improvingtheaccuracyofexpressiondataanalysisintimecourseexperimentsusingresampling