Cargando…
Bootstrap-based differential gene expression analysis for RNA-Seq data with and without replicates
A major application of RNA-Seq is to perform differential gene expression analysis. Many tools exist to analyze differentially expressed genes in the presence of biological replicates. Frequently, however, RNA-Seq experiments have no or very few biological replicates and development of methods for d...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4248812/ https://www.ncbi.nlm.nih.gov/pubmed/25435284 http://dx.doi.org/10.1186/1471-2164-15-S8-S2 |
_version_ | 1782346847654772736 |
---|---|
author | Al Seesi, Sahar Tiagueu, Yvette Temate Zelikovsky, Alexander Măndoiu, Ion I |
author_facet | Al Seesi, Sahar Tiagueu, Yvette Temate Zelikovsky, Alexander Măndoiu, Ion I |
author_sort | Al Seesi, Sahar |
collection | PubMed |
description | A major application of RNA-Seq is to perform differential gene expression analysis. Many tools exist to analyze differentially expressed genes in the presence of biological replicates. Frequently, however, RNA-Seq experiments have no or very few biological replicates and development of methods for detecting differentially expressed genes in these scenarios is still an active research area. In this paper we introduce a novel method, called IsoDE, for differential gene expression analysis based on bootstrapping. We compared IsoDE against four existing methods (Fisher's exact test, GFOLD, edgeR and Cuffdiff) on RNA-Seq datasets generated using three different sequencing technologies, both with and without replicates. Experiments on MAQC RNA-Seq datasets without replicates show that IsoDE has consistently high accuracy as defined by the qPCR ground truth, frequently higher than that of the compared methods, particularly for low coverage data and at lower fold change thresholds. In experiments on RNA-Seq datasets with up to 7 replicates, IsoDE has also achieved high accuracy. Furthermore, unlike GFOLD and edgeR, IsoDE accuracy varies smoothly with the number of replicates, and is relatively uniform across the entire range of gene expression levels. The proposed non-parametric method based on bootstrapping has practical running time, and achieves robust performance over a broad range of technologies, number of replicates, sequencing depths, and minimum fold change thresholds. |
format | Online Article Text |
id | pubmed-4248812 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-42488122014-12-04 Bootstrap-based differential gene expression analysis for RNA-Seq data with and without replicates Al Seesi, Sahar Tiagueu, Yvette Temate Zelikovsky, Alexander Măndoiu, Ion I BMC Genomics Proceedings A major application of RNA-Seq is to perform differential gene expression analysis. Many tools exist to analyze differentially expressed genes in the presence of biological replicates. Frequently, however, RNA-Seq experiments have no or very few biological replicates and development of methods for detecting differentially expressed genes in these scenarios is still an active research area. In this paper we introduce a novel method, called IsoDE, for differential gene expression analysis based on bootstrapping. We compared IsoDE against four existing methods (Fisher's exact test, GFOLD, edgeR and Cuffdiff) on RNA-Seq datasets generated using three different sequencing technologies, both with and without replicates. Experiments on MAQC RNA-Seq datasets without replicates show that IsoDE has consistently high accuracy as defined by the qPCR ground truth, frequently higher than that of the compared methods, particularly for low coverage data and at lower fold change thresholds. In experiments on RNA-Seq datasets with up to 7 replicates, IsoDE has also achieved high accuracy. Furthermore, unlike GFOLD and edgeR, IsoDE accuracy varies smoothly with the number of replicates, and is relatively uniform across the entire range of gene expression levels. The proposed non-parametric method based on bootstrapping has practical running time, and achieves robust performance over a broad range of technologies, number of replicates, sequencing depths, and minimum fold change thresholds. BioMed Central 2014-11-13 /pmc/articles/PMC4248812/ /pubmed/25435284 http://dx.doi.org/10.1186/1471-2164-15-S8-S2 Text en Copyright © 2014 Seesi et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Proceedings Al Seesi, Sahar Tiagueu, Yvette Temate Zelikovsky, Alexander Măndoiu, Ion I Bootstrap-based differential gene expression analysis for RNA-Seq data with and without replicates |
title | Bootstrap-based differential gene expression analysis for RNA-Seq data with and without replicates |
title_full | Bootstrap-based differential gene expression analysis for RNA-Seq data with and without replicates |
title_fullStr | Bootstrap-based differential gene expression analysis for RNA-Seq data with and without replicates |
title_full_unstemmed | Bootstrap-based differential gene expression analysis for RNA-Seq data with and without replicates |
title_short | Bootstrap-based differential gene expression analysis for RNA-Seq data with and without replicates |
title_sort | bootstrap-based differential gene expression analysis for rna-seq data with and without replicates |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4248812/ https://www.ncbi.nlm.nih.gov/pubmed/25435284 http://dx.doi.org/10.1186/1471-2164-15-S8-S2 |
work_keys_str_mv | AT alseesisahar bootstrapbaseddifferentialgeneexpressionanalysisforrnaseqdatawithandwithoutreplicates AT tiagueuyvettetemate bootstrapbaseddifferentialgeneexpressionanalysisforrnaseqdatawithandwithoutreplicates AT zelikovskyalexander bootstrapbaseddifferentialgeneexpressionanalysisforrnaseqdatawithandwithoutreplicates AT mandoiuioni bootstrapbaseddifferentialgeneexpressionanalysisforrnaseqdatawithandwithoutreplicates |