Cargando…

Bootstrap-based differential gene expression analysis for RNA-Seq data with and without replicates

A major application of RNA-Seq is to perform differential gene expression analysis. Many tools exist to analyze differentially expressed genes in the presence of biological replicates. Frequently, however, RNA-Seq experiments have no or very few biological replicates and development of methods for d...

Descripción completa

Detalles Bibliográficos
Autores principales: Al Seesi, Sahar, Tiagueu, Yvette Temate, Zelikovsky, Alexander, Măndoiu, Ion I
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4248812/
https://www.ncbi.nlm.nih.gov/pubmed/25435284
http://dx.doi.org/10.1186/1471-2164-15-S8-S2
_version_ 1782346847654772736
author Al Seesi, Sahar
Tiagueu, Yvette Temate
Zelikovsky, Alexander
Măndoiu, Ion I
author_facet Al Seesi, Sahar
Tiagueu, Yvette Temate
Zelikovsky, Alexander
Măndoiu, Ion I
author_sort Al Seesi, Sahar
collection PubMed
description A major application of RNA-Seq is to perform differential gene expression analysis. Many tools exist to analyze differentially expressed genes in the presence of biological replicates. Frequently, however, RNA-Seq experiments have no or very few biological replicates and development of methods for detecting differentially expressed genes in these scenarios is still an active research area. In this paper we introduce a novel method, called IsoDE, for differential gene expression analysis based on bootstrapping. We compared IsoDE against four existing methods (Fisher's exact test, GFOLD, edgeR and Cuffdiff) on RNA-Seq datasets generated using three different sequencing technologies, both with and without replicates. Experiments on MAQC RNA-Seq datasets without replicates show that IsoDE has consistently high accuracy as defined by the qPCR ground truth, frequently higher than that of the compared methods, particularly for low coverage data and at lower fold change thresholds. In experiments on RNA-Seq datasets with up to 7 replicates, IsoDE has also achieved high accuracy. Furthermore, unlike GFOLD and edgeR, IsoDE accuracy varies smoothly with the number of replicates, and is relatively uniform across the entire range of gene expression levels. The proposed non-parametric method based on bootstrapping has practical running time, and achieves robust performance over a broad range of technologies, number of replicates, sequencing depths, and minimum fold change thresholds.
format Online
Article
Text
id pubmed-4248812
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42488122014-12-04 Bootstrap-based differential gene expression analysis for RNA-Seq data with and without replicates Al Seesi, Sahar Tiagueu, Yvette Temate Zelikovsky, Alexander Măndoiu, Ion I BMC Genomics Proceedings A major application of RNA-Seq is to perform differential gene expression analysis. Many tools exist to analyze differentially expressed genes in the presence of biological replicates. Frequently, however, RNA-Seq experiments have no or very few biological replicates and development of methods for detecting differentially expressed genes in these scenarios is still an active research area. In this paper we introduce a novel method, called IsoDE, for differential gene expression analysis based on bootstrapping. We compared IsoDE against four existing methods (Fisher's exact test, GFOLD, edgeR and Cuffdiff) on RNA-Seq datasets generated using three different sequencing technologies, both with and without replicates. Experiments on MAQC RNA-Seq datasets without replicates show that IsoDE has consistently high accuracy as defined by the qPCR ground truth, frequently higher than that of the compared methods, particularly for low coverage data and at lower fold change thresholds. In experiments on RNA-Seq datasets with up to 7 replicates, IsoDE has also achieved high accuracy. Furthermore, unlike GFOLD and edgeR, IsoDE accuracy varies smoothly with the number of replicates, and is relatively uniform across the entire range of gene expression levels. The proposed non-parametric method based on bootstrapping has practical running time, and achieves robust performance over a broad range of technologies, number of replicates, sequencing depths, and minimum fold change thresholds. BioMed Central 2014-11-13 /pmc/articles/PMC4248812/ /pubmed/25435284 http://dx.doi.org/10.1186/1471-2164-15-S8-S2 Text en Copyright © 2014 Seesi et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Proceedings
Al Seesi, Sahar
Tiagueu, Yvette Temate
Zelikovsky, Alexander
Măndoiu, Ion I
Bootstrap-based differential gene expression analysis for RNA-Seq data with and without replicates
title Bootstrap-based differential gene expression analysis for RNA-Seq data with and without replicates
title_full Bootstrap-based differential gene expression analysis for RNA-Seq data with and without replicates
title_fullStr Bootstrap-based differential gene expression analysis for RNA-Seq data with and without replicates
title_full_unstemmed Bootstrap-based differential gene expression analysis for RNA-Seq data with and without replicates
title_short Bootstrap-based differential gene expression analysis for RNA-Seq data with and without replicates
title_sort bootstrap-based differential gene expression analysis for rna-seq data with and without replicates
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4248812/
https://www.ncbi.nlm.nih.gov/pubmed/25435284
http://dx.doi.org/10.1186/1471-2164-15-S8-S2
work_keys_str_mv AT alseesisahar bootstrapbaseddifferentialgeneexpressionanalysisforrnaseqdatawithandwithoutreplicates
AT tiagueuyvettetemate bootstrapbaseddifferentialgeneexpressionanalysisforrnaseqdatawithandwithoutreplicates
AT zelikovskyalexander bootstrapbaseddifferentialgeneexpressionanalysisforrnaseqdatawithandwithoutreplicates
AT mandoiuioni bootstrapbaseddifferentialgeneexpressionanalysisforrnaseqdatawithandwithoutreplicates