Cargando…

NPEBseq: nonparametric empirical bayesian-based procedure for differential expression analysis of RNA-seq data

BACKGROUND: RNA-seq, a massive parallel-sequencing-based transcriptome profiling method, provides digital data in the form of aligned sequence read counts. The comparative analyses of the data require appropriate statistical methods to estimate the differential expression of transcript variants acro...

Descripción completa

Detalles Bibliográficos
Autores principales: Bi, Yingtao, Davuluri, Ramana V
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3765716/
https://www.ncbi.nlm.nih.gov/pubmed/23981227
http://dx.doi.org/10.1186/1471-2105-14-262
_version_ 1782283374534066176
author Bi, Yingtao
Davuluri, Ramana V
author_facet Bi, Yingtao
Davuluri, Ramana V
author_sort Bi, Yingtao
collection PubMed
description BACKGROUND: RNA-seq, a massive parallel-sequencing-based transcriptome profiling method, provides digital data in the form of aligned sequence read counts. The comparative analyses of the data require appropriate statistical methods to estimate the differential expression of transcript variants across different cell/tissue types and disease conditions. RESULTS: We developed a novel nonparametric empirical Bayesian-based approach (NPEBseq) to model the RNA-seq data. The prior distribution of the Bayesian model is empirically estimated from the data without any parametric assumption, and hence the method is “nonparametric” in nature. Based on this model, we proposed a method for detecting differentially expressed genes across different conditions. We also extended this method to detect differential usage of exons from RNA-seq data. The evaluation of NPEBseq on both simulated and publicly available RNA-seq datasets and comparison with three popular methods showed improved results for experiments with or without biological replicates. CONCLUSIONS: NPEBseq can successfully detect differential expression between different conditions not only at gene level but also at exon level from RNA-seq datasets. In addition, NPEBSeq performs significantly better than current methods and can be applied to genome-wide RNA-seq datasets. Sample datasets and R package are available at http://bioinformatics.wistar.upenn.edu/NPEBseq.
format Online
Article
Text
id pubmed-3765716
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-37657162013-09-11 NPEBseq: nonparametric empirical bayesian-based procedure for differential expression analysis of RNA-seq data Bi, Yingtao Davuluri, Ramana V BMC Bioinformatics Software BACKGROUND: RNA-seq, a massive parallel-sequencing-based transcriptome profiling method, provides digital data in the form of aligned sequence read counts. The comparative analyses of the data require appropriate statistical methods to estimate the differential expression of transcript variants across different cell/tissue types and disease conditions. RESULTS: We developed a novel nonparametric empirical Bayesian-based approach (NPEBseq) to model the RNA-seq data. The prior distribution of the Bayesian model is empirically estimated from the data without any parametric assumption, and hence the method is “nonparametric” in nature. Based on this model, we proposed a method for detecting differentially expressed genes across different conditions. We also extended this method to detect differential usage of exons from RNA-seq data. The evaluation of NPEBseq on both simulated and publicly available RNA-seq datasets and comparison with three popular methods showed improved results for experiments with or without biological replicates. CONCLUSIONS: NPEBseq can successfully detect differential expression between different conditions not only at gene level but also at exon level from RNA-seq datasets. In addition, NPEBSeq performs significantly better than current methods and can be applied to genome-wide RNA-seq datasets. Sample datasets and R package are available at http://bioinformatics.wistar.upenn.edu/NPEBseq. BioMed Central 2013-08-27 /pmc/articles/PMC3765716/ /pubmed/23981227 http://dx.doi.org/10.1186/1471-2105-14-262 Text en Copyright © 2013 Bi and Davuluri; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Bi, Yingtao
Davuluri, Ramana V
NPEBseq: nonparametric empirical bayesian-based procedure for differential expression analysis of RNA-seq data
title NPEBseq: nonparametric empirical bayesian-based procedure for differential expression analysis of RNA-seq data
title_full NPEBseq: nonparametric empirical bayesian-based procedure for differential expression analysis of RNA-seq data
title_fullStr NPEBseq: nonparametric empirical bayesian-based procedure for differential expression analysis of RNA-seq data
title_full_unstemmed NPEBseq: nonparametric empirical bayesian-based procedure for differential expression analysis of RNA-seq data
title_short NPEBseq: nonparametric empirical bayesian-based procedure for differential expression analysis of RNA-seq data
title_sort npebseq: nonparametric empirical bayesian-based procedure for differential expression analysis of rna-seq data
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3765716/
https://www.ncbi.nlm.nih.gov/pubmed/23981227
http://dx.doi.org/10.1186/1471-2105-14-262
work_keys_str_mv AT biyingtao npebseqnonparametricempiricalbayesianbasedprocedurefordifferentialexpressionanalysisofrnaseqdata
AT davuluriramanav npebseqnonparametricempiricalbayesianbasedprocedurefordifferentialexpressionanalysisofrnaseqdata