Cargando…

A new shrinkage estimator for dispersion improves differential expression detection in RNA-seq data

Recent developments in RNA-sequencing (RNA-seq) technology have led to a rapid increase in gene expression data in the form of counts. RNA-seq can be used for a variety of applications, however, identifying differential expression (DE) remains a key task in functional genomics. There have been a num...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Hao, Wang, Chi, Wu, Zhijin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3590927/
https://www.ncbi.nlm.nih.gov/pubmed/23001152
http://dx.doi.org/10.1093/biostatistics/kxs033
_version_ 1782261944003067904
author Wu, Hao
Wang, Chi
Wu, Zhijin
author_facet Wu, Hao
Wang, Chi
Wu, Zhijin
author_sort Wu, Hao
collection PubMed
description Recent developments in RNA-sequencing (RNA-seq) technology have led to a rapid increase in gene expression data in the form of counts. RNA-seq can be used for a variety of applications, however, identifying differential expression (DE) remains a key task in functional genomics. There have been a number of statistical methods for DE detection for RNA-seq data. One common feature of several leading methods is the use of the negative binomial (Gamma–Poisson mixture) model. That is, the unobserved gene expression is modeled by a gamma random variable and, given the expression, the sequencing read counts are modeled as Poisson. The distinct feature in various methods is how the variance, or dispersion, in the Gamma distribution is modeled and estimated. We evaluate several large public RNA-seq datasets and find that the estimated dispersion in existing methods does not adequately capture the heterogeneity of biological variance among samples. We present a new empirical Bayes shrinkage estimate of the dispersion parameters and demonstrate improved DE detection.
format Online
Article
Text
id pubmed-3590927
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-35909272013-03-07 A new shrinkage estimator for dispersion improves differential expression detection in RNA-seq data Wu, Hao Wang, Chi Wu, Zhijin Biostatistics Articles Recent developments in RNA-sequencing (RNA-seq) technology have led to a rapid increase in gene expression data in the form of counts. RNA-seq can be used for a variety of applications, however, identifying differential expression (DE) remains a key task in functional genomics. There have been a number of statistical methods for DE detection for RNA-seq data. One common feature of several leading methods is the use of the negative binomial (Gamma–Poisson mixture) model. That is, the unobserved gene expression is modeled by a gamma random variable and, given the expression, the sequencing read counts are modeled as Poisson. The distinct feature in various methods is how the variance, or dispersion, in the Gamma distribution is modeled and estimated. We evaluate several large public RNA-seq datasets and find that the estimated dispersion in existing methods does not adequately capture the heterogeneity of biological variance among samples. We present a new empirical Bayes shrinkage estimate of the dispersion parameters and demonstrate improved DE detection. Oxford University Press 2013-04 2012-09-22 /pmc/articles/PMC3590927/ /pubmed/23001152 http://dx.doi.org/10.1093/biostatistics/kxs033 Text en © The Author 2012. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
Wu, Hao
Wang, Chi
Wu, Zhijin
A new shrinkage estimator for dispersion improves differential expression detection in RNA-seq data
title A new shrinkage estimator for dispersion improves differential expression detection in RNA-seq data
title_full A new shrinkage estimator for dispersion improves differential expression detection in RNA-seq data
title_fullStr A new shrinkage estimator for dispersion improves differential expression detection in RNA-seq data
title_full_unstemmed A new shrinkage estimator for dispersion improves differential expression detection in RNA-seq data
title_short A new shrinkage estimator for dispersion improves differential expression detection in RNA-seq data
title_sort new shrinkage estimator for dispersion improves differential expression detection in rna-seq data
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3590927/
https://www.ncbi.nlm.nih.gov/pubmed/23001152
http://dx.doi.org/10.1093/biostatistics/kxs033
work_keys_str_mv AT wuhao anewshrinkageestimatorfordispersionimprovesdifferentialexpressiondetectioninrnaseqdata
AT wangchi anewshrinkageestimatorfordispersionimprovesdifferentialexpressiondetectioninrnaseqdata
AT wuzhijin anewshrinkageestimatorfordispersionimprovesdifferentialexpressiondetectioninrnaseqdata
AT wuhao newshrinkageestimatorfordispersionimprovesdifferentialexpressiondetectioninrnaseqdata
AT wangchi newshrinkageestimatorfordispersionimprovesdifferentialexpressiondetectioninrnaseqdata
AT wuzhijin newshrinkageestimatorfordispersionimprovesdifferentialexpressiondetectioninrnaseqdata