Cargando…

Shrinkage estimation of dispersion in Negative Binomial models for RNA-seq experiments with small sample size

Motivation: RNA-seq experiments produce digital counts of reads that are affected by both biological and technical variation. To distinguish the systematic changes in expression between conditions from noise, the counts are frequently modeled by the Negative Binomial distribution. However, in experi...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Danni, Huber, Wolfgang, Vitek, Olga
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3654711/
https://www.ncbi.nlm.nih.gov/pubmed/23589650
http://dx.doi.org/10.1093/bioinformatics/btt143
_version_ 1782476065379188736
author Yu, Danni
Huber, Wolfgang
Vitek, Olga
author_facet Yu, Danni
Huber, Wolfgang
Vitek, Olga
author_sort Yu, Danni
collection PubMed
description Motivation: RNA-seq experiments produce digital counts of reads that are affected by both biological and technical variation. To distinguish the systematic changes in expression between conditions from noise, the counts are frequently modeled by the Negative Binomial distribution. However, in experiments with small sample size, the per-gene estimates of the dispersion parameter are unreliable. Method: We propose a simple and effective approach for estimating the dispersions. First, we obtain the initial estimates for each gene using the method of moments. Second, the estimates are regularized, i.e. shrunk towards a common value that minimizes the average squared difference between the initial estimates and the shrinkage estimates. The approach does not require extra modeling assumptions, is easy to compute and is compatible with the exact test of differential expression. Results: We evaluated the proposed approach using 10 simulated and experimental datasets and compared its performance with that of currently popular packages edgeR, DESeq, baySeq, BBSeq and SAMseq. For these datasets, sSeq performed favorably for experiments with small sample size in sensitivity, specificity and computational time. Availability: http://www.stat.purdue.edu/∼ovitek/Software.html and Bioconductor. Contact: ovitek@purdue.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-3654711
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-36547112013-05-17 Shrinkage estimation of dispersion in Negative Binomial models for RNA-seq experiments with small sample size Yu, Danni Huber, Wolfgang Vitek, Olga Bioinformatics Original Papers Motivation: RNA-seq experiments produce digital counts of reads that are affected by both biological and technical variation. To distinguish the systematic changes in expression between conditions from noise, the counts are frequently modeled by the Negative Binomial distribution. However, in experiments with small sample size, the per-gene estimates of the dispersion parameter are unreliable. Method: We propose a simple and effective approach for estimating the dispersions. First, we obtain the initial estimates for each gene using the method of moments. Second, the estimates are regularized, i.e. shrunk towards a common value that minimizes the average squared difference between the initial estimates and the shrinkage estimates. The approach does not require extra modeling assumptions, is easy to compute and is compatible with the exact test of differential expression. Results: We evaluated the proposed approach using 10 simulated and experimental datasets and compared its performance with that of currently popular packages edgeR, DESeq, baySeq, BBSeq and SAMseq. For these datasets, sSeq performed favorably for experiments with small sample size in sensitivity, specificity and computational time. Availability: http://www.stat.purdue.edu/∼ovitek/Software.html and Bioconductor. Contact: ovitek@purdue.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2013-05-15 2013-04-14 /pmc/articles/PMC3654711/ /pubmed/23589650 http://dx.doi.org/10.1093/bioinformatics/btt143 Text en © The Author 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Yu, Danni
Huber, Wolfgang
Vitek, Olga
Shrinkage estimation of dispersion in Negative Binomial models for RNA-seq experiments with small sample size
title Shrinkage estimation of dispersion in Negative Binomial models for RNA-seq experiments with small sample size
title_full Shrinkage estimation of dispersion in Negative Binomial models for RNA-seq experiments with small sample size
title_fullStr Shrinkage estimation of dispersion in Negative Binomial models for RNA-seq experiments with small sample size
title_full_unstemmed Shrinkage estimation of dispersion in Negative Binomial models for RNA-seq experiments with small sample size
title_short Shrinkage estimation of dispersion in Negative Binomial models for RNA-seq experiments with small sample size
title_sort shrinkage estimation of dispersion in negative binomial models for rna-seq experiments with small sample size
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3654711/
https://www.ncbi.nlm.nih.gov/pubmed/23589650
http://dx.doi.org/10.1093/bioinformatics/btt143
work_keys_str_mv AT yudanni shrinkageestimationofdispersioninnegativebinomialmodelsforrnaseqexperimentswithsmallsamplesize
AT huberwolfgang shrinkageestimationofdispersioninnegativebinomialmodelsforrnaseqexperimentswithsmallsamplesize
AT vitekolga shrinkageestimationofdispersioninnegativebinomialmodelsforrnaseqexperimentswithsmallsamplesize