Cargando…
Bayesian inference on quasi-sparse count data
There is growing interest in analysing high-dimensional count data, which often exhibit quasi-sparsity corresponding to an overabundance of zeros and small nonzero counts. Existing methods for analysing multivariate count data via Poisson or negative binomial log-linear hierarchical models with zero...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5793680/ https://www.ncbi.nlm.nih.gov/pubmed/29422693 http://dx.doi.org/10.1093/biomet/asw053 |
_version_ | 1783297007902785536 |
---|---|
author | Datta, Jyotishka Dunson, David B. |
author_facet | Datta, Jyotishka Dunson, David B. |
author_sort | Datta, Jyotishka |
collection | PubMed |
description | There is growing interest in analysing high-dimensional count data, which often exhibit quasi-sparsity corresponding to an overabundance of zeros and small nonzero counts. Existing methods for analysing multivariate count data via Poisson or negative binomial log-linear hierarchical models with zero-inflation cannot flexibly adapt to quasi-sparse settings. We develop a new class of continuous local-global shrinkage priors tailored to quasi-sparse counts. Theoretical properties are assessed, including flexible posterior concentration and stronger control of false discoveries in multiple testing. Simulation studies demonstrate excellent small-sample properties relative to competing methods. We use the method to detect rare mutational hotspots in exome sequencing data and to identify North American cities most impacted by terrorism. |
format | Online Article Text |
id | pubmed-5793680 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-57936802018-02-06 Bayesian inference on quasi-sparse count data Datta, Jyotishka Dunson, David B. Biometrika Articles There is growing interest in analysing high-dimensional count data, which often exhibit quasi-sparsity corresponding to an overabundance of zeros and small nonzero counts. Existing methods for analysing multivariate count data via Poisson or negative binomial log-linear hierarchical models with zero-inflation cannot flexibly adapt to quasi-sparse settings. We develop a new class of continuous local-global shrinkage priors tailored to quasi-sparse counts. Theoretical properties are assessed, including flexible posterior concentration and stronger control of false discoveries in multiple testing. Simulation studies demonstrate excellent small-sample properties relative to competing methods. We use the method to detect rare mutational hotspots in exome sequencing data and to identify North American cities most impacted by terrorism. Oxford University Press 2016-12 2016-12-08 /pmc/articles/PMC5793680/ /pubmed/29422693 http://dx.doi.org/10.1093/biomet/asw053 Text en © 2016 Biometrika Trust |
spellingShingle | Articles Datta, Jyotishka Dunson, David B. Bayesian inference on quasi-sparse count data |
title | Bayesian inference on quasi-sparse count data |
title_full | Bayesian inference on quasi-sparse count data |
title_fullStr | Bayesian inference on quasi-sparse count data |
title_full_unstemmed | Bayesian inference on quasi-sparse count data |
title_short | Bayesian inference on quasi-sparse count data |
title_sort | bayesian inference on quasi-sparse count data |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5793680/ https://www.ncbi.nlm.nih.gov/pubmed/29422693 http://dx.doi.org/10.1093/biomet/asw053 |
work_keys_str_mv | AT dattajyotishka bayesianinferenceonquasisparsecountdata AT dunsondavidb bayesianinferenceonquasisparsecountdata |