Cargando…

Bayesian inference on quasi-sparse count data

There is growing interest in analysing high-dimensional count data, which often exhibit quasi-sparsity corresponding to an overabundance of zeros and small nonzero counts. Existing methods for analysing multivariate count data via Poisson or negative binomial log-linear hierarchical models with zero...

Descripción completa

Detalles Bibliográficos
Autores principales: Datta, Jyotishka, Dunson, David B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5793680/
https://www.ncbi.nlm.nih.gov/pubmed/29422693
http://dx.doi.org/10.1093/biomet/asw053
_version_ 1783297007902785536
author Datta, Jyotishka
Dunson, David B.
author_facet Datta, Jyotishka
Dunson, David B.
author_sort Datta, Jyotishka
collection PubMed
description There is growing interest in analysing high-dimensional count data, which often exhibit quasi-sparsity corresponding to an overabundance of zeros and small nonzero counts. Existing methods for analysing multivariate count data via Poisson or negative binomial log-linear hierarchical models with zero-inflation cannot flexibly adapt to quasi-sparse settings. We develop a new class of continuous local-global shrinkage priors tailored to quasi-sparse counts. Theoretical properties are assessed, including flexible posterior concentration and stronger control of false discoveries in multiple testing. Simulation studies demonstrate excellent small-sample properties relative to competing methods. We use the method to detect rare mutational hotspots in exome sequencing data and to identify North American cities most impacted by terrorism.
format Online
Article
Text
id pubmed-5793680
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-57936802018-02-06 Bayesian inference on quasi-sparse count data Datta, Jyotishka Dunson, David B. Biometrika Articles There is growing interest in analysing high-dimensional count data, which often exhibit quasi-sparsity corresponding to an overabundance of zeros and small nonzero counts. Existing methods for analysing multivariate count data via Poisson or negative binomial log-linear hierarchical models with zero-inflation cannot flexibly adapt to quasi-sparse settings. We develop a new class of continuous local-global shrinkage priors tailored to quasi-sparse counts. Theoretical properties are assessed, including flexible posterior concentration and stronger control of false discoveries in multiple testing. Simulation studies demonstrate excellent small-sample properties relative to competing methods. We use the method to detect rare mutational hotspots in exome sequencing data and to identify North American cities most impacted by terrorism. Oxford University Press 2016-12 2016-12-08 /pmc/articles/PMC5793680/ /pubmed/29422693 http://dx.doi.org/10.1093/biomet/asw053 Text en © 2016 Biometrika Trust
spellingShingle Articles
Datta, Jyotishka
Dunson, David B.
Bayesian inference on quasi-sparse count data
title Bayesian inference on quasi-sparse count data
title_full Bayesian inference on quasi-sparse count data
title_fullStr Bayesian inference on quasi-sparse count data
title_full_unstemmed Bayesian inference on quasi-sparse count data
title_short Bayesian inference on quasi-sparse count data
title_sort bayesian inference on quasi-sparse count data
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5793680/
https://www.ncbi.nlm.nih.gov/pubmed/29422693
http://dx.doi.org/10.1093/biomet/asw053
work_keys_str_mv AT dattajyotishka bayesianinferenceonquasisparsecountdata
AT dunsondavidb bayesianinferenceonquasisparsecountdata