Cargando…

Approximating multivariate posterior distribution functions from Monte Carlo samples for sequential Bayesian inference

An important feature of Bayesian statistics is the opportunity to do sequential inference: the posterior distribution obtained after seeing a dataset can be used as prior for a second inference. However, when Monte Carlo sampling methods are used for inference, we only have a set of samples from the...

Descripción completa

Detalles Bibliográficos
Autores principales: Thijssen, Bram, Wessels, Lodewyk F. A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7069631/
https://www.ncbi.nlm.nih.gov/pubmed/32168343
http://dx.doi.org/10.1371/journal.pone.0230101
_version_ 1783505814814720000
author Thijssen, Bram
Wessels, Lodewyk F. A.
author_facet Thijssen, Bram
Wessels, Lodewyk F. A.
author_sort Thijssen, Bram
collection PubMed
description An important feature of Bayesian statistics is the opportunity to do sequential inference: the posterior distribution obtained after seeing a dataset can be used as prior for a second inference. However, when Monte Carlo sampling methods are used for inference, we only have a set of samples from the posterior distribution. To do sequential inference, we then either have to evaluate the second posterior at only these locations and reweight the samples accordingly, or we can estimate a functional description of the posterior probability distribution from the samples and use that as prior for the second inference. Here, we investigated to what extent we can obtain an accurate joint posterior from two datasets if the inference is done sequentially rather than jointly, under the condition that each inference step is done using Monte Carlo sampling. To test this, we evaluated the accuracy of kernel density estimates, Gaussian mixtures, mixtures of factor analyzers, vine copulas and Gaussian processes in approximating posterior distributions, and then tested whether these approximations can be used in sequential inference. In low dimensionality, Gaussian processes are more accurate, whereas in higher dimensionality Gaussian mixtures, mixtures of factor analyzers or vine copulas perform better. In our test cases of sequential inference, using posterior approximations gives more accurate results than direct sample reweighting, but joint inference is still preferable over sequential inference whenever possible. Since the performance is case-specific, we provide an R package mvdens with a unified interface for the density approximation methods.
format Online
Article
Text
id pubmed-7069631
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-70696312020-03-23 Approximating multivariate posterior distribution functions from Monte Carlo samples for sequential Bayesian inference Thijssen, Bram Wessels, Lodewyk F. A. PLoS One Research Article An important feature of Bayesian statistics is the opportunity to do sequential inference: the posterior distribution obtained after seeing a dataset can be used as prior for a second inference. However, when Monte Carlo sampling methods are used for inference, we only have a set of samples from the posterior distribution. To do sequential inference, we then either have to evaluate the second posterior at only these locations and reweight the samples accordingly, or we can estimate a functional description of the posterior probability distribution from the samples and use that as prior for the second inference. Here, we investigated to what extent we can obtain an accurate joint posterior from two datasets if the inference is done sequentially rather than jointly, under the condition that each inference step is done using Monte Carlo sampling. To test this, we evaluated the accuracy of kernel density estimates, Gaussian mixtures, mixtures of factor analyzers, vine copulas and Gaussian processes in approximating posterior distributions, and then tested whether these approximations can be used in sequential inference. In low dimensionality, Gaussian processes are more accurate, whereas in higher dimensionality Gaussian mixtures, mixtures of factor analyzers or vine copulas perform better. In our test cases of sequential inference, using posterior approximations gives more accurate results than direct sample reweighting, but joint inference is still preferable over sequential inference whenever possible. Since the performance is case-specific, we provide an R package mvdens with a unified interface for the density approximation methods. Public Library of Science 2020-03-13 /pmc/articles/PMC7069631/ /pubmed/32168343 http://dx.doi.org/10.1371/journal.pone.0230101 Text en © 2020 Thijssen, Wessels http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Thijssen, Bram
Wessels, Lodewyk F. A.
Approximating multivariate posterior distribution functions from Monte Carlo samples for sequential Bayesian inference
title Approximating multivariate posterior distribution functions from Monte Carlo samples for sequential Bayesian inference
title_full Approximating multivariate posterior distribution functions from Monte Carlo samples for sequential Bayesian inference
title_fullStr Approximating multivariate posterior distribution functions from Monte Carlo samples for sequential Bayesian inference
title_full_unstemmed Approximating multivariate posterior distribution functions from Monte Carlo samples for sequential Bayesian inference
title_short Approximating multivariate posterior distribution functions from Monte Carlo samples for sequential Bayesian inference
title_sort approximating multivariate posterior distribution functions from monte carlo samples for sequential bayesian inference
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7069631/
https://www.ncbi.nlm.nih.gov/pubmed/32168343
http://dx.doi.org/10.1371/journal.pone.0230101
work_keys_str_mv AT thijssenbram approximatingmultivariateposteriordistributionfunctionsfrommontecarlosamplesforsequentialbayesianinference
AT wesselslodewykfa approximatingmultivariateposteriordistributionfunctionsfrommontecarlosamplesforsequentialbayesianinference