Cargando…

Overfitting Bayesian Mixture Models with an Unknown Number of Components

This paper proposes solutions to three issues pertaining to the estimation of finite mixture models with an unknown number of components: the non-identifiability induced by overfitting the number of components, the mixing limitations of standard Markov Chain Monte Carlo (MCMC) sampling techniques, a...

Descripción completa

Detalles Bibliográficos
Autores principales: van Havre, Zoé, White, Nicole, Rousseau, Judith, Mengersen, Kerrie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4503697/
https://www.ncbi.nlm.nih.gov/pubmed/26177375
http://dx.doi.org/10.1371/journal.pone.0131739
_version_ 1782381343574851584
author van Havre, Zoé
White, Nicole
Rousseau, Judith
Mengersen, Kerrie
author_facet van Havre, Zoé
White, Nicole
Rousseau, Judith
Mengersen, Kerrie
author_sort van Havre, Zoé
collection PubMed
description This paper proposes solutions to three issues pertaining to the estimation of finite mixture models with an unknown number of components: the non-identifiability induced by overfitting the number of components, the mixing limitations of standard Markov Chain Monte Carlo (MCMC) sampling techniques, and the related label switching problem. An overfitting approach is used to estimate the number of components in a finite mixture model via a Zmix algorithm. Zmix provides a bridge between multidimensional samplers and test based estimation methods, whereby priors are chosen to encourage extra groups to have weights approaching zero. MCMC sampling is made possible by the implementation of prior parallel tempering, an extension of parallel tempering. Zmix can accurately estimate the number of components, posterior parameter estimates and allocation probabilities given a sufficiently large sample size. The results will reflect uncertainty in the final model and will report the range of possible candidate models and their respective estimated probabilities from a single run. Label switching is resolved with a computationally light-weight method, Zswitch, developed for overfitted mixtures by exploiting the intuitiveness of allocation-based relabelling algorithms and the precision of label-invariant loss functions. Four simulation studies are included to illustrate Zmix and Zswitch, as well as three case studies from the literature. All methods are available as part of the R package Zmix, which can currently be applied to univariate Gaussian mixture models.
format Online
Article
Text
id pubmed-4503697
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-45036972015-07-17 Overfitting Bayesian Mixture Models with an Unknown Number of Components van Havre, Zoé White, Nicole Rousseau, Judith Mengersen, Kerrie PLoS One Research Article This paper proposes solutions to three issues pertaining to the estimation of finite mixture models with an unknown number of components: the non-identifiability induced by overfitting the number of components, the mixing limitations of standard Markov Chain Monte Carlo (MCMC) sampling techniques, and the related label switching problem. An overfitting approach is used to estimate the number of components in a finite mixture model via a Zmix algorithm. Zmix provides a bridge between multidimensional samplers and test based estimation methods, whereby priors are chosen to encourage extra groups to have weights approaching zero. MCMC sampling is made possible by the implementation of prior parallel tempering, an extension of parallel tempering. Zmix can accurately estimate the number of components, posterior parameter estimates and allocation probabilities given a sufficiently large sample size. The results will reflect uncertainty in the final model and will report the range of possible candidate models and their respective estimated probabilities from a single run. Label switching is resolved with a computationally light-weight method, Zswitch, developed for overfitted mixtures by exploiting the intuitiveness of allocation-based relabelling algorithms and the precision of label-invariant loss functions. Four simulation studies are included to illustrate Zmix and Zswitch, as well as three case studies from the literature. All methods are available as part of the R package Zmix, which can currently be applied to univariate Gaussian mixture models. Public Library of Science 2015-07-15 /pmc/articles/PMC4503697/ /pubmed/26177375 http://dx.doi.org/10.1371/journal.pone.0131739 Text en © 2015 van Havre et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
van Havre, Zoé
White, Nicole
Rousseau, Judith
Mengersen, Kerrie
Overfitting Bayesian Mixture Models with an Unknown Number of Components
title Overfitting Bayesian Mixture Models with an Unknown Number of Components
title_full Overfitting Bayesian Mixture Models with an Unknown Number of Components
title_fullStr Overfitting Bayesian Mixture Models with an Unknown Number of Components
title_full_unstemmed Overfitting Bayesian Mixture Models with an Unknown Number of Components
title_short Overfitting Bayesian Mixture Models with an Unknown Number of Components
title_sort overfitting bayesian mixture models with an unknown number of components
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4503697/
https://www.ncbi.nlm.nih.gov/pubmed/26177375
http://dx.doi.org/10.1371/journal.pone.0131739
work_keys_str_mv AT vanhavrezoe overfittingbayesianmixturemodelswithanunknownnumberofcomponents
AT whitenicole overfittingbayesianmixturemodelswithanunknownnumberofcomponents
AT rousseaujudith overfittingbayesianmixturemodelswithanunknownnumberofcomponents
AT mengersenkerrie overfittingbayesianmixturemodelswithanunknownnumberofcomponents