Cargando…

Bayesian cluster analysis

Bayesian cluster analysis offers substantial benefits over algorithmic approaches by providing not only point estimates but also uncertainty in the clustering structure and patterns within each cluster. An overview of Bayesian cluster analysis is provided, including both model-based and loss-based a...

Descripción completa

Detalles Bibliográficos
Autor principal: Wade, S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10041359/
https://www.ncbi.nlm.nih.gov/pubmed/36970819
http://dx.doi.org/10.1098/rsta.2022.0149
_version_ 1784912699017134080
author Wade, S.
author_facet Wade, S.
author_sort Wade, S.
collection PubMed
description Bayesian cluster analysis offers substantial benefits over algorithmic approaches by providing not only point estimates but also uncertainty in the clustering structure and patterns within each cluster. An overview of Bayesian cluster analysis is provided, including both model-based and loss-based approaches, along with a discussion on the importance of the kernel or loss selected and prior specification. Advantages are demonstrated in an application to cluster cells and discover latent cell types in single-cell RNA sequencing data to study embryonic cellular development. Lastly, we focus on the ongoing debate between finite and infinite mixtures in a model-based approach and robustness to model misspecification. While much of the debate and asymptotic theory focuses on the marginal posterior of the number of clusters, we empirically show that quite a different behaviour is obtained when estimating the full clustering structure. This article is part of the theme issue ‘Bayesian inference: challenges, perspectives, and prospects’.
format Online
Article
Text
id pubmed-10041359
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher The Royal Society
record_format MEDLINE/PubMed
spelling pubmed-100413592023-03-28 Bayesian cluster analysis Wade, S. Philos Trans A Math Phys Eng Sci Articles Bayesian cluster analysis offers substantial benefits over algorithmic approaches by providing not only point estimates but also uncertainty in the clustering structure and patterns within each cluster. An overview of Bayesian cluster analysis is provided, including both model-based and loss-based approaches, along with a discussion on the importance of the kernel or loss selected and prior specification. Advantages are demonstrated in an application to cluster cells and discover latent cell types in single-cell RNA sequencing data to study embryonic cellular development. Lastly, we focus on the ongoing debate between finite and infinite mixtures in a model-based approach and robustness to model misspecification. While much of the debate and asymptotic theory focuses on the marginal posterior of the number of clusters, we empirically show that quite a different behaviour is obtained when estimating the full clustering structure. This article is part of the theme issue ‘Bayesian inference: challenges, perspectives, and prospects’. The Royal Society 2023-05-15 2023-03-27 /pmc/articles/PMC10041359/ /pubmed/36970819 http://dx.doi.org/10.1098/rsta.2022.0149 Text en © 2023 The Authors. https://creativecommons.org/licenses/by/4.0/Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, provided the original author and source are credited.
spellingShingle Articles
Wade, S.
Bayesian cluster analysis
title Bayesian cluster analysis
title_full Bayesian cluster analysis
title_fullStr Bayesian cluster analysis
title_full_unstemmed Bayesian cluster analysis
title_short Bayesian cluster analysis
title_sort bayesian cluster analysis
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10041359/
https://www.ncbi.nlm.nih.gov/pubmed/36970819
http://dx.doi.org/10.1098/rsta.2022.0149
work_keys_str_mv AT wades bayesianclusteranalysis