Cargando…

Bayesian estimation of community size and overlap from random subsamples

Counting the number of species, items, or genes that are shared between two groups, sets, or communities is a simple calculation when sampling is complete. However, when only partial samples are available, quantifying the overlap between two communities becomes an estimation problem. Furthermore, to...

Descripción completa

Detalles Bibliográficos
Autores principales: Johnson, Erik K., Larremore, Daniel B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9522272/
https://www.ncbi.nlm.nih.gov/pubmed/36121879
http://dx.doi.org/10.1371/journal.pcbi.1010451
_version_ 1784800027919515648
author Johnson, Erik K.
Larremore, Daniel B.
author_facet Johnson, Erik K.
Larremore, Daniel B.
author_sort Johnson, Erik K.
collection PubMed
description Counting the number of species, items, or genes that are shared between two groups, sets, or communities is a simple calculation when sampling is complete. However, when only partial samples are available, quantifying the overlap between two communities becomes an estimation problem. Furthermore, to calculate normalized measures of β-diversity, such as the Jaccard and Sorenson-Dice indices, one must also estimate the total sizes of the communities being compared. Previous efforts to address these problems have assumed knowledge of total community sizes and then used Bayesian methods to produce unbiased estimates with quantified uncertainty. Here, we address communities of unknown size and show that this produces systematically better estimates—both in terms of central estimates and quantification of uncertainty in those estimates. We further show how to use species, item, or gene count data to refine estimates of community size in a Bayesian joint model of community size and overlap.
format Online
Article
Text
id pubmed-9522272
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-95222722022-09-30 Bayesian estimation of community size and overlap from random subsamples Johnson, Erik K. Larremore, Daniel B. PLoS Comput Biol Research Article Counting the number of species, items, or genes that are shared between two groups, sets, or communities is a simple calculation when sampling is complete. However, when only partial samples are available, quantifying the overlap between two communities becomes an estimation problem. Furthermore, to calculate normalized measures of β-diversity, such as the Jaccard and Sorenson-Dice indices, one must also estimate the total sizes of the communities being compared. Previous efforts to address these problems have assumed knowledge of total community sizes and then used Bayesian methods to produce unbiased estimates with quantified uncertainty. Here, we address communities of unknown size and show that this produces systematically better estimates—both in terms of central estimates and quantification of uncertainty in those estimates. We further show how to use species, item, or gene count data to refine estimates of community size in a Bayesian joint model of community size and overlap. Public Library of Science 2022-09-19 /pmc/articles/PMC9522272/ /pubmed/36121879 http://dx.doi.org/10.1371/journal.pcbi.1010451 Text en © 2022 Johnson, Larremore https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Johnson, Erik K.
Larremore, Daniel B.
Bayesian estimation of community size and overlap from random subsamples
title Bayesian estimation of community size and overlap from random subsamples
title_full Bayesian estimation of community size and overlap from random subsamples
title_fullStr Bayesian estimation of community size and overlap from random subsamples
title_full_unstemmed Bayesian estimation of community size and overlap from random subsamples
title_short Bayesian estimation of community size and overlap from random subsamples
title_sort bayesian estimation of community size and overlap from random subsamples
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9522272/
https://www.ncbi.nlm.nih.gov/pubmed/36121879
http://dx.doi.org/10.1371/journal.pcbi.1010451
work_keys_str_mv AT johnsonerikk bayesianestimationofcommunitysizeandoverlapfromrandomsubsamples
AT larremoredanielb bayesianestimationofcommunitysizeandoverlapfromrandomsubsamples