A more reliable species richness estimator based on the Gamma–Poisson model
BACKGROUND: Accurately estimating the true richness of a target community is still a statistical challenge, particularly in highly diverse communities. Due to sampling limitations or limited resources, undetected species are present in many surveys and observed richness is an underestimate of true r...
Main Author: | Chiu, Chun-Huo |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | PeerJ Inc. 2023 |
Subjects: | Biodiversity |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9828287/ https://www.ncbi.nlm.nih.gov/pubmed/36632143 http://dx.doi.org/10.7717/peerj.14540 |
_version_ | 1784867238036111360 |
---|---|
author | Chiu, Chun-Huo |
collection | PubMed |
description | BACKGROUND: Accurately estimating the true richness of a target community is still a statistical challenge, particularly in highly diverse communities. Due to sampling limitations or limited resources, undetected species are present in many surveys and observed richness is an underestimate of true richness. In the literature, methods for estimating the undetected richness of a sample are generally divided into two categories: parametric and nonparametric estimators. Imposing no assumptions on species detection rates, nonparametric methods demonstrate robust statistical performance and are widely used in ecological studies. However, nonparametric estimators may seriously underestimate richness when species composition has a high degree of heterogeneity. Parametric approaches, which reduce the number of parameters by assuming that species-specific detection probabilities follow a given statistical distribution, use traditional statistical inference to calculate species richness estimates. When species detection rates meet the model assumption, the parametric approach could supply a nearly unbiased estimator. However, the infeasibility and inefficiency of solving maximum likelihood functions limit the application of parametric methods in ecological studies when the model assumption is violated, or the collected data is sparse. METHOD: To overcome these estimating challenges associated with parametric methods, an estimator employing the moment estimation method instead of the maximum likelihood estimation method is proposed to estimate parameters based on a Gamma-Poisson mixture model. Drawing on the concept of the Good-Turing frequency formula, the proposed estimator only uses the number of singletons, doubletons, and tripletons in a sample for undetected richness estimation. RESULTS: The statistical behavior of the new estimator was evaluated by using real and simulated data sets from various species abundance models. Simulation results indicated that the new estimator reduces the bias presented in traditional nonparametric estimators, presents more robust statistical behavior compared to other parametric estimators, and provides confidence intervals with better coverage among the discussed estimators, especially in assemblages with high species composition heterogeneity. |
format | Online Article Text |
id | pubmed-9828287 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-9828287 2023-01-10 A more reliable species richness estimator based on the Gamma–Poisson model Chiu, Chun-Huo PeerJ Biodiversity PeerJ Inc. 2023-01-06 /pmc/articles/PMC9828287/ /pubmed/36632143 http://dx.doi.org/10.7717/peerj.14540 Text en ©2023 Chiu https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. |
title | A more reliable species richness estimator based on the Gamma–Poisson model |
topic | Biodiversity |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9828287/ https://www.ncbi.nlm.nih.gov/pubmed/36632143 http://dx.doi.org/10.7717/peerj.14540 |
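
The description above says the proposed estimator uses only the number of singletons, doubletons, and tripletons in a sample. This record does not give the estimator's formula, so the sketch below does not attempt to reproduce it. It only shows, under stated assumptions, how those frequency counts could be tallied from a vector of per-species abundances, and computes the classical Chao1 estimator as an example of the traditional nonparametric estimators the description refers to. The function names `frequency_counts` and `chao1` and the sample data are hypothetical.

```python
from collections import Counter


def frequency_counts(abundances):
    """Return a Counter mapping k to the number of species observed exactly k times.

    Species with zero individuals in the sample are undetected and are ignored.
    """
    return Counter(a for a in abundances if a > 0)


def chao1(abundances):
    """Classical Chao1 richness estimate (a nonparametric baseline).

    NOTE: this is NOT the Gamma-Poisson moment estimator described in the
    record; it is included only as a familiar point of comparison.
    """
    f = frequency_counts(abundances)
    s_obs = sum(f.values())          # observed richness
    f1, f2 = f.get(1, 0), f.get(2, 0)
    if f2 > 0:
        undetected = f1 * f1 / (2 * f2)
    else:
        # Bias-corrected form used when there are no doubletons.
        undetected = f1 * (f1 - 1) / 2
    return s_obs + undetected


# Hypothetical sample: one entry per species, value = individuals observed.
sample = [1, 1, 1, 1, 2, 2, 3, 5, 10, 0]
counts = frequency_counts(sample)
singletons, doubletons, tripletons = counts[1], counts[2], counts[3]
estimate = chao1(sample)
```

Here `singletons`, `doubletons`, and `tripletons` are the three inputs the description names; how the article combines them is left to the full text at the DOI listed in this record.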