Cargando…

Tuning parameter selection for a penalized estimator of species richness

Our goal is to estimate the true number of classes in a population, called the species richness. We consider the case where multiple frequency count tables have been collected from a homogeneous population and investigate a penalized maximum likelihood estimator under a negative binomial model. Beca...

Descripción completa

Detalles Bibliográficos
Autores principales: Paynter, Alex, Willis, Amy D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Taylor & Francis 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8098713/
https://www.ncbi.nlm.nih.gov/pubmed/33967371
http://dx.doi.org/10.1080/02664763.2020.1754359
_version_ 1783688462774304768
author Paynter, Alex
Willis, Amy D.
author_facet Paynter, Alex
Willis, Amy D.
author_sort Paynter, Alex
collection PubMed
description Our goal is to estimate the true number of classes in a population, called the species richness. We consider the case where multiple frequency count tables have been collected from a homogeneous population and investigate a penalized maximum likelihood estimator under a negative binomial model. Because high probabilities of unobserved classes increase the variance of species richness estimates, our method penalizes the probability of a class being unobserved. Tuning the penalization parameter is challenging because the true species richness is never known, and so we propose and validate four novel methods for tuning the penalization parameter. We illustrate and contrast the performance of the proposed methods by estimating the strain-level microbial diversity of Lake Champlain over three consecutive years, and global human host-associated species-level microbial richness.
format Online
Article
Text
id pubmed-8098713
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Taylor & Francis
record_format MEDLINE/PubMed
spelling pubmed-80987132022-01-01 Tuning parameter selection for a penalized estimator of species richness Paynter, Alex Willis, Amy D. J Appl Stat Articles Our goal is to estimate the true number of classes in a population, called the species richness. We consider the case where multiple frequency count tables have been collected from a homogeneous population and investigate a penalized maximum likelihood estimator under a negative binomial model. Because high probabilities of unobserved classes increase the variance of species richness estimates, our method penalizes the probability of a class being unobserved. Tuning the penalization parameter is challenging because the true species richness is never known, and so we propose and validate four novel methods for tuning the penalization parameter. We illustrate and contrast the performance of the proposed methods by estimating the strain-level microbial diversity of Lake Champlain over three consecutive years, and global human host-associated species-level microbial richness. Taylor & Francis 2020-04-19 /pmc/articles/PMC8098713/ /pubmed/33967371 http://dx.doi.org/10.1080/02664763.2020.1754359 Text en © 2020 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group https://creativecommons.org/licenses/by-nc-nd/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives License (http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) ), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited, and is not altered, transformed, or built upon in any way.
spellingShingle Articles
Paynter, Alex
Willis, Amy D.
Tuning parameter selection for a penalized estimator of species richness
title Tuning parameter selection for a penalized estimator of species richness
title_full Tuning parameter selection for a penalized estimator of species richness
title_fullStr Tuning parameter selection for a penalized estimator of species richness
title_full_unstemmed Tuning parameter selection for a penalized estimator of species richness
title_short Tuning parameter selection for a penalized estimator of species richness
title_sort tuning parameter selection for a penalized estimator of species richness
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8098713/
https://www.ncbi.nlm.nih.gov/pubmed/33967371
http://dx.doi.org/10.1080/02664763.2020.1754359
work_keys_str_mv AT paynteralex tuningparameterselectionforapenalizedestimatorofspeciesrichness
AT willisamyd tuningparameterselectionforapenalizedestimatorofspeciesrichness