Cargando…

Tuning parameter selection for a penalized estimator of species richness

Our goal is to estimate the true number of classes in a population, called the species richness. We consider the case where multiple frequency count tables have been collected from a homogeneous population and investigate a penalized maximum likelihood estimator under a negative binomial model. Beca...

Descripción completa

Detalles Bibliográficos
Autores principales: Paynter, Alex, Willis, Amy D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Taylor & Francis 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8098713/
https://www.ncbi.nlm.nih.gov/pubmed/33967371
http://dx.doi.org/10.1080/02664763.2020.1754359
Descripción
Sumario:Our goal is to estimate the true number of classes in a population, called the species richness. We consider the case where multiple frequency count tables have been collected from a homogeneous population and investigate a penalized maximum likelihood estimator under a negative binomial model. Because high probabilities of unobserved classes increase the variance of species richness estimates, our method penalizes the probability of a class being unobserved. Tuning the penalization parameter is challenging because the true species richness is never known, and so we propose and validate four novel methods for tuning the penalization parameter. We illustrate and contrast the performance of the proposed methods by estimating the strain-level microbial diversity of Lake Champlain over three consecutive years, and global human host-associated species-level microbial richness.