Cargando…

A Novel Phylogenetic Negative Binomial Regression Model for Count-Dependent Variables

SIMPLE SUMMARY: Researchers identified a challenge in analyzing count-dependent variables in species related through a shared ancestry using traditional regression models, as these models often overlook the inherent interdependence from common lineage. To address this, a new phylogenetic negative bi...

Descripción completa

Detalles Bibliográficos
Autores principales: Jhwueng, Dwueng-Chwuan, Wu, Chi-Yu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10452298/
https://www.ncbi.nlm.nih.gov/pubmed/37627032
http://dx.doi.org/10.3390/biology12081148
_version_ 1785095634203705344
author Jhwueng, Dwueng-Chwuan
Wu, Chi-Yu
author_facet Jhwueng, Dwueng-Chwuan
Wu, Chi-Yu
author_sort Jhwueng, Dwueng-Chwuan
collection PubMed
description SIMPLE SUMMARY: Researchers identified a challenge in analyzing count-dependent variables in species related through a shared ancestry using traditional regression models, as these models often overlook the inherent interdependence from common lineage. To address this, a new phylogenetic negative binomial regression model was developed that recognizes this lineage dependence and allows for overdispersion, surpassing the limitations of the conventional generalized linear models (GLMs). Using the generalized estimating equation (GEE) framework, this model offers precise parameter estimation. This innovation offers a more accurate analysis tool for understanding species data, emphasizing the influence of shared ancestry and promises enhanced research methodologies, bringing valuable perspectives to the fields of evolutionary biology and ecology. ABSTRACT: Regression models are extensively used to explore the relationship between a dependent variable and its covariates. These models work well when the dependent variable is categorical and the data are supposedly independent, as is the case with generalized linear models (GLMs). However, trait data from related species do not operate under these conditions due to their shared common ancestry, leading to dependence that can be illustrated through a phylogenetic tree. In response to the analytical challenges of count-dependent variables in phylogenetically related species, we have developed a novel phylogenetic negative binomial regression model that allows for overdispersion, a limitation present in the phylogenetic Poisson regression model in the literature. This model overcomes limitations of conventional GLMs, which overlook the inherent dependence arising from shared lineage. Instead, our proposed model acknowledges this factor and uses the generalized estimating equation (GEE) framework for precise parameter estimation. The effectiveness of the proposed model was corroborated by a rigorous simulation study, which, despite the need for careful convergence monitoring, demonstrated its reasonable efficacy. The empirical application of the model to lizard egg-laying count and mammalian litter size data further highlighted its practical relevance. In particular, our results identified negative correlations between increases in egg mass, litter size, ovulation rate, and gestation length with respective yearly counts, while a positive correlation was observed with species lifespan. This study underscores the importance of our proposed model in providing nuanced and accurate analyses of count-dependent variables in related species, highlighting the often overlooked impact of shared ancestry. The model represents a critical advance in research methodologies, opening new avenues for interpretation of related species data in the field.
format Online
Article
Text
id pubmed-10452298
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-104522982023-08-26 A Novel Phylogenetic Negative Binomial Regression Model for Count-Dependent Variables Jhwueng, Dwueng-Chwuan Wu, Chi-Yu Biology (Basel) Article SIMPLE SUMMARY: Researchers identified a challenge in analyzing count-dependent variables in species related through a shared ancestry using traditional regression models, as these models often overlook the inherent interdependence from common lineage. To address this, a new phylogenetic negative binomial regression model was developed that recognizes this lineage dependence and allows for overdispersion, surpassing the limitations of the conventional generalized linear models (GLMs). Using the generalized estimating equation (GEE) framework, this model offers precise parameter estimation. This innovation offers a more accurate analysis tool for understanding species data, emphasizing the influence of shared ancestry and promises enhanced research methodologies, bringing valuable perspectives to the fields of evolutionary biology and ecology. ABSTRACT: Regression models are extensively used to explore the relationship between a dependent variable and its covariates. These models work well when the dependent variable is categorical and the data are supposedly independent, as is the case with generalized linear models (GLMs). However, trait data from related species do not operate under these conditions due to their shared common ancestry, leading to dependence that can be illustrated through a phylogenetic tree. In response to the analytical challenges of count-dependent variables in phylogenetically related species, we have developed a novel phylogenetic negative binomial regression model that allows for overdispersion, a limitation present in the phylogenetic Poisson regression model in the literature. This model overcomes limitations of conventional GLMs, which overlook the inherent dependence arising from shared lineage. Instead, our proposed model acknowledges this factor and uses the generalized estimating equation (GEE) framework for precise parameter estimation. The effectiveness of the proposed model was corroborated by a rigorous simulation study, which, despite the need for careful convergence monitoring, demonstrated its reasonable efficacy. The empirical application of the model to lizard egg-laying count and mammalian litter size data further highlighted its practical relevance. In particular, our results identified negative correlations between increases in egg mass, litter size, ovulation rate, and gestation length with respective yearly counts, while a positive correlation was observed with species lifespan. This study underscores the importance of our proposed model in providing nuanced and accurate analyses of count-dependent variables in related species, highlighting the often overlooked impact of shared ancestry. The model represents a critical advance in research methodologies, opening new avenues for interpretation of related species data in the field. MDPI 2023-08-19 /pmc/articles/PMC10452298/ /pubmed/37627032 http://dx.doi.org/10.3390/biology12081148 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Jhwueng, Dwueng-Chwuan
Wu, Chi-Yu
A Novel Phylogenetic Negative Binomial Regression Model for Count-Dependent Variables
title A Novel Phylogenetic Negative Binomial Regression Model for Count-Dependent Variables
title_full A Novel Phylogenetic Negative Binomial Regression Model for Count-Dependent Variables
title_fullStr A Novel Phylogenetic Negative Binomial Regression Model for Count-Dependent Variables
title_full_unstemmed A Novel Phylogenetic Negative Binomial Regression Model for Count-Dependent Variables
title_short A Novel Phylogenetic Negative Binomial Regression Model for Count-Dependent Variables
title_sort novel phylogenetic negative binomial regression model for count-dependent variables
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10452298/
https://www.ncbi.nlm.nih.gov/pubmed/37627032
http://dx.doi.org/10.3390/biology12081148
work_keys_str_mv AT jhwuengdwuengchwuan anovelphylogeneticnegativebinomialregressionmodelforcountdependentvariables
AT wuchiyu anovelphylogeneticnegativebinomialregressionmodelforcountdependentvariables
AT jhwuengdwuengchwuan novelphylogeneticnegativebinomialregressionmodelforcountdependentvariables
AT wuchiyu novelphylogeneticnegativebinomialregressionmodelforcountdependentvariables