Cargando…
A Novel Phylogenetic Negative Binomial Regression Model for Count-Dependent Variables
SIMPLE SUMMARY: Researchers identified a challenge in analyzing count-dependent variables in species related through a shared ancestry using traditional regression models, as these models often overlook the inherent interdependence from common lineage. To address this, a new phylogenetic negative bi...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10452298/ https://www.ncbi.nlm.nih.gov/pubmed/37627032 http://dx.doi.org/10.3390/biology12081148 |
_version_ | 1785095634203705344 |
---|---|
author | Jhwueng, Dwueng-Chwuan Wu, Chi-Yu |
author_facet | Jhwueng, Dwueng-Chwuan Wu, Chi-Yu |
author_sort | Jhwueng, Dwueng-Chwuan |
collection | PubMed |
description | SIMPLE SUMMARY: Researchers identified a challenge in analyzing count-dependent variables in species related through a shared ancestry using traditional regression models, as these models often overlook the inherent interdependence from common lineage. To address this, a new phylogenetic negative binomial regression model was developed that recognizes this lineage dependence and allows for overdispersion, surpassing the limitations of the conventional generalized linear models (GLMs). Using the generalized estimating equation (GEE) framework, this model offers precise parameter estimation. This innovation offers a more accurate analysis tool for understanding species data, emphasizing the influence of shared ancestry and promises enhanced research methodologies, bringing valuable perspectives to the fields of evolutionary biology and ecology. ABSTRACT: Regression models are extensively used to explore the relationship between a dependent variable and its covariates. These models work well when the dependent variable is categorical and the data are supposedly independent, as is the case with generalized linear models (GLMs). However, trait data from related species do not operate under these conditions due to their shared common ancestry, leading to dependence that can be illustrated through a phylogenetic tree. In response to the analytical challenges of count-dependent variables in phylogenetically related species, we have developed a novel phylogenetic negative binomial regression model that allows for overdispersion, a limitation present in the phylogenetic Poisson regression model in the literature. This model overcomes limitations of conventional GLMs, which overlook the inherent dependence arising from shared lineage. Instead, our proposed model acknowledges this factor and uses the generalized estimating equation (GEE) framework for precise parameter estimation. The effectiveness of the proposed model was corroborated by a rigorous simulation study, which, despite the need for careful convergence monitoring, demonstrated its reasonable efficacy. The empirical application of the model to lizard egg-laying count and mammalian litter size data further highlighted its practical relevance. In particular, our results identified negative correlations between increases in egg mass, litter size, ovulation rate, and gestation length with respective yearly counts, while a positive correlation was observed with species lifespan. This study underscores the importance of our proposed model in providing nuanced and accurate analyses of count-dependent variables in related species, highlighting the often overlooked impact of shared ancestry. The model represents a critical advance in research methodologies, opening new avenues for interpretation of related species data in the field. |
format | Online Article Text |
id | pubmed-10452298 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-104522982023-08-26 A Novel Phylogenetic Negative Binomial Regression Model for Count-Dependent Variables Jhwueng, Dwueng-Chwuan Wu, Chi-Yu Biology (Basel) Article SIMPLE SUMMARY: Researchers identified a challenge in analyzing count-dependent variables in species related through a shared ancestry using traditional regression models, as these models often overlook the inherent interdependence from common lineage. To address this, a new phylogenetic negative binomial regression model was developed that recognizes this lineage dependence and allows for overdispersion, surpassing the limitations of the conventional generalized linear models (GLMs). Using the generalized estimating equation (GEE) framework, this model offers precise parameter estimation. This innovation offers a more accurate analysis tool for understanding species data, emphasizing the influence of shared ancestry and promises enhanced research methodologies, bringing valuable perspectives to the fields of evolutionary biology and ecology. ABSTRACT: Regression models are extensively used to explore the relationship between a dependent variable and its covariates. These models work well when the dependent variable is categorical and the data are supposedly independent, as is the case with generalized linear models (GLMs). However, trait data from related species do not operate under these conditions due to their shared common ancestry, leading to dependence that can be illustrated through a phylogenetic tree. In response to the analytical challenges of count-dependent variables in phylogenetically related species, we have developed a novel phylogenetic negative binomial regression model that allows for overdispersion, a limitation present in the phylogenetic Poisson regression model in the literature. This model overcomes limitations of conventional GLMs, which overlook the inherent dependence arising from shared lineage. Instead, our proposed model acknowledges this factor and uses the generalized estimating equation (GEE) framework for precise parameter estimation. The effectiveness of the proposed model was corroborated by a rigorous simulation study, which, despite the need for careful convergence monitoring, demonstrated its reasonable efficacy. The empirical application of the model to lizard egg-laying count and mammalian litter size data further highlighted its practical relevance. In particular, our results identified negative correlations between increases in egg mass, litter size, ovulation rate, and gestation length with respective yearly counts, while a positive correlation was observed with species lifespan. This study underscores the importance of our proposed model in providing nuanced and accurate analyses of count-dependent variables in related species, highlighting the often overlooked impact of shared ancestry. The model represents a critical advance in research methodologies, opening new avenues for interpretation of related species data in the field. MDPI 2023-08-19 /pmc/articles/PMC10452298/ /pubmed/37627032 http://dx.doi.org/10.3390/biology12081148 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Jhwueng, Dwueng-Chwuan Wu, Chi-Yu A Novel Phylogenetic Negative Binomial Regression Model for Count-Dependent Variables |
title | A Novel Phylogenetic Negative Binomial Regression Model for Count-Dependent Variables |
title_full | A Novel Phylogenetic Negative Binomial Regression Model for Count-Dependent Variables |
title_fullStr | A Novel Phylogenetic Negative Binomial Regression Model for Count-Dependent Variables |
title_full_unstemmed | A Novel Phylogenetic Negative Binomial Regression Model for Count-Dependent Variables |
title_short | A Novel Phylogenetic Negative Binomial Regression Model for Count-Dependent Variables |
title_sort | novel phylogenetic negative binomial regression model for count-dependent variables |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10452298/ https://www.ncbi.nlm.nih.gov/pubmed/37627032 http://dx.doi.org/10.3390/biology12081148 |
work_keys_str_mv | AT jhwuengdwuengchwuan anovelphylogeneticnegativebinomialregressionmodelforcountdependentvariables AT wuchiyu anovelphylogeneticnegativebinomialregressionmodelforcountdependentvariables AT jhwuengdwuengchwuan novelphylogeneticnegativebinomialregressionmodelforcountdependentvariables AT wuchiyu novelphylogeneticnegativebinomialregressionmodelforcountdependentvariables |