Cargando…

A New Regression Model for the Analysis of Overdispersed and Zero-Modified Count Data

Count datasets are traditionally analyzed using the ordinary Poisson distribution. However, said model has its applicability limited, as it can be somewhat restrictive to handling specific data structures. In this case, the need arises for obtaining alternative models that accommodate, for example,...

Descripción completa

Detalles Bibliográficos
Autores principales: Bertoli, Wesley, Conceição, Katiane S., Andrade, Marinho G., Louzada, Francisco
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8224290/
https://www.ncbi.nlm.nih.gov/pubmed/34064281
http://dx.doi.org/10.3390/e23060646
_version_ 1783711856292003840
author Bertoli, Wesley
Conceição, Katiane S.
Andrade, Marinho G.
Louzada, Francisco
author_facet Bertoli, Wesley
Conceição, Katiane S.
Andrade, Marinho G.
Louzada, Francisco
author_sort Bertoli, Wesley
collection PubMed
description Count datasets are traditionally analyzed using the ordinary Poisson distribution. However, said model has its applicability limited, as it can be somewhat restrictive to handling specific data structures. In this case, the need arises for obtaining alternative models that accommodate, for example, overdispersion and zero modification (inflation/deflation at the frequency of zeros). In practical terms, these are the most prevalent structures ruling the nature of discrete phenomena nowadays. Hence, this paper’s primary goal was to jointly address these issues by deriving a fixed-effects regression model based on the hurdle version of the Poisson–Sujatha distribution. In this framework, the zero modification is incorporated by considering that a binary probability model determines which outcomes are zero-valued, and a zero-truncated process is responsible for generating positive observations. Posterior inferences for the model parameters were obtained from a fully Bayesian approach based on the g-prior method. Intensive Monte Carlo simulation studies were performed to assess the Bayesian estimators’ empirical properties, and the obtained results have been discussed. The proposed model was considered for analyzing a real dataset, and its competitiveness regarding some well-established fixed-effects models for count data was evaluated. A sensitivity analysis to detect observations that may impact parameter estimates was performed based on standard divergence measures. The Bayesian p-value and the randomized quantile residuals were considered for the task of model validation.
format Online
Article
Text
id pubmed-8224290
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-82242902021-06-25 A New Regression Model for the Analysis of Overdispersed and Zero-Modified Count Data Bertoli, Wesley Conceição, Katiane S. Andrade, Marinho G. Louzada, Francisco Entropy (Basel) Article Count datasets are traditionally analyzed using the ordinary Poisson distribution. However, said model has its applicability limited, as it can be somewhat restrictive to handling specific data structures. In this case, the need arises for obtaining alternative models that accommodate, for example, overdispersion and zero modification (inflation/deflation at the frequency of zeros). In practical terms, these are the most prevalent structures ruling the nature of discrete phenomena nowadays. Hence, this paper’s primary goal was to jointly address these issues by deriving a fixed-effects regression model based on the hurdle version of the Poisson–Sujatha distribution. In this framework, the zero modification is incorporated by considering that a binary probability model determines which outcomes are zero-valued, and a zero-truncated process is responsible for generating positive observations. Posterior inferences for the model parameters were obtained from a fully Bayesian approach based on the g-prior method. Intensive Monte Carlo simulation studies were performed to assess the Bayesian estimators’ empirical properties, and the obtained results have been discussed. The proposed model was considered for analyzing a real dataset, and its competitiveness regarding some well-established fixed-effects models for count data was evaluated. A sensitivity analysis to detect observations that may impact parameter estimates was performed based on standard divergence measures. The Bayesian p-value and the randomized quantile residuals were considered for the task of model validation. MDPI 2021-05-21 /pmc/articles/PMC8224290/ /pubmed/34064281 http://dx.doi.org/10.3390/e23060646 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ).
spellingShingle Article
Bertoli, Wesley
Conceição, Katiane S.
Andrade, Marinho G.
Louzada, Francisco
A New Regression Model for the Analysis of Overdispersed and Zero-Modified Count Data
title A New Regression Model for the Analysis of Overdispersed and Zero-Modified Count Data
title_full A New Regression Model for the Analysis of Overdispersed and Zero-Modified Count Data
title_fullStr A New Regression Model for the Analysis of Overdispersed and Zero-Modified Count Data
title_full_unstemmed A New Regression Model for the Analysis of Overdispersed and Zero-Modified Count Data
title_short A New Regression Model for the Analysis of Overdispersed and Zero-Modified Count Data
title_sort new regression model for the analysis of overdispersed and zero-modified count data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8224290/
https://www.ncbi.nlm.nih.gov/pubmed/34064281
http://dx.doi.org/10.3390/e23060646
work_keys_str_mv AT bertoliwesley anewregressionmodelfortheanalysisofoverdispersedandzeromodifiedcountdata
AT conceicaokatianes anewregressionmodelfortheanalysisofoverdispersedandzeromodifiedcountdata
AT andrademarinhog anewregressionmodelfortheanalysisofoverdispersedandzeromodifiedcountdata
AT louzadafrancisco anewregressionmodelfortheanalysisofoverdispersedandzeromodifiedcountdata
AT bertoliwesley newregressionmodelfortheanalysisofoverdispersedandzeromodifiedcountdata
AT conceicaokatianes newregressionmodelfortheanalysisofoverdispersedandzeromodifiedcountdata
AT andrademarinhog newregressionmodelfortheanalysisofoverdispersedandzeromodifiedcountdata
AT louzadafrancisco newregressionmodelfortheanalysisofoverdispersedandzeromodifiedcountdata