Cargando…

Improving performance of hurdle models using rare-event weighted logistic regression: an application to maternal mortality data

In this paper, performance of hurdle models in rare events data is improved by modifying their binary component. The rare-event weighted logistic regression model is adopted in place of logistic regression to deal with class imbalance due to rare events. Poisson Hurdle Rare Event Weighted Logistic R...

Descripción completa

Detalles Bibliográficos
Autores principales: Awuor Okello, Sharon, Otieno Omondi, Evans, Odhiambo, Collins O.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10445027/
https://www.ncbi.nlm.nih.gov/pubmed/37621657
http://dx.doi.org/10.1098/rsos.221226
_version_ 1785094084891770880
author Awuor Okello, Sharon
Otieno Omondi, Evans
Odhiambo, Collins O.
author_facet Awuor Okello, Sharon
Otieno Omondi, Evans
Odhiambo, Collins O.
author_sort Awuor Okello, Sharon
collection PubMed
description In this paper, performance of hurdle models in rare events data is improved by modifying their binary component. The rare-event weighted logistic regression model is adopted in place of logistic regression to deal with class imbalance due to rare events. Poisson Hurdle Rare Event Weighted Logistic Regression (REWLR) and Negative Binomial Hurdle (NBH) REWLR are developed as two-part models which use the REWLR model to estimate the probability of a positive count and a Poisson or NB zero-truncated count model to estimate non-zero counts. This research aimed to develop and assess the performance of the Poisson and Negative Binomial (NB) Hurdle Rare Event Weighted Logistic Regression (REWLR) models, applied to simulated data with various degrees of zero inflation and to Nairobi county’s maternal mortality data. The study data on maternal mortality were pulled from JPHES. The data contain the number of maternal deaths, which is the outcome variable, and other obstetric and demographic factors recorded in MNCH facilities in Nairobi between October 2021 and January 2022. The models were also fit and evaluated based on simulated data with varying degrees of zero inflation. The obtained results are numerically validated and then discussed from both the mathematical and the maternal mortality perspective. Numerical simulations are also presented to give a more complete representation of the model dynamics. Results obtained suggest that NB Hurdle REWLR is the best performing model for zero inflated count data due to rare events.
format Online
Article
Text
id pubmed-10445027
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher The Royal Society
record_format MEDLINE/PubMed
spelling pubmed-104450272023-08-24 Improving performance of hurdle models using rare-event weighted logistic regression: an application to maternal mortality data Awuor Okello, Sharon Otieno Omondi, Evans Odhiambo, Collins O. R Soc Open Sci Mathematics In this paper, performance of hurdle models in rare events data is improved by modifying their binary component. The rare-event weighted logistic regression model is adopted in place of logistic regression to deal with class imbalance due to rare events. Poisson Hurdle Rare Event Weighted Logistic Regression (REWLR) and Negative Binomial Hurdle (NBH) REWLR are developed as two-part models which use the REWLR model to estimate the probability of a positive count and a Poisson or NB zero-truncated count model to estimate non-zero counts. This research aimed to develop and assess the performance of the Poisson and Negative Binomial (NB) Hurdle Rare Event Weighted Logistic Regression (REWLR) models, applied to simulated data with various degrees of zero inflation and to Nairobi county’s maternal mortality data. The study data on maternal mortality were pulled from JPHES. The data contain the number of maternal deaths, which is the outcome variable, and other obstetric and demographic factors recorded in MNCH facilities in Nairobi between October 2021 and January 2022. The models were also fit and evaluated based on simulated data with varying degrees of zero inflation. The obtained results are numerically validated and then discussed from both the mathematical and the maternal mortality perspective. Numerical simulations are also presented to give a more complete representation of the model dynamics. Results obtained suggest that NB Hurdle REWLR is the best performing model for zero inflated count data due to rare events. The Royal Society 2023-08-23 /pmc/articles/PMC10445027/ /pubmed/37621657 http://dx.doi.org/10.1098/rsos.221226 Text en © 2023 The Authors. https://creativecommons.org/licenses/by/4.0/Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, provided the original author and source are credited.
spellingShingle Mathematics
Awuor Okello, Sharon
Otieno Omondi, Evans
Odhiambo, Collins O.
Improving performance of hurdle models using rare-event weighted logistic regression: an application to maternal mortality data
title Improving performance of hurdle models using rare-event weighted logistic regression: an application to maternal mortality data
title_full Improving performance of hurdle models using rare-event weighted logistic regression: an application to maternal mortality data
title_fullStr Improving performance of hurdle models using rare-event weighted logistic regression: an application to maternal mortality data
title_full_unstemmed Improving performance of hurdle models using rare-event weighted logistic regression: an application to maternal mortality data
title_short Improving performance of hurdle models using rare-event weighted logistic regression: an application to maternal mortality data
title_sort improving performance of hurdle models using rare-event weighted logistic regression: an application to maternal mortality data
topic Mathematics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10445027/
https://www.ncbi.nlm.nih.gov/pubmed/37621657
http://dx.doi.org/10.1098/rsos.221226
work_keys_str_mv AT awuorokellosharon improvingperformanceofhurdlemodelsusingrareeventweightedlogisticregressionanapplicationtomaternalmortalitydata
AT otienoomondievans improvingperformanceofhurdlemodelsusingrareeventweightedlogisticregressionanapplicationtomaternalmortalitydata
AT odhiambocollinso improvingperformanceofhurdlemodelsusingrareeventweightedlogisticregressionanapplicationtomaternalmortalitydata