Cargando…
Improving performance of hurdle models using rare-event weighted logistic regression: an application to maternal mortality data
In this paper, performance of hurdle models in rare events data is improved by modifying their binary component. The rare-event weighted logistic regression model is adopted in place of logistic regression to deal with class imbalance due to rare events. Poisson Hurdle Rare Event Weighted Logistic R...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Royal Society
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10445027/ https://www.ncbi.nlm.nih.gov/pubmed/37621657 http://dx.doi.org/10.1098/rsos.221226 |
_version_ | 1785094084891770880 |
---|---|
author | Awuor Okello, Sharon Otieno Omondi, Evans Odhiambo, Collins O. |
author_facet | Awuor Okello, Sharon Otieno Omondi, Evans Odhiambo, Collins O. |
author_sort | Awuor Okello, Sharon |
collection | PubMed |
description | In this paper, performance of hurdle models in rare events data is improved by modifying their binary component. The rare-event weighted logistic regression model is adopted in place of logistic regression to deal with class imbalance due to rare events. Poisson Hurdle Rare Event Weighted Logistic Regression (REWLR) and Negative Binomial Hurdle (NBH) REWLR are developed as two-part models which use the REWLR model to estimate the probability of a positive count and a Poisson or NB zero-truncated count model to estimate non-zero counts. This research aimed to develop and assess the performance of the Poisson and Negative Binomial (NB) Hurdle Rare Event Weighted Logistic Regression (REWLR) models, applied to simulated data with various degrees of zero inflation and to Nairobi county’s maternal mortality data. The study data on maternal mortality were pulled from JPHES. The data contain the number of maternal deaths, which is the outcome variable, and other obstetric and demographic factors recorded in MNCH facilities in Nairobi between October 2021 and January 2022. The models were also fit and evaluated based on simulated data with varying degrees of zero inflation. The obtained results are numerically validated and then discussed from both the mathematical and the maternal mortality perspective. Numerical simulations are also presented to give a more complete representation of the model dynamics. Results obtained suggest that NB Hurdle REWLR is the best performing model for zero inflated count data due to rare events. |
format | Online Article Text |
id | pubmed-10445027 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | The Royal Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-104450272023-08-24 Improving performance of hurdle models using rare-event weighted logistic regression: an application to maternal mortality data Awuor Okello, Sharon Otieno Omondi, Evans Odhiambo, Collins O. R Soc Open Sci Mathematics In this paper, performance of hurdle models in rare events data is improved by modifying their binary component. The rare-event weighted logistic regression model is adopted in place of logistic regression to deal with class imbalance due to rare events. Poisson Hurdle Rare Event Weighted Logistic Regression (REWLR) and Negative Binomial Hurdle (NBH) REWLR are developed as two-part models which use the REWLR model to estimate the probability of a positive count and a Poisson or NB zero-truncated count model to estimate non-zero counts. This research aimed to develop and assess the performance of the Poisson and Negative Binomial (NB) Hurdle Rare Event Weighted Logistic Regression (REWLR) models, applied to simulated data with various degrees of zero inflation and to Nairobi county’s maternal mortality data. The study data on maternal mortality were pulled from JPHES. The data contain the number of maternal deaths, which is the outcome variable, and other obstetric and demographic factors recorded in MNCH facilities in Nairobi between October 2021 and January 2022. The models were also fit and evaluated based on simulated data with varying degrees of zero inflation. The obtained results are numerically validated and then discussed from both the mathematical and the maternal mortality perspective. Numerical simulations are also presented to give a more complete representation of the model dynamics. Results obtained suggest that NB Hurdle REWLR is the best performing model for zero inflated count data due to rare events. The Royal Society 2023-08-23 /pmc/articles/PMC10445027/ /pubmed/37621657 http://dx.doi.org/10.1098/rsos.221226 Text en © 2023 The Authors. https://creativecommons.org/licenses/by/4.0/Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, provided the original author and source are credited. |
spellingShingle | Mathematics Awuor Okello, Sharon Otieno Omondi, Evans Odhiambo, Collins O. Improving performance of hurdle models using rare-event weighted logistic regression: an application to maternal mortality data |
title | Improving performance of hurdle models using rare-event weighted logistic regression: an application to maternal mortality data |
title_full | Improving performance of hurdle models using rare-event weighted logistic regression: an application to maternal mortality data |
title_fullStr | Improving performance of hurdle models using rare-event weighted logistic regression: an application to maternal mortality data |
title_full_unstemmed | Improving performance of hurdle models using rare-event weighted logistic regression: an application to maternal mortality data |
title_short | Improving performance of hurdle models using rare-event weighted logistic regression: an application to maternal mortality data |
title_sort | improving performance of hurdle models using rare-event weighted logistic regression: an application to maternal mortality data |
topic | Mathematics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10445027/ https://www.ncbi.nlm.nih.gov/pubmed/37621657 http://dx.doi.org/10.1098/rsos.221226 |
work_keys_str_mv | AT awuorokellosharon improvingperformanceofhurdlemodelsusingrareeventweightedlogisticregressionanapplicationtomaternalmortalitydata AT otienoomondievans improvingperformanceofhurdlemodelsusingrareeventweightedlogisticregressionanapplicationtomaternalmortalitydata AT odhiambocollinso improvingperformanceofhurdlemodelsusingrareeventweightedlogisticregressionanapplicationtomaternalmortalitydata |