Cargando…

Assessing the Nationwide COVID-19 Risk in Mexico through the Lens of Comorbidity by an XGBoost-Based Logistic Regression Model

The outbreak of the new COVID-19 disease is a serious health problem that has affected a large part of the world population, especially older adults and people who suffer from a previous comorbidity. In this work, we proposed a classifier model that allows for deciding whether or not a patient might...

Descripción completa

Detalles Bibliográficos
Autores principales: Venancio-Guzmán, Sonia, Aguirre-Salado, Alejandro Ivan, Soubervielle-Montalvo, Carlos, Jiménez-Hernández, José del Carmen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9565716/
https://www.ncbi.nlm.nih.gov/pubmed/36231290
http://dx.doi.org/10.3390/ijerph191911992
_version_ 1784808958705270784
author Venancio-Guzmán, Sonia
Aguirre-Salado, Alejandro Ivan
Soubervielle-Montalvo, Carlos
Jiménez-Hernández, José del Carmen
author_facet Venancio-Guzmán, Sonia
Aguirre-Salado, Alejandro Ivan
Soubervielle-Montalvo, Carlos
Jiménez-Hernández, José del Carmen
author_sort Venancio-Guzmán, Sonia
collection PubMed
description The outbreak of the new COVID-19 disease is a serious health problem that has affected a large part of the world population, especially older adults and people who suffer from a previous comorbidity. In this work, we proposed a classifier model that allows for deciding whether or not a patient might suffer from the COVID-19 disease, considering spatio-temporal variables, physical characteristics of the patients and the presence of previous diseases. We used XGBoost to maximize the likelihood function of the multivariate logistic regression model. The estimated and observed values of percentage occurrence of cases were very similar, and indicated that the proposed model was suitable to predict new cases (AUC = [Formula: see text]). The main results revealed that patients without comorbidities are less likely to be COVID-19 positive, unlike people with diabetes, obesity and pneumonia. The distribution function by age group showed that, during the first and second wave of COVID-19, young people aged [Formula: see text] were the least affected by the pandemic, while the most affected were people between 20 and 40 years, followed by adults older than 40 years. In the case of the third and fourth wave, there was an increased risk for young individuals (under 20 years), while older adults over 40 years decreased their chances of infection. Estimates of positive COVID cases with both the XGBoost-LR model and the multivariate logistic regression model were used to create maps to visualize the spatial distribution of positive cases across the country. Spatial analysis was carried out to determine, through the data, the main geographical areas where a greater number of positive cases occurred. The results showed that the areas most affected by COVID-19 were in the central and northern regions of Mexico.
format Online
Article
Text
id pubmed-9565716
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-95657162022-10-15 Assessing the Nationwide COVID-19 Risk in Mexico through the Lens of Comorbidity by an XGBoost-Based Logistic Regression Model Venancio-Guzmán, Sonia Aguirre-Salado, Alejandro Ivan Soubervielle-Montalvo, Carlos Jiménez-Hernández, José del Carmen Int J Environ Res Public Health Article The outbreak of the new COVID-19 disease is a serious health problem that has affected a large part of the world population, especially older adults and people who suffer from a previous comorbidity. In this work, we proposed a classifier model that allows for deciding whether or not a patient might suffer from the COVID-19 disease, considering spatio-temporal variables, physical characteristics of the patients and the presence of previous diseases. We used XGBoost to maximize the likelihood function of the multivariate logistic regression model. The estimated and observed values of percentage occurrence of cases were very similar, and indicated that the proposed model was suitable to predict new cases (AUC = [Formula: see text]). The main results revealed that patients without comorbidities are less likely to be COVID-19 positive, unlike people with diabetes, obesity and pneumonia. The distribution function by age group showed that, during the first and second wave of COVID-19, young people aged [Formula: see text] were the least affected by the pandemic, while the most affected were people between 20 and 40 years, followed by adults older than 40 years. In the case of the third and fourth wave, there was an increased risk for young individuals (under 20 years), while older adults over 40 years decreased their chances of infection. Estimates of positive COVID cases with both the XGBoost-LR model and the multivariate logistic regression model were used to create maps to visualize the spatial distribution of positive cases across the country. Spatial analysis was carried out to determine, through the data, the main geographical areas where a greater number of positive cases occurred. The results showed that the areas most affected by COVID-19 were in the central and northern regions of Mexico. MDPI 2022-09-22 /pmc/articles/PMC9565716/ /pubmed/36231290 http://dx.doi.org/10.3390/ijerph191911992 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Venancio-Guzmán, Sonia
Aguirre-Salado, Alejandro Ivan
Soubervielle-Montalvo, Carlos
Jiménez-Hernández, José del Carmen
Assessing the Nationwide COVID-19 Risk in Mexico through the Lens of Comorbidity by an XGBoost-Based Logistic Regression Model
title Assessing the Nationwide COVID-19 Risk in Mexico through the Lens of Comorbidity by an XGBoost-Based Logistic Regression Model
title_full Assessing the Nationwide COVID-19 Risk in Mexico through the Lens of Comorbidity by an XGBoost-Based Logistic Regression Model
title_fullStr Assessing the Nationwide COVID-19 Risk in Mexico through the Lens of Comorbidity by an XGBoost-Based Logistic Regression Model
title_full_unstemmed Assessing the Nationwide COVID-19 Risk in Mexico through the Lens of Comorbidity by an XGBoost-Based Logistic Regression Model
title_short Assessing the Nationwide COVID-19 Risk in Mexico through the Lens of Comorbidity by an XGBoost-Based Logistic Regression Model
title_sort assessing the nationwide covid-19 risk in mexico through the lens of comorbidity by an xgboost-based logistic regression model
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9565716/
https://www.ncbi.nlm.nih.gov/pubmed/36231290
http://dx.doi.org/10.3390/ijerph191911992
work_keys_str_mv AT venancioguzmansonia assessingthenationwidecovid19riskinmexicothroughthelensofcomorbiditybyanxgboostbasedlogisticregressionmodel
AT aguirresaladoalejandroivan assessingthenationwidecovid19riskinmexicothroughthelensofcomorbiditybyanxgboostbasedlogisticregressionmodel
AT souberviellemontalvocarlos assessingthenationwidecovid19riskinmexicothroughthelensofcomorbiditybyanxgboostbasedlogisticregressionmodel
AT jimenezhernandezjosedelcarmen assessingthenationwidecovid19riskinmexicothroughthelensofcomorbiditybyanxgboostbasedlogisticregressionmodel