Cargando…
Multidimensional Machine Learning Model to Calculate a COVID-19 Vulnerability Index
In Colombia, the first case of COVID-19 was confirmed on 6 March 2020. On 13 March 2023, Colombia registered 6,360,780 confirmed positive cases of COVID-19, representing 12.18% of the total population. The National Administrative Department of Statistics (DANE) in Colombia published in 2020 a COVID-...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10381838/ https://www.ncbi.nlm.nih.gov/pubmed/37511754 http://dx.doi.org/10.3390/jpm13071141 |
_version_ | 1785080542935384064 |
---|---|
author | Rosero Perez, Paula Andrea Realpe Gonzalez, Juan Sebastián Salazar-Cabrera, Ricardo Restrepo, David López, Diego M. Blobel, Bernd |
author_facet | Rosero Perez, Paula Andrea Realpe Gonzalez, Juan Sebastián Salazar-Cabrera, Ricardo Restrepo, David López, Diego M. Blobel, Bernd |
author_sort | Rosero Perez, Paula Andrea |
collection | PubMed |
description | In Colombia, the first case of COVID-19 was confirmed on 6 March 2020. On 13 March 2023, Colombia registered 6,360,780 confirmed positive cases of COVID-19, representing 12.18% of the total population. The National Administrative Department of Statistics (DANE) in Colombia published in 2020 a COVID-19 vulnerability index, which estimates the vulnerability (per city block) of being infected with COVID-19. Unfortunately, DANE did not consider multiple factors that could increase the risk of COVID-19 (in addition to demographic and health), such as environmental and mobility data (found in the related literature). The proposed multidimensional index considers variables of different types (unemployment rate, gross domestic product, citizens’ mobility, vaccination data, and climatological and spatial information) in which the incidence of COVID-19 is calculated and compared with the incidence of the COVID-19 vulnerability index provided by DANE. The collection, data preparation, modeling, and evaluation phases of the Cross-Industry Standard Process for Data Mining methodology (CRISP-DM) were considered for constructing the index. The multidimensional index was evaluated using multiple machine learning models to calculate the incidence of COVID-19 cases in the main cities of Colombia. The results showed that the best-performing model to predict the incidence of COVID-19 in Colombia is the Extra Trees Regressor algorithm, obtaining an R-squared of 0.829. This work is the first step toward a multidimensional analysis of COVID-19 risk factors, which has the potential to support decision making in public health programs. The results are also relevant for calculating vulnerability indexes for other viral diseases, such as dengue. |
format | Online Article Text |
id | pubmed-10381838 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-103818382023-07-29 Multidimensional Machine Learning Model to Calculate a COVID-19 Vulnerability Index Rosero Perez, Paula Andrea Realpe Gonzalez, Juan Sebastián Salazar-Cabrera, Ricardo Restrepo, David López, Diego M. Blobel, Bernd J Pers Med Article In Colombia, the first case of COVID-19 was confirmed on 6 March 2020. On 13 March 2023, Colombia registered 6,360,780 confirmed positive cases of COVID-19, representing 12.18% of the total population. The National Administrative Department of Statistics (DANE) in Colombia published in 2020 a COVID-19 vulnerability index, which estimates the vulnerability (per city block) of being infected with COVID-19. Unfortunately, DANE did not consider multiple factors that could increase the risk of COVID-19 (in addition to demographic and health), such as environmental and mobility data (found in the related literature). The proposed multidimensional index considers variables of different types (unemployment rate, gross domestic product, citizens’ mobility, vaccination data, and climatological and spatial information) in which the incidence of COVID-19 is calculated and compared with the incidence of the COVID-19 vulnerability index provided by DANE. The collection, data preparation, modeling, and evaluation phases of the Cross-Industry Standard Process for Data Mining methodology (CRISP-DM) were considered for constructing the index. The multidimensional index was evaluated using multiple machine learning models to calculate the incidence of COVID-19 cases in the main cities of Colombia. The results showed that the best-performing model to predict the incidence of COVID-19 in Colombia is the Extra Trees Regressor algorithm, obtaining an R-squared of 0.829. This work is the first step toward a multidimensional analysis of COVID-19 risk factors, which has the potential to support decision making in public health programs. The results are also relevant for calculating vulnerability indexes for other viral diseases, such as dengue. MDPI 2023-07-15 /pmc/articles/PMC10381838/ /pubmed/37511754 http://dx.doi.org/10.3390/jpm13071141 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Rosero Perez, Paula Andrea Realpe Gonzalez, Juan Sebastián Salazar-Cabrera, Ricardo Restrepo, David López, Diego M. Blobel, Bernd Multidimensional Machine Learning Model to Calculate a COVID-19 Vulnerability Index |
title | Multidimensional Machine Learning Model to Calculate a COVID-19 Vulnerability Index |
title_full | Multidimensional Machine Learning Model to Calculate a COVID-19 Vulnerability Index |
title_fullStr | Multidimensional Machine Learning Model to Calculate a COVID-19 Vulnerability Index |
title_full_unstemmed | Multidimensional Machine Learning Model to Calculate a COVID-19 Vulnerability Index |
title_short | Multidimensional Machine Learning Model to Calculate a COVID-19 Vulnerability Index |
title_sort | multidimensional machine learning model to calculate a covid-19 vulnerability index |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10381838/ https://www.ncbi.nlm.nih.gov/pubmed/37511754 http://dx.doi.org/10.3390/jpm13071141 |
work_keys_str_mv | AT roseroperezpaulaandrea multidimensionalmachinelearningmodeltocalculateacovid19vulnerabilityindex AT realpegonzalezjuansebastian multidimensionalmachinelearningmodeltocalculateacovid19vulnerabilityindex AT salazarcabreraricardo multidimensionalmachinelearningmodeltocalculateacovid19vulnerabilityindex AT restrepodavid multidimensionalmachinelearningmodeltocalculateacovid19vulnerabilityindex AT lopezdiegom multidimensionalmachinelearningmodeltocalculateacovid19vulnerabilityindex AT blobelbernd multidimensionalmachinelearningmodeltocalculateacovid19vulnerabilityindex |