Cargando…

Multidimensional Machine Learning Model to Calculate a COVID-19 Vulnerability Index

In Colombia, the first case of COVID-19 was confirmed on 6 March 2020. On 13 March 2023, Colombia registered 6,360,780 confirmed positive cases of COVID-19, representing 12.18% of the total population. The National Administrative Department of Statistics (DANE) in Colombia published in 2020 a COVID-...

Descripción completa

Detalles Bibliográficos
Autores principales: Rosero Perez, Paula Andrea, Realpe Gonzalez, Juan Sebastián, Salazar-Cabrera, Ricardo, Restrepo, David, López, Diego M., Blobel, Bernd
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10381838/
https://www.ncbi.nlm.nih.gov/pubmed/37511754
http://dx.doi.org/10.3390/jpm13071141
_version_ 1785080542935384064
author Rosero Perez, Paula Andrea
Realpe Gonzalez, Juan Sebastián
Salazar-Cabrera, Ricardo
Restrepo, David
López, Diego M.
Blobel, Bernd
author_facet Rosero Perez, Paula Andrea
Realpe Gonzalez, Juan Sebastián
Salazar-Cabrera, Ricardo
Restrepo, David
López, Diego M.
Blobel, Bernd
author_sort Rosero Perez, Paula Andrea
collection PubMed
description In Colombia, the first case of COVID-19 was confirmed on 6 March 2020. On 13 March 2023, Colombia registered 6,360,780 confirmed positive cases of COVID-19, representing 12.18% of the total population. The National Administrative Department of Statistics (DANE) in Colombia published in 2020 a COVID-19 vulnerability index, which estimates the vulnerability (per city block) of being infected with COVID-19. Unfortunately, DANE did not consider multiple factors that could increase the risk of COVID-19 (in addition to demographic and health), such as environmental and mobility data (found in the related literature). The proposed multidimensional index considers variables of different types (unemployment rate, gross domestic product, citizens’ mobility, vaccination data, and climatological and spatial information) in which the incidence of COVID-19 is calculated and compared with the incidence of the COVID-19 vulnerability index provided by DANE. The collection, data preparation, modeling, and evaluation phases of the Cross-Industry Standard Process for Data Mining methodology (CRISP-DM) were considered for constructing the index. The multidimensional index was evaluated using multiple machine learning models to calculate the incidence of COVID-19 cases in the main cities of Colombia. The results showed that the best-performing model to predict the incidence of COVID-19 in Colombia is the Extra Trees Regressor algorithm, obtaining an R-squared of 0.829. This work is the first step toward a multidimensional analysis of COVID-19 risk factors, which has the potential to support decision making in public health programs. The results are also relevant for calculating vulnerability indexes for other viral diseases, such as dengue.
format Online
Article
Text
id pubmed-10381838
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-103818382023-07-29 Multidimensional Machine Learning Model to Calculate a COVID-19 Vulnerability Index Rosero Perez, Paula Andrea Realpe Gonzalez, Juan Sebastián Salazar-Cabrera, Ricardo Restrepo, David López, Diego M. Blobel, Bernd J Pers Med Article In Colombia, the first case of COVID-19 was confirmed on 6 March 2020. On 13 March 2023, Colombia registered 6,360,780 confirmed positive cases of COVID-19, representing 12.18% of the total population. The National Administrative Department of Statistics (DANE) in Colombia published in 2020 a COVID-19 vulnerability index, which estimates the vulnerability (per city block) of being infected with COVID-19. Unfortunately, DANE did not consider multiple factors that could increase the risk of COVID-19 (in addition to demographic and health), such as environmental and mobility data (found in the related literature). The proposed multidimensional index considers variables of different types (unemployment rate, gross domestic product, citizens’ mobility, vaccination data, and climatological and spatial information) in which the incidence of COVID-19 is calculated and compared with the incidence of the COVID-19 vulnerability index provided by DANE. The collection, data preparation, modeling, and evaluation phases of the Cross-Industry Standard Process for Data Mining methodology (CRISP-DM) were considered for constructing the index. The multidimensional index was evaluated using multiple machine learning models to calculate the incidence of COVID-19 cases in the main cities of Colombia. The results showed that the best-performing model to predict the incidence of COVID-19 in Colombia is the Extra Trees Regressor algorithm, obtaining an R-squared of 0.829. This work is the first step toward a multidimensional analysis of COVID-19 risk factors, which has the potential to support decision making in public health programs. The results are also relevant for calculating vulnerability indexes for other viral diseases, such as dengue. MDPI 2023-07-15 /pmc/articles/PMC10381838/ /pubmed/37511754 http://dx.doi.org/10.3390/jpm13071141 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Rosero Perez, Paula Andrea
Realpe Gonzalez, Juan Sebastián
Salazar-Cabrera, Ricardo
Restrepo, David
López, Diego M.
Blobel, Bernd
Multidimensional Machine Learning Model to Calculate a COVID-19 Vulnerability Index
title Multidimensional Machine Learning Model to Calculate a COVID-19 Vulnerability Index
title_full Multidimensional Machine Learning Model to Calculate a COVID-19 Vulnerability Index
title_fullStr Multidimensional Machine Learning Model to Calculate a COVID-19 Vulnerability Index
title_full_unstemmed Multidimensional Machine Learning Model to Calculate a COVID-19 Vulnerability Index
title_short Multidimensional Machine Learning Model to Calculate a COVID-19 Vulnerability Index
title_sort multidimensional machine learning model to calculate a covid-19 vulnerability index
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10381838/
https://www.ncbi.nlm.nih.gov/pubmed/37511754
http://dx.doi.org/10.3390/jpm13071141
work_keys_str_mv AT roseroperezpaulaandrea multidimensionalmachinelearningmodeltocalculateacovid19vulnerabilityindex
AT realpegonzalezjuansebastian multidimensionalmachinelearningmodeltocalculateacovid19vulnerabilityindex
AT salazarcabreraricardo multidimensionalmachinelearningmodeltocalculateacovid19vulnerabilityindex
AT restrepodavid multidimensionalmachinelearningmodeltocalculateacovid19vulnerabilityindex
AT lopezdiegom multidimensionalmachinelearningmodeltocalculateacovid19vulnerabilityindex
AT blobelbernd multidimensionalmachinelearningmodeltocalculateacovid19vulnerabilityindex