Cargando…

Identifying Country-Level Risk Factors for the Spread of COVID-19 in Europe Using Machine Learning

Coronavirus disease 2019 (COVID-19) has resulted in approximately 5 million deaths around the world with unprecedented consequences in people’s daily routines and in the global economy. Despite vast increases in time and money spent on COVID-19-related research, there is still limited information ab...

Descripción completa

Detalles Bibliográficos
Autores principales: Moustakidis, Serafeim, Kokkotis, Christos, Tsaopoulos, Dimitrios, Sfikakis, Petros, Tsiodras, Sotirios, Sypsa, Vana, Zaoutis, Theoklis E., Paraskevis, Dimitrios
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8955542/
https://www.ncbi.nlm.nih.gov/pubmed/35337032
http://dx.doi.org/10.3390/v14030625
_version_ 1784676361446621184
author Moustakidis, Serafeim
Kokkotis, Christos
Tsaopoulos, Dimitrios
Sfikakis, Petros
Tsiodras, Sotirios
Sypsa, Vana
Zaoutis, Theoklis E.
Paraskevis, Dimitrios
author_facet Moustakidis, Serafeim
Kokkotis, Christos
Tsaopoulos, Dimitrios
Sfikakis, Petros
Tsiodras, Sotirios
Sypsa, Vana
Zaoutis, Theoklis E.
Paraskevis, Dimitrios
author_sort Moustakidis, Serafeim
collection PubMed
description Coronavirus disease 2019 (COVID-19) has resulted in approximately 5 million deaths around the world with unprecedented consequences in people’s daily routines and in the global economy. Despite vast increases in time and money spent on COVID-19-related research, there is still limited information about the factors at the country level that affected COVID-19 transmission and fatality in EU. The paper focuses on the identification of these risk factors using a machine learning (ML) predictive pipeline and an associated explainability analysis. To achieve this, a hybrid dataset was created employing publicly available sources comprising heterogeneous parameters from the majority of EU countries, e.g., mobility measures, policy responses, vaccinations, and demographics/generic country-level parameters. Data pre-processing and data exploration techniques were initially applied to normalize the available data and decrease the feature dimensionality of the data problem considered. Then, a linear ε-Support Vector Machine (ε-SVM) model was employed to implement the regression task of predicting the number of deaths for each one of the three first pandemic waves (with mean square error of 0.027 for wave 1 and less than 0.02 for waves 2 and 3). Post hoc explainability analysis was finally applied to uncover the rationale behind the decision-making mechanisms of the ML pipeline and thus enhance our understanding with respect to the contribution of the selected country-level parameters to the prediction of COVID-19 deaths in EU.
format Online
Article
Text
id pubmed-8955542
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-89555422022-03-26 Identifying Country-Level Risk Factors for the Spread of COVID-19 in Europe Using Machine Learning Moustakidis, Serafeim Kokkotis, Christos Tsaopoulos, Dimitrios Sfikakis, Petros Tsiodras, Sotirios Sypsa, Vana Zaoutis, Theoklis E. Paraskevis, Dimitrios Viruses Article Coronavirus disease 2019 (COVID-19) has resulted in approximately 5 million deaths around the world with unprecedented consequences in people’s daily routines and in the global economy. Despite vast increases in time and money spent on COVID-19-related research, there is still limited information about the factors at the country level that affected COVID-19 transmission and fatality in EU. The paper focuses on the identification of these risk factors using a machine learning (ML) predictive pipeline and an associated explainability analysis. To achieve this, a hybrid dataset was created employing publicly available sources comprising heterogeneous parameters from the majority of EU countries, e.g., mobility measures, policy responses, vaccinations, and demographics/generic country-level parameters. Data pre-processing and data exploration techniques were initially applied to normalize the available data and decrease the feature dimensionality of the data problem considered. Then, a linear ε-Support Vector Machine (ε-SVM) model was employed to implement the regression task of predicting the number of deaths for each one of the three first pandemic waves (with mean square error of 0.027 for wave 1 and less than 0.02 for waves 2 and 3). Post hoc explainability analysis was finally applied to uncover the rationale behind the decision-making mechanisms of the ML pipeline and thus enhance our understanding with respect to the contribution of the selected country-level parameters to the prediction of COVID-19 deaths in EU. MDPI 2022-03-17 /pmc/articles/PMC8955542/ /pubmed/35337032 http://dx.doi.org/10.3390/v14030625 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Moustakidis, Serafeim
Kokkotis, Christos
Tsaopoulos, Dimitrios
Sfikakis, Petros
Tsiodras, Sotirios
Sypsa, Vana
Zaoutis, Theoklis E.
Paraskevis, Dimitrios
Identifying Country-Level Risk Factors for the Spread of COVID-19 in Europe Using Machine Learning
title Identifying Country-Level Risk Factors for the Spread of COVID-19 in Europe Using Machine Learning
title_full Identifying Country-Level Risk Factors for the Spread of COVID-19 in Europe Using Machine Learning
title_fullStr Identifying Country-Level Risk Factors for the Spread of COVID-19 in Europe Using Machine Learning
title_full_unstemmed Identifying Country-Level Risk Factors for the Spread of COVID-19 in Europe Using Machine Learning
title_short Identifying Country-Level Risk Factors for the Spread of COVID-19 in Europe Using Machine Learning
title_sort identifying country-level risk factors for the spread of covid-19 in europe using machine learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8955542/
https://www.ncbi.nlm.nih.gov/pubmed/35337032
http://dx.doi.org/10.3390/v14030625
work_keys_str_mv AT moustakidisserafeim identifyingcountrylevelriskfactorsforthespreadofcovid19ineuropeusingmachinelearning
AT kokkotischristos identifyingcountrylevelriskfactorsforthespreadofcovid19ineuropeusingmachinelearning
AT tsaopoulosdimitrios identifyingcountrylevelriskfactorsforthespreadofcovid19ineuropeusingmachinelearning
AT sfikakispetros identifyingcountrylevelriskfactorsforthespreadofcovid19ineuropeusingmachinelearning
AT tsiodrassotirios identifyingcountrylevelriskfactorsforthespreadofcovid19ineuropeusingmachinelearning
AT sypsavana identifyingcountrylevelriskfactorsforthespreadofcovid19ineuropeusingmachinelearning
AT zaoutistheoklise identifyingcountrylevelriskfactorsforthespreadofcovid19ineuropeusingmachinelearning
AT paraskevisdimitrios identifyingcountrylevelriskfactorsforthespreadofcovid19ineuropeusingmachinelearning