Cargando…
Identifying Country-Level Risk Factors for the Spread of COVID-19 in Europe Using Machine Learning
Coronavirus disease 2019 (COVID-19) has resulted in approximately 5 million deaths around the world with unprecedented consequences in people’s daily routines and in the global economy. Despite vast increases in time and money spent on COVID-19-related research, there is still limited information ab...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8955542/ https://www.ncbi.nlm.nih.gov/pubmed/35337032 http://dx.doi.org/10.3390/v14030625 |
_version_ | 1784676361446621184 |
---|---|
author | Moustakidis, Serafeim Kokkotis, Christos Tsaopoulos, Dimitrios Sfikakis, Petros Tsiodras, Sotirios Sypsa, Vana Zaoutis, Theoklis E. Paraskevis, Dimitrios |
author_facet | Moustakidis, Serafeim Kokkotis, Christos Tsaopoulos, Dimitrios Sfikakis, Petros Tsiodras, Sotirios Sypsa, Vana Zaoutis, Theoklis E. Paraskevis, Dimitrios |
author_sort | Moustakidis, Serafeim |
collection | PubMed |
description | Coronavirus disease 2019 (COVID-19) has resulted in approximately 5 million deaths around the world with unprecedented consequences in people’s daily routines and in the global economy. Despite vast increases in time and money spent on COVID-19-related research, there is still limited information about the factors at the country level that affected COVID-19 transmission and fatality in EU. The paper focuses on the identification of these risk factors using a machine learning (ML) predictive pipeline and an associated explainability analysis. To achieve this, a hybrid dataset was created employing publicly available sources comprising heterogeneous parameters from the majority of EU countries, e.g., mobility measures, policy responses, vaccinations, and demographics/generic country-level parameters. Data pre-processing and data exploration techniques were initially applied to normalize the available data and decrease the feature dimensionality of the data problem considered. Then, a linear ε-Support Vector Machine (ε-SVM) model was employed to implement the regression task of predicting the number of deaths for each one of the three first pandemic waves (with mean square error of 0.027 for wave 1 and less than 0.02 for waves 2 and 3). Post hoc explainability analysis was finally applied to uncover the rationale behind the decision-making mechanisms of the ML pipeline and thus enhance our understanding with respect to the contribution of the selected country-level parameters to the prediction of COVID-19 deaths in EU. |
format | Online Article Text |
id | pubmed-8955542 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-89555422022-03-26 Identifying Country-Level Risk Factors for the Spread of COVID-19 in Europe Using Machine Learning Moustakidis, Serafeim Kokkotis, Christos Tsaopoulos, Dimitrios Sfikakis, Petros Tsiodras, Sotirios Sypsa, Vana Zaoutis, Theoklis E. Paraskevis, Dimitrios Viruses Article Coronavirus disease 2019 (COVID-19) has resulted in approximately 5 million deaths around the world with unprecedented consequences in people’s daily routines and in the global economy. Despite vast increases in time and money spent on COVID-19-related research, there is still limited information about the factors at the country level that affected COVID-19 transmission and fatality in EU. The paper focuses on the identification of these risk factors using a machine learning (ML) predictive pipeline and an associated explainability analysis. To achieve this, a hybrid dataset was created employing publicly available sources comprising heterogeneous parameters from the majority of EU countries, e.g., mobility measures, policy responses, vaccinations, and demographics/generic country-level parameters. Data pre-processing and data exploration techniques were initially applied to normalize the available data and decrease the feature dimensionality of the data problem considered. Then, a linear ε-Support Vector Machine (ε-SVM) model was employed to implement the regression task of predicting the number of deaths for each one of the three first pandemic waves (with mean square error of 0.027 for wave 1 and less than 0.02 for waves 2 and 3). Post hoc explainability analysis was finally applied to uncover the rationale behind the decision-making mechanisms of the ML pipeline and thus enhance our understanding with respect to the contribution of the selected country-level parameters to the prediction of COVID-19 deaths in EU. MDPI 2022-03-17 /pmc/articles/PMC8955542/ /pubmed/35337032 http://dx.doi.org/10.3390/v14030625 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Moustakidis, Serafeim Kokkotis, Christos Tsaopoulos, Dimitrios Sfikakis, Petros Tsiodras, Sotirios Sypsa, Vana Zaoutis, Theoklis E. Paraskevis, Dimitrios Identifying Country-Level Risk Factors for the Spread of COVID-19 in Europe Using Machine Learning |
title | Identifying Country-Level Risk Factors for the Spread of COVID-19 in Europe Using Machine Learning |
title_full | Identifying Country-Level Risk Factors for the Spread of COVID-19 in Europe Using Machine Learning |
title_fullStr | Identifying Country-Level Risk Factors for the Spread of COVID-19 in Europe Using Machine Learning |
title_full_unstemmed | Identifying Country-Level Risk Factors for the Spread of COVID-19 in Europe Using Machine Learning |
title_short | Identifying Country-Level Risk Factors for the Spread of COVID-19 in Europe Using Machine Learning |
title_sort | identifying country-level risk factors for the spread of covid-19 in europe using machine learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8955542/ https://www.ncbi.nlm.nih.gov/pubmed/35337032 http://dx.doi.org/10.3390/v14030625 |
work_keys_str_mv | AT moustakidisserafeim identifyingcountrylevelriskfactorsforthespreadofcovid19ineuropeusingmachinelearning AT kokkotischristos identifyingcountrylevelriskfactorsforthespreadofcovid19ineuropeusingmachinelearning AT tsaopoulosdimitrios identifyingcountrylevelriskfactorsforthespreadofcovid19ineuropeusingmachinelearning AT sfikakispetros identifyingcountrylevelriskfactorsforthespreadofcovid19ineuropeusingmachinelearning AT tsiodrassotirios identifyingcountrylevelriskfactorsforthespreadofcovid19ineuropeusingmachinelearning AT sypsavana identifyingcountrylevelriskfactorsforthespreadofcovid19ineuropeusingmachinelearning AT zaoutistheoklise identifyingcountrylevelriskfactorsforthespreadofcovid19ineuropeusingmachinelearning AT paraskevisdimitrios identifyingcountrylevelriskfactorsforthespreadofcovid19ineuropeusingmachinelearning |