Cargando…

Forecasting COVID-19 spreading through an ensemble of classical and machine learning models: Spain’s case study

In this work the applicability of an ensemble of population and machine learning models to predict the evolution of the COVID-19 pandemic in Spain is evaluated, relying solely on public datasets. Firstly, using only incidence data, we trained machine learning models and adjusted classical ODE-based...

Descripción completa

Detalles Bibliográficos
Autores principales: Heredia Cacha, Ignacio, Sáinz-Pardo Díaz, Judith, Castrillo, María, López García, Álvaro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10127188/
https://www.ncbi.nlm.nih.gov/pubmed/37185927
http://dx.doi.org/10.1038/s41598-023-33795-8
_version_ 1785030410655236096
author Heredia Cacha, Ignacio
Sáinz-Pardo Díaz, Judith
Castrillo, María
López García, Álvaro
author_facet Heredia Cacha, Ignacio
Sáinz-Pardo Díaz, Judith
Castrillo, María
López García, Álvaro
author_sort Heredia Cacha, Ignacio
collection PubMed
description In this work the applicability of an ensemble of population and machine learning models to predict the evolution of the COVID-19 pandemic in Spain is evaluated, relying solely on public datasets. Firstly, using only incidence data, we trained machine learning models and adjusted classical ODE-based population models, especially suited to capture long term trends. As a novel approach, we then made an ensemble of these two families of models in order to obtain a more robust and accurate prediction. We then proceed to improve machine learning models by adding more input features: vaccination, human mobility and weather conditions. However, these improvements did not translate to the overall ensemble, as the different model families had also different prediction patterns. Additionally, machine learning models degraded when new COVID variants appeared after training. We finally used Shapley Additive Explanation values to discern the relative importance of the different input features for the machine learning models’ predictions. The conclusion of this work is that the ensemble of machine learning models and population models can be a promising alternative to SEIR-like compartmental models, especially given that the former do not need data from recovered patients, which are hard to collect and generally unavailable.
format Online
Article
Text
id pubmed-10127188
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-101271882023-04-27 Forecasting COVID-19 spreading through an ensemble of classical and machine learning models: Spain’s case study Heredia Cacha, Ignacio Sáinz-Pardo Díaz, Judith Castrillo, María López García, Álvaro Sci Rep Article In this work the applicability of an ensemble of population and machine learning models to predict the evolution of the COVID-19 pandemic in Spain is evaluated, relying solely on public datasets. Firstly, using only incidence data, we trained machine learning models and adjusted classical ODE-based population models, especially suited to capture long term trends. As a novel approach, we then made an ensemble of these two families of models in order to obtain a more robust and accurate prediction. We then proceed to improve machine learning models by adding more input features: vaccination, human mobility and weather conditions. However, these improvements did not translate to the overall ensemble, as the different model families had also different prediction patterns. Additionally, machine learning models degraded when new COVID variants appeared after training. We finally used Shapley Additive Explanation values to discern the relative importance of the different input features for the machine learning models’ predictions. The conclusion of this work is that the ensemble of machine learning models and population models can be a promising alternative to SEIR-like compartmental models, especially given that the former do not need data from recovered patients, which are hard to collect and generally unavailable. Nature Publishing Group UK 2023-04-25 /pmc/articles/PMC10127188/ /pubmed/37185927 http://dx.doi.org/10.1038/s41598-023-33795-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Heredia Cacha, Ignacio
Sáinz-Pardo Díaz, Judith
Castrillo, María
López García, Álvaro
Forecasting COVID-19 spreading through an ensemble of classical and machine learning models: Spain’s case study
title Forecasting COVID-19 spreading through an ensemble of classical and machine learning models: Spain’s case study
title_full Forecasting COVID-19 spreading through an ensemble of classical and machine learning models: Spain’s case study
title_fullStr Forecasting COVID-19 spreading through an ensemble of classical and machine learning models: Spain’s case study
title_full_unstemmed Forecasting COVID-19 spreading through an ensemble of classical and machine learning models: Spain’s case study
title_short Forecasting COVID-19 spreading through an ensemble of classical and machine learning models: Spain’s case study
title_sort forecasting covid-19 spreading through an ensemble of classical and machine learning models: spain’s case study
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10127188/
https://www.ncbi.nlm.nih.gov/pubmed/37185927
http://dx.doi.org/10.1038/s41598-023-33795-8
work_keys_str_mv AT herediacachaignacio forecastingcovid19spreadingthroughanensembleofclassicalandmachinelearningmodelsspainscasestudy
AT sainzpardodiazjudith forecastingcovid19spreadingthroughanensembleofclassicalandmachinelearningmodelsspainscasestudy
AT castrillomaria forecastingcovid19spreadingthroughanensembleofclassicalandmachinelearningmodelsspainscasestudy
AT lopezgarciaalvaro forecastingcovid19spreadingthroughanensembleofclassicalandmachinelearningmodelsspainscasestudy