Cargando…
Forecasting COVID-19 spreading through an ensemble of classical and machine learning models: Spain’s case study
In this work the applicability of an ensemble of population and machine learning models to predict the evolution of the COVID-19 pandemic in Spain is evaluated, relying solely on public datasets. Firstly, using only incidence data, we trained machine learning models and adjusted classical ODE-based...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10127188/ https://www.ncbi.nlm.nih.gov/pubmed/37185927 http://dx.doi.org/10.1038/s41598-023-33795-8 |
_version_ | 1785030410655236096 |
---|---|
author | Heredia Cacha, Ignacio Sáinz-Pardo Díaz, Judith Castrillo, María López García, Álvaro |
author_facet | Heredia Cacha, Ignacio Sáinz-Pardo Díaz, Judith Castrillo, María López García, Álvaro |
author_sort | Heredia Cacha, Ignacio |
collection | PubMed |
description | In this work the applicability of an ensemble of population and machine learning models to predict the evolution of the COVID-19 pandemic in Spain is evaluated, relying solely on public datasets. Firstly, using only incidence data, we trained machine learning models and adjusted classical ODE-based population models, especially suited to capture long term trends. As a novel approach, we then made an ensemble of these two families of models in order to obtain a more robust and accurate prediction. We then proceed to improve machine learning models by adding more input features: vaccination, human mobility and weather conditions. However, these improvements did not translate to the overall ensemble, as the different model families had also different prediction patterns. Additionally, machine learning models degraded when new COVID variants appeared after training. We finally used Shapley Additive Explanation values to discern the relative importance of the different input features for the machine learning models’ predictions. The conclusion of this work is that the ensemble of machine learning models and population models can be a promising alternative to SEIR-like compartmental models, especially given that the former do not need data from recovered patients, which are hard to collect and generally unavailable. |
format | Online Article Text |
id | pubmed-10127188 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-101271882023-04-27 Forecasting COVID-19 spreading through an ensemble of classical and machine learning models: Spain’s case study Heredia Cacha, Ignacio Sáinz-Pardo Díaz, Judith Castrillo, María López García, Álvaro Sci Rep Article In this work the applicability of an ensemble of population and machine learning models to predict the evolution of the COVID-19 pandemic in Spain is evaluated, relying solely on public datasets. Firstly, using only incidence data, we trained machine learning models and adjusted classical ODE-based population models, especially suited to capture long term trends. As a novel approach, we then made an ensemble of these two families of models in order to obtain a more robust and accurate prediction. We then proceed to improve machine learning models by adding more input features: vaccination, human mobility and weather conditions. However, these improvements did not translate to the overall ensemble, as the different model families had also different prediction patterns. Additionally, machine learning models degraded when new COVID variants appeared after training. We finally used Shapley Additive Explanation values to discern the relative importance of the different input features for the machine learning models’ predictions. The conclusion of this work is that the ensemble of machine learning models and population models can be a promising alternative to SEIR-like compartmental models, especially given that the former do not need data from recovered patients, which are hard to collect and generally unavailable. Nature Publishing Group UK 2023-04-25 /pmc/articles/PMC10127188/ /pubmed/37185927 http://dx.doi.org/10.1038/s41598-023-33795-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Heredia Cacha, Ignacio Sáinz-Pardo Díaz, Judith Castrillo, María López García, Álvaro Forecasting COVID-19 spreading through an ensemble of classical and machine learning models: Spain’s case study |
title | Forecasting COVID-19 spreading through an ensemble of classical and machine learning models: Spain’s case study |
title_full | Forecasting COVID-19 spreading through an ensemble of classical and machine learning models: Spain’s case study |
title_fullStr | Forecasting COVID-19 spreading through an ensemble of classical and machine learning models: Spain’s case study |
title_full_unstemmed | Forecasting COVID-19 spreading through an ensemble of classical and machine learning models: Spain’s case study |
title_short | Forecasting COVID-19 spreading through an ensemble of classical and machine learning models: Spain’s case study |
title_sort | forecasting covid-19 spreading through an ensemble of classical and machine learning models: spain’s case study |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10127188/ https://www.ncbi.nlm.nih.gov/pubmed/37185927 http://dx.doi.org/10.1038/s41598-023-33795-8 |
work_keys_str_mv | AT herediacachaignacio forecastingcovid19spreadingthroughanensembleofclassicalandmachinelearningmodelsspainscasestudy AT sainzpardodiazjudith forecastingcovid19spreadingthroughanensembleofclassicalandmachinelearningmodelsspainscasestudy AT castrillomaria forecastingcovid19spreadingthroughanensembleofclassicalandmachinelearningmodelsspainscasestudy AT lopezgarciaalvaro forecastingcovid19spreadingthroughanensembleofclassicalandmachinelearningmodelsspainscasestudy |