Cargando…

A proficient approach to forecast COVID-19 spread via optimized dynamic machine learning models

This study aims to develop an assumption-free data-driven model to accurately forecast COVID-19 spread. Towards this end, we firstly employed Bayesian optimization to tune the Gaussian process regression (GPR) hyperparameters to develop an efficient GPR-based model for forecasting the recovered and...

Descripción completa

Detalles Bibliográficos
Autores principales: Alali, Yasminah, Harrou, Fouzi, Sun, Ying
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8844088/
https://www.ncbi.nlm.nih.gov/pubmed/35165290
http://dx.doi.org/10.1038/s41598-022-06218-3
_version_ 1784651406893907968
author Alali, Yasminah
Harrou, Fouzi
Sun, Ying
author_facet Alali, Yasminah
Harrou, Fouzi
Sun, Ying
author_sort Alali, Yasminah
collection PubMed
description This study aims to develop an assumption-free data-driven model to accurately forecast COVID-19 spread. Towards this end, we firstly employed Bayesian optimization to tune the Gaussian process regression (GPR) hyperparameters to develop an efficient GPR-based model for forecasting the recovered and confirmed COVID-19 cases in two highly impacted countries, India and Brazil. However, machine learning models do not consider the time dependency in the COVID-19 data series. Here, dynamic information has been taken into account to alleviate this limitation by introducing lagged measurements in constructing the investigated machine learning models. Additionally, we assessed the contribution of the incorporated features to the COVID-19 prediction using the Random Forest algorithm. Results reveal that significant improvement can be obtained using the proposed dynamic machine learning models. In addition, the results highlighted the superior performance of the dynamic GPR compared to the other models (i.e., Support vector regression, Boosted trees, Bagged trees, Decision tree, Random Forest, and XGBoost) by achieving an averaged mean absolute percentage error of around 0.1%. Finally, we provided the confidence level of the predicted results based on the dynamic GPR model and showed that the predictions are within the 95% confidence interval. This study presents a promising shallow and simple approach for predicting COVID-19 spread.
format Online
Article
Text
id pubmed-8844088
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-88440882022-02-16 A proficient approach to forecast COVID-19 spread via optimized dynamic machine learning models Alali, Yasminah Harrou, Fouzi Sun, Ying Sci Rep Article This study aims to develop an assumption-free data-driven model to accurately forecast COVID-19 spread. Towards this end, we firstly employed Bayesian optimization to tune the Gaussian process regression (GPR) hyperparameters to develop an efficient GPR-based model for forecasting the recovered and confirmed COVID-19 cases in two highly impacted countries, India and Brazil. However, machine learning models do not consider the time dependency in the COVID-19 data series. Here, dynamic information has been taken into account to alleviate this limitation by introducing lagged measurements in constructing the investigated machine learning models. Additionally, we assessed the contribution of the incorporated features to the COVID-19 prediction using the Random Forest algorithm. Results reveal that significant improvement can be obtained using the proposed dynamic machine learning models. In addition, the results highlighted the superior performance of the dynamic GPR compared to the other models (i.e., Support vector regression, Boosted trees, Bagged trees, Decision tree, Random Forest, and XGBoost) by achieving an averaged mean absolute percentage error of around 0.1%. Finally, we provided the confidence level of the predicted results based on the dynamic GPR model and showed that the predictions are within the 95% confidence interval. This study presents a promising shallow and simple approach for predicting COVID-19 spread. Nature Publishing Group UK 2022-02-14 /pmc/articles/PMC8844088/ /pubmed/35165290 http://dx.doi.org/10.1038/s41598-022-06218-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Alali, Yasminah
Harrou, Fouzi
Sun, Ying
A proficient approach to forecast COVID-19 spread via optimized dynamic machine learning models
title A proficient approach to forecast COVID-19 spread via optimized dynamic machine learning models
title_full A proficient approach to forecast COVID-19 spread via optimized dynamic machine learning models
title_fullStr A proficient approach to forecast COVID-19 spread via optimized dynamic machine learning models
title_full_unstemmed A proficient approach to forecast COVID-19 spread via optimized dynamic machine learning models
title_short A proficient approach to forecast COVID-19 spread via optimized dynamic machine learning models
title_sort proficient approach to forecast covid-19 spread via optimized dynamic machine learning models
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8844088/
https://www.ncbi.nlm.nih.gov/pubmed/35165290
http://dx.doi.org/10.1038/s41598-022-06218-3
work_keys_str_mv AT alaliyasminah aproficientapproachtoforecastcovid19spreadviaoptimizeddynamicmachinelearningmodels
AT harroufouzi aproficientapproachtoforecastcovid19spreadviaoptimizeddynamicmachinelearningmodels
AT sunying aproficientapproachtoforecastcovid19spreadviaoptimizeddynamicmachinelearningmodels
AT alaliyasminah proficientapproachtoforecastcovid19spreadviaoptimizeddynamicmachinelearningmodels
AT harroufouzi proficientapproachtoforecastcovid19spreadviaoptimizeddynamicmachinelearningmodels
AT sunying proficientapproachtoforecastcovid19spreadviaoptimizeddynamicmachinelearningmodels