Cargando…
Prediction and analysis of COVID-19 daily new cases and cumulative cases: times series forecasting and machine learning models
BACKGROUND: COVID-19 poses a severe threat to global human health, especially the USA, Brazil, and India cases continue to increase dynamically, which has a far-reaching impact on people's health, social activities, and the local economic situation. METHODS: The study proposed the ARIMA, SARIMA...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9131989/ https://www.ncbi.nlm.nih.gov/pubmed/35614387 http://dx.doi.org/10.1186/s12879-022-07472-6 |
_version_ | 1784713289435971584 |
---|---|
author | Wang, Yanding Yan, Zehui Wang, Ding Yang, Meitao Li, Zhiqiang Gong, Xinran Wu, Di Zhai, Lingling Zhang, Wenyi Wang, Yong |
author_facet | Wang, Yanding Yan, Zehui Wang, Ding Yang, Meitao Li, Zhiqiang Gong, Xinran Wu, Di Zhai, Lingling Zhang, Wenyi Wang, Yong |
author_sort | Wang, Yanding |
collection | PubMed |
description | BACKGROUND: COVID-19 poses a severe threat to global human health, especially the USA, Brazil, and India cases continue to increase dynamically, which has a far-reaching impact on people's health, social activities, and the local economic situation. METHODS: The study proposed the ARIMA, SARIMA and Prophet models to predict daily new cases and cumulative confirmed cases in the USA, Brazil and India over the next 30 days based on the COVID-19 new confirmed cases and cumulative confirmed cases data set(May 1, 2020, and November 30, 2021) published by the official WHO, Three models were implemented in the R 4.1.1 software with forecast and prophet package. The performance of different models was evaluated by using root mean square error (RMSE), mean absolute error (MAE) and mean absolute percentage error (MAPE). RESULTS: Through the fitting and prediction of daily new case data, we reveal that the Prophet model has more advantages in the prediction of the COVID-19 of the USA, which could compose data components and capture periodic characteristics when the data changes significantly, while SARIMA is more likely to appear over-fitting in the USA. And the SARIMA model captured a seven-day period hidden in daily COVID-19 new cases from 3 countries. While in the prediction of new cumulative cases, the ARIMA model has a better ability to fit and predict the data with a positive growth trend in different countries(Brazil and India). CONCLUSIONS: This study can shed light on understanding the outbreak trends and give an insight into the epidemiological control of these regions. Further, the prediction of the Prophet model showed sufficient accuracy in the daily COVID-19 new cases of the USA. The ARIMA model is suitable for predicting Brazil and India, which can help take precautions and policy formulation for this epidemic in other countries. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12879-022-07472-6. |
format | Online Article Text |
id | pubmed-9131989 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-91319892022-05-26 Prediction and analysis of COVID-19 daily new cases and cumulative cases: times series forecasting and machine learning models Wang, Yanding Yan, Zehui Wang, Ding Yang, Meitao Li, Zhiqiang Gong, Xinran Wu, Di Zhai, Lingling Zhang, Wenyi Wang, Yong BMC Infect Dis Research BACKGROUND: COVID-19 poses a severe threat to global human health, especially the USA, Brazil, and India cases continue to increase dynamically, which has a far-reaching impact on people's health, social activities, and the local economic situation. METHODS: The study proposed the ARIMA, SARIMA and Prophet models to predict daily new cases and cumulative confirmed cases in the USA, Brazil and India over the next 30 days based on the COVID-19 new confirmed cases and cumulative confirmed cases data set(May 1, 2020, and November 30, 2021) published by the official WHO, Three models were implemented in the R 4.1.1 software with forecast and prophet package. The performance of different models was evaluated by using root mean square error (RMSE), mean absolute error (MAE) and mean absolute percentage error (MAPE). RESULTS: Through the fitting and prediction of daily new case data, we reveal that the Prophet model has more advantages in the prediction of the COVID-19 of the USA, which could compose data components and capture periodic characteristics when the data changes significantly, while SARIMA is more likely to appear over-fitting in the USA. And the SARIMA model captured a seven-day period hidden in daily COVID-19 new cases from 3 countries. While in the prediction of new cumulative cases, the ARIMA model has a better ability to fit and predict the data with a positive growth trend in different countries(Brazil and India). CONCLUSIONS: This study can shed light on understanding the outbreak trends and give an insight into the epidemiological control of these regions. Further, the prediction of the Prophet model showed sufficient accuracy in the daily COVID-19 new cases of the USA. The ARIMA model is suitable for predicting Brazil and India, which can help take precautions and policy formulation for this epidemic in other countries. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12879-022-07472-6. BioMed Central 2022-05-25 /pmc/articles/PMC9131989/ /pubmed/35614387 http://dx.doi.org/10.1186/s12879-022-07472-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Wang, Yanding Yan, Zehui Wang, Ding Yang, Meitao Li, Zhiqiang Gong, Xinran Wu, Di Zhai, Lingling Zhang, Wenyi Wang, Yong Prediction and analysis of COVID-19 daily new cases and cumulative cases: times series forecasting and machine learning models |
title | Prediction and analysis of COVID-19 daily new cases and cumulative cases: times series forecasting and machine learning models |
title_full | Prediction and analysis of COVID-19 daily new cases and cumulative cases: times series forecasting and machine learning models |
title_fullStr | Prediction and analysis of COVID-19 daily new cases and cumulative cases: times series forecasting and machine learning models |
title_full_unstemmed | Prediction and analysis of COVID-19 daily new cases and cumulative cases: times series forecasting and machine learning models |
title_short | Prediction and analysis of COVID-19 daily new cases and cumulative cases: times series forecasting and machine learning models |
title_sort | prediction and analysis of covid-19 daily new cases and cumulative cases: times series forecasting and machine learning models |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9131989/ https://www.ncbi.nlm.nih.gov/pubmed/35614387 http://dx.doi.org/10.1186/s12879-022-07472-6 |
work_keys_str_mv | AT wangyanding predictionandanalysisofcovid19dailynewcasesandcumulativecasestimesseriesforecastingandmachinelearningmodels AT yanzehui predictionandanalysisofcovid19dailynewcasesandcumulativecasestimesseriesforecastingandmachinelearningmodels AT wangding predictionandanalysisofcovid19dailynewcasesandcumulativecasestimesseriesforecastingandmachinelearningmodels AT yangmeitao predictionandanalysisofcovid19dailynewcasesandcumulativecasestimesseriesforecastingandmachinelearningmodels AT lizhiqiang predictionandanalysisofcovid19dailynewcasesandcumulativecasestimesseriesforecastingandmachinelearningmodels AT gongxinran predictionandanalysisofcovid19dailynewcasesandcumulativecasestimesseriesforecastingandmachinelearningmodels AT wudi predictionandanalysisofcovid19dailynewcasesandcumulativecasestimesseriesforecastingandmachinelearningmodels AT zhailingling predictionandanalysisofcovid19dailynewcasesandcumulativecasestimesseriesforecastingandmachinelearningmodels AT zhangwenyi predictionandanalysisofcovid19dailynewcasesandcumulativecasestimesseriesforecastingandmachinelearningmodels AT wangyong predictionandanalysisofcovid19dailynewcasesandcumulativecasestimesseriesforecastingandmachinelearningmodels |