Cargando…

A novel bidirectional LSTM deep learning approach for COVID-19 forecasting

COVID-19 has resulted in significant morbidity and mortality globally. We develop a model that uses data from thirty days before a fixed time point to forecast the daily number of new COVID-19 cases fourteen days later in the early stages of the pandemic. Various time-dependent factors including the...

Descripción completa

Detalles Bibliográficos
Autores principales: Aung, Nway Nway, Pang, Junxiong, Chua, Matthew Chin Heng, Tan, Hui Xing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10589260/
https://www.ncbi.nlm.nih.gov/pubmed/37863921
http://dx.doi.org/10.1038/s41598-023-44924-8
_version_ 1785123752066940928
author Aung, Nway Nway
Pang, Junxiong
Chua, Matthew Chin Heng
Tan, Hui Xing
author_facet Aung, Nway Nway
Pang, Junxiong
Chua, Matthew Chin Heng
Tan, Hui Xing
author_sort Aung, Nway Nway
collection PubMed
description COVID-19 has resulted in significant morbidity and mortality globally. We develop a model that uses data from thirty days before a fixed time point to forecast the daily number of new COVID-19 cases fourteen days later in the early stages of the pandemic. Various time-dependent factors including the number of daily confirmed cases, reproduction number, policy measures, mobility and flight numbers were collected. A deep-learning model using Bidirectional Long-Short Term Memory (Bi-LSTM) architecture was trained on data from 22nd Jan 2020 to 8 Jan 2021 to forecast the new daily number of COVID-19 cases 14 days in advance across 190 countries, from 9 to 31 Jan 2021. A second model with fewer variables but similar architecture was developed. Results were summarised by mean absolute error (MAE), root mean squared error (RMSE), mean absolute percentage error (MAPE), and total absolute percentage error and compared against results from a classical ARIMA model. Median MAE was 157 daily cases (IQR: 26–666) under the first model, and 150 (IQR: 26–716) under the second. Countries with more accurate forecasts had more daily cases and experienced more waves of COVID-19 infections. Among countries with over 10,000 cases over the prediction period, median total absolute percentage error was 33% (IQR: 18–59%) and 34% (IQR: 16–66%) for the first and second models respectively. Both models had comparable median total absolute percentage errors but lower maximum total absolute percentage errors as compared to the classical ARIMA model. A deep-learning approach using Bi-LSTM architecture and open-source data was validated on 190 countries to forecast the daily number of cases in the early stages of the COVID-19 outbreak. Fewer variables could potentially be used without impacting prediction accuracy.
format Online
Article
Text
id pubmed-10589260
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-105892602023-10-22 A novel bidirectional LSTM deep learning approach for COVID-19 forecasting Aung, Nway Nway Pang, Junxiong Chua, Matthew Chin Heng Tan, Hui Xing Sci Rep Article COVID-19 has resulted in significant morbidity and mortality globally. We develop a model that uses data from thirty days before a fixed time point to forecast the daily number of new COVID-19 cases fourteen days later in the early stages of the pandemic. Various time-dependent factors including the number of daily confirmed cases, reproduction number, policy measures, mobility and flight numbers were collected. A deep-learning model using Bidirectional Long-Short Term Memory (Bi-LSTM) architecture was trained on data from 22nd Jan 2020 to 8 Jan 2021 to forecast the new daily number of COVID-19 cases 14 days in advance across 190 countries, from 9 to 31 Jan 2021. A second model with fewer variables but similar architecture was developed. Results were summarised by mean absolute error (MAE), root mean squared error (RMSE), mean absolute percentage error (MAPE), and total absolute percentage error and compared against results from a classical ARIMA model. Median MAE was 157 daily cases (IQR: 26–666) under the first model, and 150 (IQR: 26–716) under the second. Countries with more accurate forecasts had more daily cases and experienced more waves of COVID-19 infections. Among countries with over 10,000 cases over the prediction period, median total absolute percentage error was 33% (IQR: 18–59%) and 34% (IQR: 16–66%) for the first and second models respectively. Both models had comparable median total absolute percentage errors but lower maximum total absolute percentage errors as compared to the classical ARIMA model. A deep-learning approach using Bi-LSTM architecture and open-source data was validated on 190 countries to forecast the daily number of cases in the early stages of the COVID-19 outbreak. Fewer variables could potentially be used without impacting prediction accuracy. Nature Publishing Group UK 2023-10-20 /pmc/articles/PMC10589260/ /pubmed/37863921 http://dx.doi.org/10.1038/s41598-023-44924-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Aung, Nway Nway
Pang, Junxiong
Chua, Matthew Chin Heng
Tan, Hui Xing
A novel bidirectional LSTM deep learning approach for COVID-19 forecasting
title A novel bidirectional LSTM deep learning approach for COVID-19 forecasting
title_full A novel bidirectional LSTM deep learning approach for COVID-19 forecasting
title_fullStr A novel bidirectional LSTM deep learning approach for COVID-19 forecasting
title_full_unstemmed A novel bidirectional LSTM deep learning approach for COVID-19 forecasting
title_short A novel bidirectional LSTM deep learning approach for COVID-19 forecasting
title_sort novel bidirectional lstm deep learning approach for covid-19 forecasting
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10589260/
https://www.ncbi.nlm.nih.gov/pubmed/37863921
http://dx.doi.org/10.1038/s41598-023-44924-8
work_keys_str_mv AT aungnwaynway anovelbidirectionallstmdeeplearningapproachforcovid19forecasting
AT pangjunxiong anovelbidirectionallstmdeeplearningapproachforcovid19forecasting
AT chuamatthewchinheng anovelbidirectionallstmdeeplearningapproachforcovid19forecasting
AT tanhuixing anovelbidirectionallstmdeeplearningapproachforcovid19forecasting
AT aungnwaynway novelbidirectionallstmdeeplearningapproachforcovid19forecasting
AT pangjunxiong novelbidirectionallstmdeeplearningapproachforcovid19forecasting
AT chuamatthewchinheng novelbidirectionallstmdeeplearningapproachforcovid19forecasting
AT tanhuixing novelbidirectionallstmdeeplearningapproachforcovid19forecasting