Cargando…

Hybrid deep learning of social media big data for predicting the evolution of COVID-19 transmission

In this study, a hybrid deep-learning model termed as ODANN, built upon neural networks (NN) coupled with data assimilation and natural language processing (NLP) features extraction methods, has been constructed to concurrently process daily COVID-19 time-series records and large volumes of COVID-19...

Descripción completa

Detalles Bibliográficos
Autores principales: Chew, Alvin Wei Ze, Pan, Yue, Wang, Ying, Zhang, Limao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier B.V. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8522122/
https://www.ncbi.nlm.nih.gov/pubmed/34690447
http://dx.doi.org/10.1016/j.knosys.2021.107417
_version_ 1784585032212414464
author Chew, Alvin Wei Ze
Pan, Yue
Wang, Ying
Zhang, Limao
author_facet Chew, Alvin Wei Ze
Pan, Yue
Wang, Ying
Zhang, Limao
author_sort Chew, Alvin Wei Ze
collection PubMed
description In this study, a hybrid deep-learning model termed as ODANN, built upon neural networks (NN) coupled with data assimilation and natural language processing (NLP) features extraction methods, has been constructed to concurrently process daily COVID-19 time-series records and large volumes of COVID-19 related Twitter data, as representative of the global community’s aggregated emotional responses towards the current pandemic, to model the growth rate in the number of confirmed COVID-19 cases globally via a proposed G parameter. Overall, there were 3 key components to ODANN’s development phase, namely: (i) data hydration and pre-processing were performed on COVID-19 related Twitter data ranging between 23 January 2020 and 10 May 2020, which amounted to over 100 million Tweets written in English language; (ii) multiple NLP features extraction methods were subsequently leveraged to encode the hydrated Twitter data into useful semantic word vectors for training ODANN under an optimal set of hyperparameters; and (iii) historical time-series data of defined characteristics were also assimilated into ODANN’s selected hidden layer(s) to model the G parameter daily with a lead-time of 1 day. By far, our experimental results demonstrated that by adopting a rolling time-window size of 5 days, with respect to the number of historical time-series records for assimilating different data features, enabled ODANN to outperform other traditional time-series models and recent studies, in terms of the computed RMSE and MAE scores attained from the model’s testing step. Overall, the summarized results from ODANN demonstrated its competitive edge in modelling and forecasting the growth rate in the number of COVID-19 cases globally.
format Online
Article
Text
id pubmed-8522122
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier B.V.
record_format MEDLINE/PubMed
spelling pubmed-85221222021-10-18 Hybrid deep learning of social media big data for predicting the evolution of COVID-19 transmission Chew, Alvin Wei Ze Pan, Yue Wang, Ying Zhang, Limao Knowl Based Syst Article In this study, a hybrid deep-learning model termed as ODANN, built upon neural networks (NN) coupled with data assimilation and natural language processing (NLP) features extraction methods, has been constructed to concurrently process daily COVID-19 time-series records and large volumes of COVID-19 related Twitter data, as representative of the global community’s aggregated emotional responses towards the current pandemic, to model the growth rate in the number of confirmed COVID-19 cases globally via a proposed G parameter. Overall, there were 3 key components to ODANN’s development phase, namely: (i) data hydration and pre-processing were performed on COVID-19 related Twitter data ranging between 23 January 2020 and 10 May 2020, which amounted to over 100 million Tweets written in English language; (ii) multiple NLP features extraction methods were subsequently leveraged to encode the hydrated Twitter data into useful semantic word vectors for training ODANN under an optimal set of hyperparameters; and (iii) historical time-series data of defined characteristics were also assimilated into ODANN’s selected hidden layer(s) to model the G parameter daily with a lead-time of 1 day. By far, our experimental results demonstrated that by adopting a rolling time-window size of 5 days, with respect to the number of historical time-series records for assimilating different data features, enabled ODANN to outperform other traditional time-series models and recent studies, in terms of the computed RMSE and MAE scores attained from the model’s testing step. Overall, the summarized results from ODANN demonstrated its competitive edge in modelling and forecasting the growth rate in the number of COVID-19 cases globally. Elsevier B.V. 2021-12-05 2021-08-24 /pmc/articles/PMC8522122/ /pubmed/34690447 http://dx.doi.org/10.1016/j.knosys.2021.107417 Text en © 2021 Elsevier B.V. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Chew, Alvin Wei Ze
Pan, Yue
Wang, Ying
Zhang, Limao
Hybrid deep learning of social media big data for predicting the evolution of COVID-19 transmission
title Hybrid deep learning of social media big data for predicting the evolution of COVID-19 transmission
title_full Hybrid deep learning of social media big data for predicting the evolution of COVID-19 transmission
title_fullStr Hybrid deep learning of social media big data for predicting the evolution of COVID-19 transmission
title_full_unstemmed Hybrid deep learning of social media big data for predicting the evolution of COVID-19 transmission
title_short Hybrid deep learning of social media big data for predicting the evolution of COVID-19 transmission
title_sort hybrid deep learning of social media big data for predicting the evolution of covid-19 transmission
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8522122/
https://www.ncbi.nlm.nih.gov/pubmed/34690447
http://dx.doi.org/10.1016/j.knosys.2021.107417
work_keys_str_mv AT chewalvinweize hybriddeeplearningofsocialmediabigdataforpredictingtheevolutionofcovid19transmission
AT panyue hybriddeeplearningofsocialmediabigdataforpredictingtheevolutionofcovid19transmission
AT wangying hybriddeeplearningofsocialmediabigdataforpredictingtheevolutionofcovid19transmission
AT zhanglimao hybriddeeplearningofsocialmediabigdataforpredictingtheevolutionofcovid19transmission