Cargando…
Twitter conversations predict the daily confirmed COVID-19 cases
As of writing this paper, COVID-19 (Coronavirus disease 2019) has spread to more than 220 countries and territories. Following the outbreak, the pandemic’s seriousness has made people more active on social media, especially on the microblogging platforms such as Twitter and Weibo. The pandemic-speci...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier B.V.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9444159/ https://www.ncbi.nlm.nih.gov/pubmed/36092470 http://dx.doi.org/10.1016/j.asoc.2022.109603 |
_version_ | 1784783155124764672 |
---|---|
author | Lamsal, Rabindra Harwood, Aaron Read, Maria Rodriguez |
author_facet | Lamsal, Rabindra Harwood, Aaron Read, Maria Rodriguez |
author_sort | Lamsal, Rabindra |
collection | PubMed |
description | As of writing this paper, COVID-19 (Coronavirus disease 2019) has spread to more than 220 countries and territories. Following the outbreak, the pandemic’s seriousness has made people more active on social media, especially on the microblogging platforms such as Twitter and Weibo. The pandemic-specific discourse has remained on-trend on these platforms for months now. Previous studies have confirmed the contributions of such socially generated conversations towards situational awareness of crisis events. The early forecasts of cases are essential to authorities to estimate the requirements of resources needed to cope with the outgrowths of the virus. Therefore, this study attempts to incorporate the public discourse in the design of forecasting models particularly targeted for the steep-hill region of an ongoing wave. We propose a sentiment-involved topic-based latent variables search methodology for designing forecasting models from publicly available Twitter conversations. As a use case, we implement the proposed methodology on Australian COVID-19 daily cases and Twitter conversations generated within the country. Experimental results: (i) show the presence of latent social media variables that Granger-cause the daily COVID-19 confirmed cases, and (ii) confirm that those variables offer additional prediction capability to forecasting models. Further, the results show that the inclusion of social media variables introduces 48.83%–51.38% improvements on RMSE over the baseline models. We also release the large-scale COVID-19 specific geotagged global tweets dataset, MegaGeoCOV, to the public anticipating that the geotagged data of this scale would aid in understanding the conversational dynamics of the pandemic through other spatial and temporal contexts. |
format | Online Article Text |
id | pubmed-9444159 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Elsevier B.V. |
record_format | MEDLINE/PubMed |
spelling | pubmed-94441592022-09-06 Twitter conversations predict the daily confirmed COVID-19 cases Lamsal, Rabindra Harwood, Aaron Read, Maria Rodriguez Appl Soft Comput Article As of writing this paper, COVID-19 (Coronavirus disease 2019) has spread to more than 220 countries and territories. Following the outbreak, the pandemic’s seriousness has made people more active on social media, especially on the microblogging platforms such as Twitter and Weibo. The pandemic-specific discourse has remained on-trend on these platforms for months now. Previous studies have confirmed the contributions of such socially generated conversations towards situational awareness of crisis events. The early forecasts of cases are essential to authorities to estimate the requirements of resources needed to cope with the outgrowths of the virus. Therefore, this study attempts to incorporate the public discourse in the design of forecasting models particularly targeted for the steep-hill region of an ongoing wave. We propose a sentiment-involved topic-based latent variables search methodology for designing forecasting models from publicly available Twitter conversations. As a use case, we implement the proposed methodology on Australian COVID-19 daily cases and Twitter conversations generated within the country. Experimental results: (i) show the presence of latent social media variables that Granger-cause the daily COVID-19 confirmed cases, and (ii) confirm that those variables offer additional prediction capability to forecasting models. Further, the results show that the inclusion of social media variables introduces 48.83%–51.38% improvements on RMSE over the baseline models. We also release the large-scale COVID-19 specific geotagged global tweets dataset, MegaGeoCOV, to the public anticipating that the geotagged data of this scale would aid in understanding the conversational dynamics of the pandemic through other spatial and temporal contexts. Elsevier B.V. 2022-11 2022-09-05 /pmc/articles/PMC9444159/ /pubmed/36092470 http://dx.doi.org/10.1016/j.asoc.2022.109603 Text en © 2022 Elsevier B.V. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Lamsal, Rabindra Harwood, Aaron Read, Maria Rodriguez Twitter conversations predict the daily confirmed COVID-19 cases |
title | Twitter conversations predict the daily confirmed COVID-19 cases |
title_full | Twitter conversations predict the daily confirmed COVID-19 cases |
title_fullStr | Twitter conversations predict the daily confirmed COVID-19 cases |
title_full_unstemmed | Twitter conversations predict the daily confirmed COVID-19 cases |
title_short | Twitter conversations predict the daily confirmed COVID-19 cases |
title_sort | twitter conversations predict the daily confirmed covid-19 cases |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9444159/ https://www.ncbi.nlm.nih.gov/pubmed/36092470 http://dx.doi.org/10.1016/j.asoc.2022.109603 |
work_keys_str_mv | AT lamsalrabindra twitterconversationspredictthedailyconfirmedcovid19cases AT harwoodaaron twitterconversationspredictthedailyconfirmedcovid19cases AT readmariarodriguez twitterconversationspredictthedailyconfirmedcovid19cases |