Cargando…

Prediction of Number of Cases of 2019 Novel Coronavirus (COVID-19) Using Social Media Search Index

Predicting the number of new suspected or confirmed cases of novel coronavirus disease 2019 (COVID-19) is crucial in the prevention and control of the COVID-19 outbreak. Social media search indexes (SMSI) for dry cough, fever, chest distress, coronavirus, and pneumonia were collected from 31 Decembe...

Descripción completa

Detalles Bibliográficos
Autores principales: Qin, Lei, Sun, Qiang, Wang, Yidan, Wu, Ke-Fei, Chen, Mingchih, Shia, Ben-Chang, Wu, Szu-Yuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7177617/
https://www.ncbi.nlm.nih.gov/pubmed/32244425
http://dx.doi.org/10.3390/ijerph17072365
_version_ 1783525260695437312
author Qin, Lei
Sun, Qiang
Wang, Yidan
Wu, Ke-Fei
Chen, Mingchih
Shia, Ben-Chang
Wu, Szu-Yuan
author_facet Qin, Lei
Sun, Qiang
Wang, Yidan
Wu, Ke-Fei
Chen, Mingchih
Shia, Ben-Chang
Wu, Szu-Yuan
author_sort Qin, Lei
collection PubMed
description Predicting the number of new suspected or confirmed cases of novel coronavirus disease 2019 (COVID-19) is crucial in the prevention and control of the COVID-19 outbreak. Social media search indexes (SMSI) for dry cough, fever, chest distress, coronavirus, and pneumonia were collected from 31 December 2019 to 9 February 2020. The new suspected cases of COVID-19 data were collected from 20 January 2020 to 9 February 2020. We used the lagged series of SMSI to predict new suspected COVID-19 case numbers during this period. To avoid overfitting, five methods, namely subset selection, forward selection, lasso regression, ridge regression, and elastic net, were used to estimate coefficients. We selected the optimal method to predict new suspected COVID-19 case numbers from 20 January 2020 to 9 February 2020. We further validated the optimal method for new confirmed cases of COVID-19 from 31 December 2019 to 17 February 2020. The new suspected COVID-19 case numbers correlated significantly with the lagged series of SMSI. SMSI could be detected 6–9 days earlier than new suspected cases of COVID-19. The optimal method was the subset selection method, which had the lowest estimation error and a moderate number of predictors. The subset selection method also significantly correlated with the new confirmed COVID-19 cases after validation. SMSI findings on lag day 10 were significantly correlated with new confirmed COVID-19 cases. SMSI could be a significant predictor of the number of COVID-19 infections. SMSI could be an effective early predictor, which would enable governments’ health departments to locate potential and high-risk outbreak areas.
format Online
Article
Text
id pubmed-7177617
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-71776172020-04-28 Prediction of Number of Cases of 2019 Novel Coronavirus (COVID-19) Using Social Media Search Index Qin, Lei Sun, Qiang Wang, Yidan Wu, Ke-Fei Chen, Mingchih Shia, Ben-Chang Wu, Szu-Yuan Int J Environ Res Public Health Article Predicting the number of new suspected or confirmed cases of novel coronavirus disease 2019 (COVID-19) is crucial in the prevention and control of the COVID-19 outbreak. Social media search indexes (SMSI) for dry cough, fever, chest distress, coronavirus, and pneumonia were collected from 31 December 2019 to 9 February 2020. The new suspected cases of COVID-19 data were collected from 20 January 2020 to 9 February 2020. We used the lagged series of SMSI to predict new suspected COVID-19 case numbers during this period. To avoid overfitting, five methods, namely subset selection, forward selection, lasso regression, ridge regression, and elastic net, were used to estimate coefficients. We selected the optimal method to predict new suspected COVID-19 case numbers from 20 January 2020 to 9 February 2020. We further validated the optimal method for new confirmed cases of COVID-19 from 31 December 2019 to 17 February 2020. The new suspected COVID-19 case numbers correlated significantly with the lagged series of SMSI. SMSI could be detected 6–9 days earlier than new suspected cases of COVID-19. The optimal method was the subset selection method, which had the lowest estimation error and a moderate number of predictors. The subset selection method also significantly correlated with the new confirmed COVID-19 cases after validation. SMSI findings on lag day 10 were significantly correlated with new confirmed COVID-19 cases. SMSI could be a significant predictor of the number of COVID-19 infections. SMSI could be an effective early predictor, which would enable governments’ health departments to locate potential and high-risk outbreak areas. MDPI 2020-03-31 2020-04 /pmc/articles/PMC7177617/ /pubmed/32244425 http://dx.doi.org/10.3390/ijerph17072365 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Qin, Lei
Sun, Qiang
Wang, Yidan
Wu, Ke-Fei
Chen, Mingchih
Shia, Ben-Chang
Wu, Szu-Yuan
Prediction of Number of Cases of 2019 Novel Coronavirus (COVID-19) Using Social Media Search Index
title Prediction of Number of Cases of 2019 Novel Coronavirus (COVID-19) Using Social Media Search Index
title_full Prediction of Number of Cases of 2019 Novel Coronavirus (COVID-19) Using Social Media Search Index
title_fullStr Prediction of Number of Cases of 2019 Novel Coronavirus (COVID-19) Using Social Media Search Index
title_full_unstemmed Prediction of Number of Cases of 2019 Novel Coronavirus (COVID-19) Using Social Media Search Index
title_short Prediction of Number of Cases of 2019 Novel Coronavirus (COVID-19) Using Social Media Search Index
title_sort prediction of number of cases of 2019 novel coronavirus (covid-19) using social media search index
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7177617/
https://www.ncbi.nlm.nih.gov/pubmed/32244425
http://dx.doi.org/10.3390/ijerph17072365
work_keys_str_mv AT qinlei predictionofnumberofcasesof2019novelcoronaviruscovid19usingsocialmediasearchindex
AT sunqiang predictionofnumberofcasesof2019novelcoronaviruscovid19usingsocialmediasearchindex
AT wangyidan predictionofnumberofcasesof2019novelcoronaviruscovid19usingsocialmediasearchindex
AT wukefei predictionofnumberofcasesof2019novelcoronaviruscovid19usingsocialmediasearchindex
AT chenmingchih predictionofnumberofcasesof2019novelcoronaviruscovid19usingsocialmediasearchindex
AT shiabenchang predictionofnumberofcasesof2019novelcoronaviruscovid19usingsocialmediasearchindex
AT wuszuyuan predictionofnumberofcasesof2019novelcoronaviruscovid19usingsocialmediasearchindex