Cargando…

Predicting COVID-19 Incidence Through Analysis of Google Trends Data in Iran: Data Mining and Deep Learning Pilot Study

BACKGROUND: The recent global outbreak of coronavirus disease (COVID-19) is affecting many countries worldwide. Iran is one of the top 10 most affected countries. Search engines provide useful data from populations, and these data might be useful to analyze epidemics. Utilizing data mining methods o...

Descripción completa

Detalles Bibliográficos
Autores principales: Ayyoubzadeh, Seyed Mohammad, Ayyoubzadeh, Seyed Mehdi, Zahedi, Hoda, Ahmadi, Mahnaz, R Niakan Kalhori, Sharareh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7159058/
https://www.ncbi.nlm.nih.gov/pubmed/32234709
http://dx.doi.org/10.2196/18828
_version_ 1783522600905867264
author Ayyoubzadeh, Seyed Mohammad
Ayyoubzadeh, Seyed Mehdi
Zahedi, Hoda
Ahmadi, Mahnaz
R Niakan Kalhori, Sharareh
author_facet Ayyoubzadeh, Seyed Mohammad
Ayyoubzadeh, Seyed Mehdi
Zahedi, Hoda
Ahmadi, Mahnaz
R Niakan Kalhori, Sharareh
author_sort Ayyoubzadeh, Seyed Mohammad
collection PubMed
description BACKGROUND: The recent global outbreak of coronavirus disease (COVID-19) is affecting many countries worldwide. Iran is one of the top 10 most affected countries. Search engines provide useful data from populations, and these data might be useful to analyze epidemics. Utilizing data mining methods on electronic resources’ data might provide a better insight into the COVID-19 outbreak to manage the health crisis in each country and worldwide. OBJECTIVE: This study aimed to predict the incidence of COVID-19 in Iran. METHODS: Data were obtained from the Google Trends website. Linear regression and long short-term memory (LSTM) models were used to estimate the number of positive COVID-19 cases. All models were evaluated using 10-fold cross-validation, and root mean square error (RMSE) was used as the performance metric. RESULTS: The linear regression model predicted the incidence with an RMSE of 7.562 (SD 6.492). The most effective factors besides previous day incidence included the search frequency of handwashing, hand sanitizer, and antiseptic topics. The RMSE of the LSTM model was 27.187 (SD 20.705). CONCLUSIONS: Data mining algorithms can be employed to predict trends of outbreaks. This prediction might support policymakers and health care managers to plan and allocate health care resources accordingly.
format Online
Article
Text
id pubmed-7159058
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-71590582020-04-22 Predicting COVID-19 Incidence Through Analysis of Google Trends Data in Iran: Data Mining and Deep Learning Pilot Study Ayyoubzadeh, Seyed Mohammad Ayyoubzadeh, Seyed Mehdi Zahedi, Hoda Ahmadi, Mahnaz R Niakan Kalhori, Sharareh JMIR Public Health Surveill Original Paper BACKGROUND: The recent global outbreak of coronavirus disease (COVID-19) is affecting many countries worldwide. Iran is one of the top 10 most affected countries. Search engines provide useful data from populations, and these data might be useful to analyze epidemics. Utilizing data mining methods on electronic resources’ data might provide a better insight into the COVID-19 outbreak to manage the health crisis in each country and worldwide. OBJECTIVE: This study aimed to predict the incidence of COVID-19 in Iran. METHODS: Data were obtained from the Google Trends website. Linear regression and long short-term memory (LSTM) models were used to estimate the number of positive COVID-19 cases. All models were evaluated using 10-fold cross-validation, and root mean square error (RMSE) was used as the performance metric. RESULTS: The linear regression model predicted the incidence with an RMSE of 7.562 (SD 6.492). The most effective factors besides previous day incidence included the search frequency of handwashing, hand sanitizer, and antiseptic topics. The RMSE of the LSTM model was 27.187 (SD 20.705). CONCLUSIONS: Data mining algorithms can be employed to predict trends of outbreaks. This prediction might support policymakers and health care managers to plan and allocate health care resources accordingly. JMIR Publications 2020-04-14 /pmc/articles/PMC7159058/ /pubmed/32234709 http://dx.doi.org/10.2196/18828 Text en ©Seyed Mohammad Ayyoubzadeh, Seyed Mehdi Ayyoubzadeh, Hoda Zahedi, Mahnaz Ahmadi, Sharareh R Niakan Kalhori. Originally published in JMIR Public Health and Surveillance (http://publichealth.jmir.org), 14.04.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Public Health and Surveillance, is properly cited. The complete bibliographic information, a link to the original publication on http://publichealth.jmir.org, as well as this copyright and license information must be included.
spellingShingle Original Paper
Ayyoubzadeh, Seyed Mohammad
Ayyoubzadeh, Seyed Mehdi
Zahedi, Hoda
Ahmadi, Mahnaz
R Niakan Kalhori, Sharareh
Predicting COVID-19 Incidence Through Analysis of Google Trends Data in Iran: Data Mining and Deep Learning Pilot Study
title Predicting COVID-19 Incidence Through Analysis of Google Trends Data in Iran: Data Mining and Deep Learning Pilot Study
title_full Predicting COVID-19 Incidence Through Analysis of Google Trends Data in Iran: Data Mining and Deep Learning Pilot Study
title_fullStr Predicting COVID-19 Incidence Through Analysis of Google Trends Data in Iran: Data Mining and Deep Learning Pilot Study
title_full_unstemmed Predicting COVID-19 Incidence Through Analysis of Google Trends Data in Iran: Data Mining and Deep Learning Pilot Study
title_short Predicting COVID-19 Incidence Through Analysis of Google Trends Data in Iran: Data Mining and Deep Learning Pilot Study
title_sort predicting covid-19 incidence through analysis of google trends data in iran: data mining and deep learning pilot study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7159058/
https://www.ncbi.nlm.nih.gov/pubmed/32234709
http://dx.doi.org/10.2196/18828
work_keys_str_mv AT ayyoubzadehseyedmohammad predictingcovid19incidencethroughanalysisofgoogletrendsdatainirandatamininganddeeplearningpilotstudy
AT ayyoubzadehseyedmehdi predictingcovid19incidencethroughanalysisofgoogletrendsdatainirandatamininganddeeplearningpilotstudy
AT zahedihoda predictingcovid19incidencethroughanalysisofgoogletrendsdatainirandatamininganddeeplearningpilotstudy
AT ahmadimahnaz predictingcovid19incidencethroughanalysisofgoogletrendsdatainirandatamininganddeeplearningpilotstudy
AT rniakankalhorisharareh predictingcovid19incidencethroughanalysisofgoogletrendsdatainirandatamininganddeeplearningpilotstudy