Cargando…
Regional Infoveillance of COVID-19 Case Rates: Analysis of Search-Engine Query Patterns
BACKGROUND: Timely allocation of medical resources for coronavirus disease (COVID-19) requires early detection of regional outbreaks. Internet browsing data may predict case outbreaks in local populations that are yet to be confirmed. OBJECTIVE: We investigated whether search-engine query patterns c...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7394521/ https://www.ncbi.nlm.nih.gov/pubmed/32692691 http://dx.doi.org/10.2196/19483 |
_version_ | 1783565241534119936 |
---|---|
author | Cousins, Henry C Cousins, Clara C Harris, Alon Pasquale, Louis R |
author_facet | Cousins, Henry C Cousins, Clara C Harris, Alon Pasquale, Louis R |
author_sort | Cousins, Henry C |
collection | PubMed |
description | BACKGROUND: Timely allocation of medical resources for coronavirus disease (COVID-19) requires early detection of regional outbreaks. Internet browsing data may predict case outbreaks in local populations that are yet to be confirmed. OBJECTIVE: We investigated whether search-engine query patterns can help to predict COVID-19 case rates at the state and metropolitan area levels in the United States. METHODS: We used regional confirmed case data from the New York Times and Google Trends results from 50 states and 166 county-based designated market areas (DMA). We identified search terms whose activity precedes and correlates with confirmed case rates at the national level. We used univariate regression to construct a composite explanatory variable based on best-fitting search queries offset by temporal lags. We measured the raw and z-transformed Pearson correlation and root-mean-square error (RMSE) of the explanatory variable with out-of-sample case rate data at the state and DMA levels. RESULTS: Predictions were highly correlated with confirmed case rates at the state (mean r=0.69, 95% CI 0.51-0.81; median RMSE 1.27, IQR 1.48) and DMA levels (mean r=0.51, 95% CI 0.39-0.61; median RMSE 4.38, IQR 1.80), using search data available up to 10 days prior to confirmed case rates. They fit case-rate activity in 49 of 50 states and in 103 of 166 DMA at a significance level of .05. CONCLUSIONS: Identifiable patterns in search query activity may help to predict emerging regional outbreaks of COVID-19, although they remain vulnerable to stochastic changes in search intensity. |
format | Online Article Text |
id | pubmed-7394521 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-73945212020-08-13 Regional Infoveillance of COVID-19 Case Rates: Analysis of Search-Engine Query Patterns Cousins, Henry C Cousins, Clara C Harris, Alon Pasquale, Louis R J Med Internet Res Original Paper BACKGROUND: Timely allocation of medical resources for coronavirus disease (COVID-19) requires early detection of regional outbreaks. Internet browsing data may predict case outbreaks in local populations that are yet to be confirmed. OBJECTIVE: We investigated whether search-engine query patterns can help to predict COVID-19 case rates at the state and metropolitan area levels in the United States. METHODS: We used regional confirmed case data from the New York Times and Google Trends results from 50 states and 166 county-based designated market areas (DMA). We identified search terms whose activity precedes and correlates with confirmed case rates at the national level. We used univariate regression to construct a composite explanatory variable based on best-fitting search queries offset by temporal lags. We measured the raw and z-transformed Pearson correlation and root-mean-square error (RMSE) of the explanatory variable with out-of-sample case rate data at the state and DMA levels. RESULTS: Predictions were highly correlated with confirmed case rates at the state (mean r=0.69, 95% CI 0.51-0.81; median RMSE 1.27, IQR 1.48) and DMA levels (mean r=0.51, 95% CI 0.39-0.61; median RMSE 4.38, IQR 1.80), using search data available up to 10 days prior to confirmed case rates. They fit case-rate activity in 49 of 50 states and in 103 of 166 DMA at a significance level of .05. CONCLUSIONS: Identifiable patterns in search query activity may help to predict emerging regional outbreaks of COVID-19, although they remain vulnerable to stochastic changes in search intensity. JMIR Publications 2020-07-30 /pmc/articles/PMC7394521/ /pubmed/32692691 http://dx.doi.org/10.2196/19483 Text en ©Henry C Cousins, Clara C Cousins, Alon Harris, Louis R Pasquale. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 30.07.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Cousins, Henry C Cousins, Clara C Harris, Alon Pasquale, Louis R Regional Infoveillance of COVID-19 Case Rates: Analysis of Search-Engine Query Patterns |
title | Regional Infoveillance of COVID-19 Case Rates: Analysis of Search-Engine Query Patterns |
title_full | Regional Infoveillance of COVID-19 Case Rates: Analysis of Search-Engine Query Patterns |
title_fullStr | Regional Infoveillance of COVID-19 Case Rates: Analysis of Search-Engine Query Patterns |
title_full_unstemmed | Regional Infoveillance of COVID-19 Case Rates: Analysis of Search-Engine Query Patterns |
title_short | Regional Infoveillance of COVID-19 Case Rates: Analysis of Search-Engine Query Patterns |
title_sort | regional infoveillance of covid-19 case rates: analysis of search-engine query patterns |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7394521/ https://www.ncbi.nlm.nih.gov/pubmed/32692691 http://dx.doi.org/10.2196/19483 |
work_keys_str_mv | AT cousinshenryc regionalinfoveillanceofcovid19caseratesanalysisofsearchenginequerypatterns AT cousinsclarac regionalinfoveillanceofcovid19caseratesanalysisofsearchenginequerypatterns AT harrisalon regionalinfoveillanceofcovid19caseratesanalysisofsearchenginequerypatterns AT pasqualelouisr regionalinfoveillanceofcovid19caseratesanalysisofsearchenginequerypatterns |