Cargando…

Subregional Nowcasts of Seasonal Influenza Using Search Trends

BACKGROUND: Limiting the adverse effects of seasonal influenza outbreaks at state or city level requires close monitoring of localized outbreaks and reliable forecasts of their progression. Whereas forecasting models for influenza or influenza-like illness (ILI) are becoming increasingly available,...

Descripción completa

Detalles Bibliográficos
Autores principales: Kandula, Sasikiran, Hsu, Daniel, Shaman, Jeffrey
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5696582/
https://www.ncbi.nlm.nih.gov/pubmed/29109069
http://dx.doi.org/10.2196/jmir.7486
_version_ 1783280479251726336
author Kandula, Sasikiran
Hsu, Daniel
Shaman, Jeffrey
author_facet Kandula, Sasikiran
Hsu, Daniel
Shaman, Jeffrey
author_sort Kandula, Sasikiran
collection PubMed
description BACKGROUND: Limiting the adverse effects of seasonal influenza outbreaks at state or city level requires close monitoring of localized outbreaks and reliable forecasts of their progression. Whereas forecasting models for influenza or influenza-like illness (ILI) are becoming increasingly available, their applicability to localized outbreaks is limited by the nonavailability of real-time observations of the current outbreak state at local scales. Surveillance data collected by various health departments are widely accepted as the reference standard for estimating the state of outbreaks, and in the absence of surveillance data, nowcast proxies built using Web-based activities such as search engine queries, tweets, and access of health-related webpages can be useful. Nowcast estimates of state and municipal ILI were previously published by Google Flu Trends (GFT); however, validations of these estimates were seldom reported. OBJECTIVE: The aim of this study was to develop and validate models to nowcast ILI at subregional geographic scales. METHODS: We built nowcast models based on autoregressive (autoregressive integrated moving average; ARIMA) and supervised regression methods (Random forests) at the US state level using regional weighted ILI and Web-based search activity derived from Google's Extended Trends application programming interface. We validated the performance of these methods using actual surveillance data for the 50 states across six seasons. We also built state-level nowcast models using state-level estimates of ILI and compared the accuracy of these estimates with the estimates of the regional models extrapolated to the state level and with the nowcast estimates published by GFT. RESULTS: Models built using regional ILI extrapolated to state level had a median correlation of 0.84 (interquartile range: 0.74-0.91) and a median root mean square error (RMSE) of 1.01 (IQR: 0.74-1.50), with noticeable variability across seasons and by state population size. Model forms that hypothesize the availability of timely state-level surveillance data show significantly lower errors of 0.83 (0.55-0.23). Compared with GFT, the latter model forms have lower errors but also lower correlation. CONCLUSIONS: These results suggest that the proposed methods may be an alternative to the discontinued GFT and that further improvements in the quality of subregional nowcasts may require increased access to more finely resolved surveillance data.
format Online
Article
Text
id pubmed-5696582
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-56965822017-11-29 Subregional Nowcasts of Seasonal Influenza Using Search Trends Kandula, Sasikiran Hsu, Daniel Shaman, Jeffrey J Med Internet Res Original Paper BACKGROUND: Limiting the adverse effects of seasonal influenza outbreaks at state or city level requires close monitoring of localized outbreaks and reliable forecasts of their progression. Whereas forecasting models for influenza or influenza-like illness (ILI) are becoming increasingly available, their applicability to localized outbreaks is limited by the nonavailability of real-time observations of the current outbreak state at local scales. Surveillance data collected by various health departments are widely accepted as the reference standard for estimating the state of outbreaks, and in the absence of surveillance data, nowcast proxies built using Web-based activities such as search engine queries, tweets, and access of health-related webpages can be useful. Nowcast estimates of state and municipal ILI were previously published by Google Flu Trends (GFT); however, validations of these estimates were seldom reported. OBJECTIVE: The aim of this study was to develop and validate models to nowcast ILI at subregional geographic scales. METHODS: We built nowcast models based on autoregressive (autoregressive integrated moving average; ARIMA) and supervised regression methods (Random forests) at the US state level using regional weighted ILI and Web-based search activity derived from Google's Extended Trends application programming interface. We validated the performance of these methods using actual surveillance data for the 50 states across six seasons. We also built state-level nowcast models using state-level estimates of ILI and compared the accuracy of these estimates with the estimates of the regional models extrapolated to the state level and with the nowcast estimates published by GFT. RESULTS: Models built using regional ILI extrapolated to state level had a median correlation of 0.84 (interquartile range: 0.74-0.91) and a median root mean square error (RMSE) of 1.01 (IQR: 0.74-1.50), with noticeable variability across seasons and by state population size. Model forms that hypothesize the availability of timely state-level surveillance data show significantly lower errors of 0.83 (0.55-0.23). Compared with GFT, the latter model forms have lower errors but also lower correlation. CONCLUSIONS: These results suggest that the proposed methods may be an alternative to the discontinued GFT and that further improvements in the quality of subregional nowcasts may require increased access to more finely resolved surveillance data. JMIR Publications 2017-11-06 /pmc/articles/PMC5696582/ /pubmed/29109069 http://dx.doi.org/10.2196/jmir.7486 Text en ©Sasikiran Kandula, Daniel Hsu, Jeffrey Shaman. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 06.11.2017. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Kandula, Sasikiran
Hsu, Daniel
Shaman, Jeffrey
Subregional Nowcasts of Seasonal Influenza Using Search Trends
title Subregional Nowcasts of Seasonal Influenza Using Search Trends
title_full Subregional Nowcasts of Seasonal Influenza Using Search Trends
title_fullStr Subregional Nowcasts of Seasonal Influenza Using Search Trends
title_full_unstemmed Subregional Nowcasts of Seasonal Influenza Using Search Trends
title_short Subregional Nowcasts of Seasonal Influenza Using Search Trends
title_sort subregional nowcasts of seasonal influenza using search trends
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5696582/
https://www.ncbi.nlm.nih.gov/pubmed/29109069
http://dx.doi.org/10.2196/jmir.7486
work_keys_str_mv AT kandulasasikiran subregionalnowcastsofseasonalinfluenzausingsearchtrends
AT hsudaniel subregionalnowcastsofseasonalinfluenzausingsearchtrends
AT shamanjeffrey subregionalnowcastsofseasonalinfluenzausingsearchtrends