Cargando…

Reappraising the utility of Google Flu Trends

Estimation of influenza-like illness (ILI) using search trends activity was intended to supplement traditional surveillance systems, and was a motivation behind the development of Google Flu Trends (GFT). However, several studies have previously reported large errors in GFT estimates of ILI in the U...

Descripción completa

Detalles Bibliográficos
Autores principales:	Kandula, Sasikiran, Shaman, Jeffrey
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2019
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6693776/ https://www.ncbi.nlm.nih.gov/pubmed/31374088 http://dx.doi.org/10.1371/journal.pcbi.1007258

_version_	1783443737110642688
author	Kandula, Sasikiran Shaman, Jeffrey
author_facet	Kandula, Sasikiran Shaman, Jeffrey
author_sort	Kandula, Sasikiran
collection	PubMed
description	Estimation of influenza-like illness (ILI) using search trends activity was intended to supplement traditional surveillance systems, and was a motivation behind the development of Google Flu Trends (GFT). However, several studies have previously reported large errors in GFT estimates of ILI in the US. Following recent release of time-stamped surveillance data, which better reflects real-time operational scenarios, we reanalyzed GFT errors. Using three data sources—GFT: an archive of weekly ILI estimates from Google Flu Trends; ILIf: fully-observed ILI rates from ILINet; and, ILIp: ILI rates available in real-time based on partial reporting—five influenza seasons were analyzed and mean square errors (MSE) of GFT and ILIp as estimates of ILIf were computed. To correct GFT errors, a random forest regression model was built with ILI and GFT rates from the previous three weeks as predictors. An overall reduction in error of 44% was observed and the errors of the corrected GFT are lower than those of ILIp. An 80% reduction in error during 2012/13, when GFT had large errors, shows that extreme failures of GFT could have been avoided. Using autoregressive integrated moving average (ARIMA) models, one- to four-week ahead forecasts were generated with two separate data streams: ILIp alone, and with both ILIp and corrected GFT. At all forecast targets and seasons, and for all but two regions, inclusion of GFT lowered MSE. Results from two alternative error measures, mean absolute error and mean absolute proportional error, were largely consistent with results from MSE. Taken together these findings provide an error profile of GFT in the US, establish strong evidence for the adoption of search trends based 'nowcasts' in influenza forecast systems, and encourage reevaluation of the utility of this data source in diverse domains.
format	Online Article Text
id	pubmed-6693776
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-66937762019-08-16 Reappraising the utility of Google Flu Trends Kandula, Sasikiran Shaman, Jeffrey PLoS Comput Biol Research Article Estimation of influenza-like illness (ILI) using search trends activity was intended to supplement traditional surveillance systems, and was a motivation behind the development of Google Flu Trends (GFT). However, several studies have previously reported large errors in GFT estimates of ILI in the US. Following recent release of time-stamped surveillance data, which better reflects real-time operational scenarios, we reanalyzed GFT errors. Using three data sources—GFT: an archive of weekly ILI estimates from Google Flu Trends; ILIf: fully-observed ILI rates from ILINet; and, ILIp: ILI rates available in real-time based on partial reporting—five influenza seasons were analyzed and mean square errors (MSE) of GFT and ILIp as estimates of ILIf were computed. To correct GFT errors, a random forest regression model was built with ILI and GFT rates from the previous three weeks as predictors. An overall reduction in error of 44% was observed and the errors of the corrected GFT are lower than those of ILIp. An 80% reduction in error during 2012/13, when GFT had large errors, shows that extreme failures of GFT could have been avoided. Using autoregressive integrated moving average (ARIMA) models, one- to four-week ahead forecasts were generated with two separate data streams: ILIp alone, and with both ILIp and corrected GFT. At all forecast targets and seasons, and for all but two regions, inclusion of GFT lowered MSE. Results from two alternative error measures, mean absolute error and mean absolute proportional error, were largely consistent with results from MSE. Taken together these findings provide an error profile of GFT in the US, establish strong evidence for the adoption of search trends based 'nowcasts' in influenza forecast systems, and encourage reevaluation of the utility of this data source in diverse domains. Public Library of Science 2019-08-02 /pmc/articles/PMC6693776/ /pubmed/31374088 http://dx.doi.org/10.1371/journal.pcbi.1007258 Text en © 2019 Kandula, Shaman http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Kandula, Sasikiran Shaman, Jeffrey Reappraising the utility of Google Flu Trends
title	Reappraising the utility of Google Flu Trends
title_full	Reappraising the utility of Google Flu Trends
title_fullStr	Reappraising the utility of Google Flu Trends
title_full_unstemmed	Reappraising the utility of Google Flu Trends
title_short	Reappraising the utility of Google Flu Trends
title_sort	reappraising the utility of google flu trends
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6693776/ https://www.ncbi.nlm.nih.gov/pubmed/31374088 http://dx.doi.org/10.1371/journal.pcbi.1007258
work_keys_str_mv	AT kandulasasikiran reappraisingtheutilityofgoogleflutrends AT shamanjeffrey reappraisingtheutilityofgoogleflutrends

Reappraising the utility of Google Flu Trends

Ejemplares similares