Cargando…

Advances in nowcasting influenza-like illness rates using search query logs

User-generated content can assist epidemiological surveillance in the early detection and prevalence estimation of infectious diseases, such as influenza. Google Flu Trends embodies the first public platform for transforming search queries to indications about the current state of flu in various pla...

Descripción completa

Detalles Bibliográficos
Autores principales: Lampos, Vasileios, Miller, Andrew C., Crossan, Steve, Stefansen, Christian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4522652/
https://www.ncbi.nlm.nih.gov/pubmed/26234783
http://dx.doi.org/10.1038/srep12760
_version_ 1782383987225788416
author Lampos, Vasileios
Miller, Andrew C.
Crossan, Steve
Stefansen, Christian
author_facet Lampos, Vasileios
Miller, Andrew C.
Crossan, Steve
Stefansen, Christian
author_sort Lampos, Vasileios
collection PubMed
description User-generated content can assist epidemiological surveillance in the early detection and prevalence estimation of infectious diseases, such as influenza. Google Flu Trends embodies the first public platform for transforming search queries to indications about the current state of flu in various places all over the world. However, the original model significantly mispredicted influenza-like illness rates in the US during the 2012–13 flu season. In this work, we build on the previous modeling attempt, proposing substantial improvements. Firstly, we investigate the performance of a widely used linear regularized regression solver, known as the Elastic Net. Then, we expand on this model by incorporating the queries selected by the Elastic Net into a nonlinear regression framework, based on a composite Gaussian Process. Finally, we augment the query-only predictions with an autoregressive model, injecting prior knowledge about the disease. We assess predictive performance using five consecutive flu seasons spanning from 2008 to 2013 and qualitatively explain certain shortcomings of the previous approach. Our results indicate that a nonlinear query modeling approach delivers the lowest cumulative nowcasting error, and also suggest that query information significantly improves autoregressive inferences, obtaining state-of-the-art performance.
format Online
Article
Text
id pubmed-4522652
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-45226522015-08-06 Advances in nowcasting influenza-like illness rates using search query logs Lampos, Vasileios Miller, Andrew C. Crossan, Steve Stefansen, Christian Sci Rep Article User-generated content can assist epidemiological surveillance in the early detection and prevalence estimation of infectious diseases, such as influenza. Google Flu Trends embodies the first public platform for transforming search queries to indications about the current state of flu in various places all over the world. However, the original model significantly mispredicted influenza-like illness rates in the US during the 2012–13 flu season. In this work, we build on the previous modeling attempt, proposing substantial improvements. Firstly, we investigate the performance of a widely used linear regularized regression solver, known as the Elastic Net. Then, we expand on this model by incorporating the queries selected by the Elastic Net into a nonlinear regression framework, based on a composite Gaussian Process. Finally, we augment the query-only predictions with an autoregressive model, injecting prior knowledge about the disease. We assess predictive performance using five consecutive flu seasons spanning from 2008 to 2013 and qualitatively explain certain shortcomings of the previous approach. Our results indicate that a nonlinear query modeling approach delivers the lowest cumulative nowcasting error, and also suggest that query information significantly improves autoregressive inferences, obtaining state-of-the-art performance. Nature Publishing Group 2015-08-03 /pmc/articles/PMC4522652/ /pubmed/26234783 http://dx.doi.org/10.1038/srep12760 Text en Copyright © 2015, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Lampos, Vasileios
Miller, Andrew C.
Crossan, Steve
Stefansen, Christian
Advances in nowcasting influenza-like illness rates using search query logs
title Advances in nowcasting influenza-like illness rates using search query logs
title_full Advances in nowcasting influenza-like illness rates using search query logs
title_fullStr Advances in nowcasting influenza-like illness rates using search query logs
title_full_unstemmed Advances in nowcasting influenza-like illness rates using search query logs
title_short Advances in nowcasting influenza-like illness rates using search query logs
title_sort advances in nowcasting influenza-like illness rates using search query logs
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4522652/
https://www.ncbi.nlm.nih.gov/pubmed/26234783
http://dx.doi.org/10.1038/srep12760
work_keys_str_mv AT lamposvasileios advancesinnowcastinginfluenzalikeillnessratesusingsearchquerylogs
AT millerandrewc advancesinnowcastinginfluenzalikeillnessratesusingsearchquerylogs
AT crossansteve advancesinnowcastinginfluenzalikeillnessratesusingsearchquerylogs
AT stefansenchristian advancesinnowcastinginfluenzalikeillnessratesusingsearchquerylogs