Cargando…
Advances in nowcasting influenza-like illness rates using search query logs
User-generated content can assist epidemiological surveillance in the early detection and prevalence estimation of infectious diseases, such as influenza. Google Flu Trends embodies the first public platform for transforming search queries to indications about the current state of flu in various pla...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4522652/ https://www.ncbi.nlm.nih.gov/pubmed/26234783 http://dx.doi.org/10.1038/srep12760 |
_version_ | 1782383987225788416 |
---|---|
author | Lampos, Vasileios Miller, Andrew C. Crossan, Steve Stefansen, Christian |
author_facet | Lampos, Vasileios Miller, Andrew C. Crossan, Steve Stefansen, Christian |
author_sort | Lampos, Vasileios |
collection | PubMed |
description | User-generated content can assist epidemiological surveillance in the early detection and prevalence estimation of infectious diseases, such as influenza. Google Flu Trends embodies the first public platform for transforming search queries to indications about the current state of flu in various places all over the world. However, the original model significantly mispredicted influenza-like illness rates in the US during the 2012–13 flu season. In this work, we build on the previous modeling attempt, proposing substantial improvements. Firstly, we investigate the performance of a widely used linear regularized regression solver, known as the Elastic Net. Then, we expand on this model by incorporating the queries selected by the Elastic Net into a nonlinear regression framework, based on a composite Gaussian Process. Finally, we augment the query-only predictions with an autoregressive model, injecting prior knowledge about the disease. We assess predictive performance using five consecutive flu seasons spanning from 2008 to 2013 and qualitatively explain certain shortcomings of the previous approach. Our results indicate that a nonlinear query modeling approach delivers the lowest cumulative nowcasting error, and also suggest that query information significantly improves autoregressive inferences, obtaining state-of-the-art performance. |
format | Online Article Text |
id | pubmed-4522652 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-45226522015-08-06 Advances in nowcasting influenza-like illness rates using search query logs Lampos, Vasileios Miller, Andrew C. Crossan, Steve Stefansen, Christian Sci Rep Article User-generated content can assist epidemiological surveillance in the early detection and prevalence estimation of infectious diseases, such as influenza. Google Flu Trends embodies the first public platform for transforming search queries to indications about the current state of flu in various places all over the world. However, the original model significantly mispredicted influenza-like illness rates in the US during the 2012–13 flu season. In this work, we build on the previous modeling attempt, proposing substantial improvements. Firstly, we investigate the performance of a widely used linear regularized regression solver, known as the Elastic Net. Then, we expand on this model by incorporating the queries selected by the Elastic Net into a nonlinear regression framework, based on a composite Gaussian Process. Finally, we augment the query-only predictions with an autoregressive model, injecting prior knowledge about the disease. We assess predictive performance using five consecutive flu seasons spanning from 2008 to 2013 and qualitatively explain certain shortcomings of the previous approach. Our results indicate that a nonlinear query modeling approach delivers the lowest cumulative nowcasting error, and also suggest that query information significantly improves autoregressive inferences, obtaining state-of-the-art performance. Nature Publishing Group 2015-08-03 /pmc/articles/PMC4522652/ /pubmed/26234783 http://dx.doi.org/10.1038/srep12760 Text en Copyright © 2015, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Lampos, Vasileios Miller, Andrew C. Crossan, Steve Stefansen, Christian Advances in nowcasting influenza-like illness rates using search query logs |
title | Advances in nowcasting influenza-like illness rates using search query logs |
title_full | Advances in nowcasting influenza-like illness rates using search query logs |
title_fullStr | Advances in nowcasting influenza-like illness rates using search query logs |
title_full_unstemmed | Advances in nowcasting influenza-like illness rates using search query logs |
title_short | Advances in nowcasting influenza-like illness rates using search query logs |
title_sort | advances in nowcasting influenza-like illness rates using search query logs |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4522652/ https://www.ncbi.nlm.nih.gov/pubmed/26234783 http://dx.doi.org/10.1038/srep12760 |
work_keys_str_mv | AT lamposvasileios advancesinnowcastinginfluenzalikeillnessratesusingsearchquerylogs AT millerandrewc advancesinnowcastinginfluenzalikeillnessratesusingsearchquerylogs AT crossansteve advancesinnowcastinginfluenzalikeillnessratesusingsearchquerylogs AT stefansenchristian advancesinnowcastinginfluenzalikeillnessratesusingsearchquerylogs |