Cargando…

The added value of online user-generated content in traditional methods for influenza surveillance

There has been considerable work in evaluating the efficacy of using online data for health surveillance. Often comparisons with baseline data involve various squared error and correlation metrics. While useful, these overlook a variety of other factors important to public health bodies considering...

Descripción completa

Detalles Bibliográficos
Autores principales: Wagner, Moritz, Lampos, Vasileios, Cox, Ingemar J., Pebody, Richard
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6143510/
https://www.ncbi.nlm.nih.gov/pubmed/30228285
http://dx.doi.org/10.1038/s41598-018-32029-6
_version_ 1783355983160934400
author Wagner, Moritz
Lampos, Vasileios
Cox, Ingemar J.
Pebody, Richard
author_facet Wagner, Moritz
Lampos, Vasileios
Cox, Ingemar J.
Pebody, Richard
author_sort Wagner, Moritz
collection PubMed
description There has been considerable work in evaluating the efficacy of using online data for health surveillance. Often comparisons with baseline data involve various squared error and correlation metrics. While useful, these overlook a variety of other factors important to public health bodies considering the adoption of such methods. In this paper, a proposed surveillance system that incorporates models based on recent research efforts is evaluated in terms of its added value for influenza surveillance at Public Health England. The system comprises of two supervised learning approaches trained on influenza-like illness (ILI) rates provided by the Royal College of General Practitioners (RCGP) and produces ILI estimates using Twitter posts or Google search queries. RCGP ILI rates for different age groups and laboratory confirmed cases by influenza type are used to evaluate the models with a particular focus on predicting the onset, overall intensity, peak activity and duration of the 2015/16 influenza season. We show that the Twitter-based models perform poorly and hypothesise that this is mostly due to the sparsity of the data available and a limited training period. Conversely, the Google-based model provides accurate estimates with timeliness of approximately one week and has the potential to complement current surveillance systems.
format Online
Article
Text
id pubmed-6143510
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-61435102018-09-20 The added value of online user-generated content in traditional methods for influenza surveillance Wagner, Moritz Lampos, Vasileios Cox, Ingemar J. Pebody, Richard Sci Rep Article There has been considerable work in evaluating the efficacy of using online data for health surveillance. Often comparisons with baseline data involve various squared error and correlation metrics. While useful, these overlook a variety of other factors important to public health bodies considering the adoption of such methods. In this paper, a proposed surveillance system that incorporates models based on recent research efforts is evaluated in terms of its added value for influenza surveillance at Public Health England. The system comprises of two supervised learning approaches trained on influenza-like illness (ILI) rates provided by the Royal College of General Practitioners (RCGP) and produces ILI estimates using Twitter posts or Google search queries. RCGP ILI rates for different age groups and laboratory confirmed cases by influenza type are used to evaluate the models with a particular focus on predicting the onset, overall intensity, peak activity and duration of the 2015/16 influenza season. We show that the Twitter-based models perform poorly and hypothesise that this is mostly due to the sparsity of the data available and a limited training period. Conversely, the Google-based model provides accurate estimates with timeliness of approximately one week and has the potential to complement current surveillance systems. Nature Publishing Group UK 2018-09-18 /pmc/articles/PMC6143510/ /pubmed/30228285 http://dx.doi.org/10.1038/s41598-018-32029-6 Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Wagner, Moritz
Lampos, Vasileios
Cox, Ingemar J.
Pebody, Richard
The added value of online user-generated content in traditional methods for influenza surveillance
title The added value of online user-generated content in traditional methods for influenza surveillance
title_full The added value of online user-generated content in traditional methods for influenza surveillance
title_fullStr The added value of online user-generated content in traditional methods for influenza surveillance
title_full_unstemmed The added value of online user-generated content in traditional methods for influenza surveillance
title_short The added value of online user-generated content in traditional methods for influenza surveillance
title_sort added value of online user-generated content in traditional methods for influenza surveillance
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6143510/
https://www.ncbi.nlm.nih.gov/pubmed/30228285
http://dx.doi.org/10.1038/s41598-018-32029-6
work_keys_str_mv AT wagnermoritz theaddedvalueofonlineusergeneratedcontentintraditionalmethodsforinfluenzasurveillance
AT lamposvasileios theaddedvalueofonlineusergeneratedcontentintraditionalmethodsforinfluenzasurveillance
AT coxingemarj theaddedvalueofonlineusergeneratedcontentintraditionalmethodsforinfluenzasurveillance
AT pebodyrichard theaddedvalueofonlineusergeneratedcontentintraditionalmethodsforinfluenzasurveillance
AT wagnermoritz addedvalueofonlineusergeneratedcontentintraditionalmethodsforinfluenzasurveillance
AT lamposvasileios addedvalueofonlineusergeneratedcontentintraditionalmethodsforinfluenzasurveillance
AT coxingemarj addedvalueofonlineusergeneratedcontentintraditionalmethodsforinfluenzasurveillance
AT pebodyrichard addedvalueofonlineusergeneratedcontentintraditionalmethodsforinfluenzasurveillance