Cargando…

Twitter Improves Influenza Forecasting

Accurate disease forecasts are imperative when preparing for influenza epidemic outbreaks; nevertheless, these forecasts are often limited by the time required to collect new, accurate data. In this paper, we show that data from the microblogging community Twitter significantly improves influenza fo...

Descripción completa

Detalles Bibliográficos
Autores principales: Paul, Michael J., Dredze, Mark, Broniatowski, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4234396/
https://www.ncbi.nlm.nih.gov/pubmed/25642377
http://dx.doi.org/10.1371/currents.outbreaks.90b9ed0f59bae4ccaa683a39865d9117
_version_ 1782344850834718720
author Paul, Michael J.
Dredze, Mark
Broniatowski, David
author_facet Paul, Michael J.
Dredze, Mark
Broniatowski, David
author_sort Paul, Michael J.
collection PubMed
description Accurate disease forecasts are imperative when preparing for influenza epidemic outbreaks; nevertheless, these forecasts are often limited by the time required to collect new, accurate data. In this paper, we show that data from the microblogging community Twitter significantly improves influenza forecasting. Most prior influenza forecast models are tested against historical influenza-like illness (ILI) data from the U.S. Centers for Disease Control and Prevention (CDC). These data are released with a one-week lag and are often initially inaccurate until the CDC revises them weeks later. Since previous studies utilize the final, revised data in evaluation, their evaluations do not properly determine the effectiveness of forecasting. Our experiments using ILI data available at the time of the forecast show that models incorporating data derived from Twitter can reduce forecasting error by 17-30% over a baseline that only uses historical data. For a given level of accuracy, using Twitter data produces forecasts that are two to four weeks ahead of baseline models. Additionally, we find that models using Twitter data are, on average, better predictors of influenza prevalence than are models using data from Google Flu Trends, the leading web data source.
format Online
Article
Text
id pubmed-4234396
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-42343962015-01-29 Twitter Improves Influenza Forecasting Paul, Michael J. Dredze, Mark Broniatowski, David PLoS Curr Research Accurate disease forecasts are imperative when preparing for influenza epidemic outbreaks; nevertheless, these forecasts are often limited by the time required to collect new, accurate data. In this paper, we show that data from the microblogging community Twitter significantly improves influenza forecasting. Most prior influenza forecast models are tested against historical influenza-like illness (ILI) data from the U.S. Centers for Disease Control and Prevention (CDC). These data are released with a one-week lag and are often initially inaccurate until the CDC revises them weeks later. Since previous studies utilize the final, revised data in evaluation, their evaluations do not properly determine the effectiveness of forecasting. Our experiments using ILI data available at the time of the forecast show that models incorporating data derived from Twitter can reduce forecasting error by 17-30% over a baseline that only uses historical data. For a given level of accuracy, using Twitter data produces forecasts that are two to four weeks ahead of baseline models. Additionally, we find that models using Twitter data are, on average, better predictors of influenza prevalence than are models using data from Google Flu Trends, the leading web data source. Public Library of Science 2014-10-28 /pmc/articles/PMC4234396/ /pubmed/25642377 http://dx.doi.org/10.1371/currents.outbreaks.90b9ed0f59bae4ccaa683a39865d9117 Text en http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research
Paul, Michael J.
Dredze, Mark
Broniatowski, David
Twitter Improves Influenza Forecasting
title Twitter Improves Influenza Forecasting
title_full Twitter Improves Influenza Forecasting
title_fullStr Twitter Improves Influenza Forecasting
title_full_unstemmed Twitter Improves Influenza Forecasting
title_short Twitter Improves Influenza Forecasting
title_sort twitter improves influenza forecasting
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4234396/
https://www.ncbi.nlm.nih.gov/pubmed/25642377
http://dx.doi.org/10.1371/currents.outbreaks.90b9ed0f59bae4ccaa683a39865d9117
work_keys_str_mv AT paulmichaelj twitterimprovesinfluenzaforecasting
AT dredzemark twitterimprovesinfluenzaforecasting
AT broniatowskidavid twitterimprovesinfluenzaforecasting