Cargando…
Twitter Improves Influenza Forecasting
Accurate disease forecasts are imperative when preparing for influenza epidemic outbreaks; nevertheless, these forecasts are often limited by the time required to collect new, accurate data. In this paper, we show that data from the microblogging community Twitter significantly improves influenza fo...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4234396/ https://www.ncbi.nlm.nih.gov/pubmed/25642377 http://dx.doi.org/10.1371/currents.outbreaks.90b9ed0f59bae4ccaa683a39865d9117 |
_version_ | 1782344850834718720 |
---|---|
author | Paul, Michael J. Dredze, Mark Broniatowski, David |
author_facet | Paul, Michael J. Dredze, Mark Broniatowski, David |
author_sort | Paul, Michael J. |
collection | PubMed |
description | Accurate disease forecasts are imperative when preparing for influenza epidemic outbreaks; nevertheless, these forecasts are often limited by the time required to collect new, accurate data. In this paper, we show that data from the microblogging community Twitter significantly improves influenza forecasting. Most prior influenza forecast models are tested against historical influenza-like illness (ILI) data from the U.S. Centers for Disease Control and Prevention (CDC). These data are released with a one-week lag and are often initially inaccurate until the CDC revises them weeks later. Since previous studies utilize the final, revised data in evaluation, their evaluations do not properly determine the effectiveness of forecasting. Our experiments using ILI data available at the time of the forecast show that models incorporating data derived from Twitter can reduce forecasting error by 17-30% over a baseline that only uses historical data. For a given level of accuracy, using Twitter data produces forecasts that are two to four weeks ahead of baseline models. Additionally, we find that models using Twitter data are, on average, better predictors of influenza prevalence than are models using data from Google Flu Trends, the leading web data source. |
format | Online Article Text |
id | pubmed-4234396 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-42343962015-01-29 Twitter Improves Influenza Forecasting Paul, Michael J. Dredze, Mark Broniatowski, David PLoS Curr Research Accurate disease forecasts are imperative when preparing for influenza epidemic outbreaks; nevertheless, these forecasts are often limited by the time required to collect new, accurate data. In this paper, we show that data from the microblogging community Twitter significantly improves influenza forecasting. Most prior influenza forecast models are tested against historical influenza-like illness (ILI) data from the U.S. Centers for Disease Control and Prevention (CDC). These data are released with a one-week lag and are often initially inaccurate until the CDC revises them weeks later. Since previous studies utilize the final, revised data in evaluation, their evaluations do not properly determine the effectiveness of forecasting. Our experiments using ILI data available at the time of the forecast show that models incorporating data derived from Twitter can reduce forecasting error by 17-30% over a baseline that only uses historical data. For a given level of accuracy, using Twitter data produces forecasts that are two to four weeks ahead of baseline models. Additionally, we find that models using Twitter data are, on average, better predictors of influenza prevalence than are models using data from Google Flu Trends, the leading web data source. Public Library of Science 2014-10-28 /pmc/articles/PMC4234396/ /pubmed/25642377 http://dx.doi.org/10.1371/currents.outbreaks.90b9ed0f59bae4ccaa683a39865d9117 Text en http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Paul, Michael J. Dredze, Mark Broniatowski, David Twitter Improves Influenza Forecasting |
title | Twitter Improves Influenza Forecasting |
title_full | Twitter Improves Influenza Forecasting |
title_fullStr | Twitter Improves Influenza Forecasting |
title_full_unstemmed | Twitter Improves Influenza Forecasting |
title_short | Twitter Improves Influenza Forecasting |
title_sort | twitter improves influenza forecasting |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4234396/ https://www.ncbi.nlm.nih.gov/pubmed/25642377 http://dx.doi.org/10.1371/currents.outbreaks.90b9ed0f59bae4ccaa683a39865d9117 |
work_keys_str_mv | AT paulmichaelj twitterimprovesinfluenzaforecasting AT dredzemark twitterimprovesinfluenzaforecasting AT broniatowskidavid twitterimprovesinfluenzaforecasting |