Cargando…

Twitter Influenza Surveillance: Quantifying Seasonal Misdiagnosis Patterns and their Impact on Surveillance Estimates

BACKGROUND: Influenza (flu) surveillance using Twitter data can potentially save lives and increase efficiency by providing governments and healthcare organizations with greater situational awareness. However, research is needed to determine the impact of Twitter users’ misdiagnoses on surveillance...

Descripción completa

Detalles Bibliográficos
Autor principal: Mowery, Jared
Formato: Online Artículo Texto
Lenguaje:English
Publicado: University of Illinois at Chicago Library 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5302465/
https://www.ncbi.nlm.nih.gov/pubmed/28210419
http://dx.doi.org/10.5210/ojphi.v8i3.7011
_version_ 1782506548462878720
author Mowery, Jared
author_facet Mowery, Jared
author_sort Mowery, Jared
collection PubMed
description BACKGROUND: Influenza (flu) surveillance using Twitter data can potentially save lives and increase efficiency by providing governments and healthcare organizations with greater situational awareness. However, research is needed to determine the impact of Twitter users’ misdiagnoses on surveillance estimates. OBJECTIVE: This study establishes the importance of Twitter users’ misdiagnoses by showing that Twitter flu surveillance in the United States failed during the 2011-2012 flu season, estimates the extent of misdiagnoses, and tests several methods for reducing the adverse effects of misdiagnoses. METHODS: Metrics representing flu prevalence, seasonal misdiagnosis patterns, diagnosis uncertainty, flu symptoms, and noise were produced using Twitter data in conjunction with OpenSextant for geo-inferencing, and a maximum entropy classifier for identifying tweets related to illness. These metrics were tested for correlations with World Health Organization (WHO) positive specimen counts of flu from 2011 to 2014. RESULTS: Twitter flu surveillance erroneously indicated a typical flu season during 2011-2012, even though the flu season peaked three months late, and erroneously indicated plateaus of flu tweets before the 2012-2013 and 2013-2014 flu seasons. Enhancements based on estimates of misdiagnoses removed the erroneous plateaus and increased the Pearson correlation coefficients by .04 and .23, but failed to correct the 2011-2012 flu season estimate. A rough estimate indicates that approximately 40% of flu tweets reflected misdiagnoses. CONCLUSIONS: Further research into factors affecting Twitter users’ misdiagnoses, in conjunction with data from additional atypical flu seasons, is needed to enable Twitter flu surveillance systems to produce reliable estimates during atypical flu seasons.
format Online
Article
Text
id pubmed-5302465
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher University of Illinois at Chicago Library
record_format MEDLINE/PubMed
spelling pubmed-53024652017-02-16 Twitter Influenza Surveillance: Quantifying Seasonal Misdiagnosis Patterns and their Impact on Surveillance Estimates Mowery, Jared Online J Public Health Inform Research Article BACKGROUND: Influenza (flu) surveillance using Twitter data can potentially save lives and increase efficiency by providing governments and healthcare organizations with greater situational awareness. However, research is needed to determine the impact of Twitter users’ misdiagnoses on surveillance estimates. OBJECTIVE: This study establishes the importance of Twitter users’ misdiagnoses by showing that Twitter flu surveillance in the United States failed during the 2011-2012 flu season, estimates the extent of misdiagnoses, and tests several methods for reducing the adverse effects of misdiagnoses. METHODS: Metrics representing flu prevalence, seasonal misdiagnosis patterns, diagnosis uncertainty, flu symptoms, and noise were produced using Twitter data in conjunction with OpenSextant for geo-inferencing, and a maximum entropy classifier for identifying tweets related to illness. These metrics were tested for correlations with World Health Organization (WHO) positive specimen counts of flu from 2011 to 2014. RESULTS: Twitter flu surveillance erroneously indicated a typical flu season during 2011-2012, even though the flu season peaked three months late, and erroneously indicated plateaus of flu tweets before the 2012-2013 and 2013-2014 flu seasons. Enhancements based on estimates of misdiagnoses removed the erroneous plateaus and increased the Pearson correlation coefficients by .04 and .23, but failed to correct the 2011-2012 flu season estimate. A rough estimate indicates that approximately 40% of flu tweets reflected misdiagnoses. CONCLUSIONS: Further research into factors affecting Twitter users’ misdiagnoses, in conjunction with data from additional atypical flu seasons, is needed to enable Twitter flu surveillance systems to produce reliable estimates during atypical flu seasons. University of Illinois at Chicago Library 2016-12-28 /pmc/articles/PMC5302465/ /pubmed/28210419 http://dx.doi.org/10.5210/ojphi.v8i3.7011 Text en This is an Open Access article. Authors own copyright of their articles appearing in the Journal of Public Health Informatics. Readers may copy articles without permission of the copyright owner(s), as long as the author and OJPHI are acknowledged in the copy and the copy is used for educational, not-for-profit purposes.
spellingShingle Research Article
Mowery, Jared
Twitter Influenza Surveillance: Quantifying Seasonal Misdiagnosis Patterns and their Impact on Surveillance Estimates
title Twitter Influenza Surveillance: Quantifying Seasonal Misdiagnosis Patterns and their Impact on Surveillance Estimates
title_full Twitter Influenza Surveillance: Quantifying Seasonal Misdiagnosis Patterns and their Impact on Surveillance Estimates
title_fullStr Twitter Influenza Surveillance: Quantifying Seasonal Misdiagnosis Patterns and their Impact on Surveillance Estimates
title_full_unstemmed Twitter Influenza Surveillance: Quantifying Seasonal Misdiagnosis Patterns and their Impact on Surveillance Estimates
title_short Twitter Influenza Surveillance: Quantifying Seasonal Misdiagnosis Patterns and their Impact on Surveillance Estimates
title_sort twitter influenza surveillance: quantifying seasonal misdiagnosis patterns and their impact on surveillance estimates
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5302465/
https://www.ncbi.nlm.nih.gov/pubmed/28210419
http://dx.doi.org/10.5210/ojphi.v8i3.7011
work_keys_str_mv AT moweryjared twitterinfluenzasurveillancequantifyingseasonalmisdiagnosispatternsandtheirimpactonsurveillanceestimates