Cargando…
The Complex Relationship of Realspace Events and Messages in Cyberspace: Case Study of Influenza and Pertussis Using Tweets
BACKGROUND: Surveillance plays a vital role in disease detection, but traditional methods of collecting patient data, reporting to health officials, and compiling reports are costly and time consuming. In recent years, syndromic surveillance tools have expanded and researchers are able to exploit th...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications Inc.
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3841359/ https://www.ncbi.nlm.nih.gov/pubmed/24158773 http://dx.doi.org/10.2196/jmir.2705 |
_version_ | 1782292773959892992 |
---|---|
author | Nagel, Anna C Tsou, Ming-Hsiang Spitzberg, Brian H An, Li Gawron, J Mark Gupta, Dipak K Yang, Jiue-An Han, Su Peddecord, K Michael Lindsay, Suzanne Sawyer, Mark H |
author_facet | Nagel, Anna C Tsou, Ming-Hsiang Spitzberg, Brian H An, Li Gawron, J Mark Gupta, Dipak K Yang, Jiue-An Han, Su Peddecord, K Michael Lindsay, Suzanne Sawyer, Mark H |
author_sort | Nagel, Anna C |
collection | PubMed |
description | BACKGROUND: Surveillance plays a vital role in disease detection, but traditional methods of collecting patient data, reporting to health officials, and compiling reports are costly and time consuming. In recent years, syndromic surveillance tools have expanded and researchers are able to exploit the vast amount of data available in real time on the Internet at minimal cost. Many data sources for infoveillance exist, but this study focuses on status updates (tweets) from the Twitter microblogging website. OBJECTIVE: The aim of this study was to explore the interaction between cyberspace message activity, measured by keyword-specific tweets, and real world occurrences of influenza and pertussis. Tweets were aggregated by week and compared to weekly influenza-like illness (ILI) and weekly pertussis incidence. The potential effect of tweet type was analyzed by categorizing tweets into 4 categories: nonretweets, retweets, tweets with a URL Web address, and tweets without a URL Web address. METHODS: Tweets were collected within a 17-mile radius of 11 US cities chosen on the basis of population size and the availability of disease data. Influenza analysis involved all 11 cities. Pertussis analysis was based on the 2 cities nearest to the Washington State pertussis outbreak (Seattle, WA and Portland, OR). Tweet collection resulted in 161,821 flu, 6174 influenza, 160 pertussis, and 1167 whooping cough tweets. The correlation coefficients between tweets or subgroups of tweets and disease occurrence were calculated and trends were presented graphically. RESULTS: Correlations between weekly aggregated tweets and disease occurrence varied greatly, but were relatively strong in some areas. In general, correlation coefficients were stronger in the flu analysis compared to the pertussis analysis. Within each analysis, flu tweets were more strongly correlated with ILI rates than influenza tweets, and whooping cough tweets correlated more strongly with pertussis incidence than pertussis tweets. Nonretweets correlated more with disease occurrence than retweets, and tweets without a URL Web address correlated better with actual incidence than those with a URL Web address primarily for the flu tweets. CONCLUSIONS: This study demonstrates that not only does keyword choice play an important role in how well tweets correlate with disease occurrence, but that the subgroup of tweets used for analysis is also important. This exploratory work shows potential in the use of tweets for infoveillance, but continued efforts are needed to further refine research methods in this field. |
format | Online Article Text |
id | pubmed-3841359 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | JMIR Publications Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-38413592013-11-27 The Complex Relationship of Realspace Events and Messages in Cyberspace: Case Study of Influenza and Pertussis Using Tweets Nagel, Anna C Tsou, Ming-Hsiang Spitzberg, Brian H An, Li Gawron, J Mark Gupta, Dipak K Yang, Jiue-An Han, Su Peddecord, K Michael Lindsay, Suzanne Sawyer, Mark H J Med Internet Res Original Paper BACKGROUND: Surveillance plays a vital role in disease detection, but traditional methods of collecting patient data, reporting to health officials, and compiling reports are costly and time consuming. In recent years, syndromic surveillance tools have expanded and researchers are able to exploit the vast amount of data available in real time on the Internet at minimal cost. Many data sources for infoveillance exist, but this study focuses on status updates (tweets) from the Twitter microblogging website. OBJECTIVE: The aim of this study was to explore the interaction between cyberspace message activity, measured by keyword-specific tweets, and real world occurrences of influenza and pertussis. Tweets were aggregated by week and compared to weekly influenza-like illness (ILI) and weekly pertussis incidence. The potential effect of tweet type was analyzed by categorizing tweets into 4 categories: nonretweets, retweets, tweets with a URL Web address, and tweets without a URL Web address. METHODS: Tweets were collected within a 17-mile radius of 11 US cities chosen on the basis of population size and the availability of disease data. Influenza analysis involved all 11 cities. Pertussis analysis was based on the 2 cities nearest to the Washington State pertussis outbreak (Seattle, WA and Portland, OR). Tweet collection resulted in 161,821 flu, 6174 influenza, 160 pertussis, and 1167 whooping cough tweets. The correlation coefficients between tweets or subgroups of tweets and disease occurrence were calculated and trends were presented graphically. RESULTS: Correlations between weekly aggregated tweets and disease occurrence varied greatly, but were relatively strong in some areas. In general, correlation coefficients were stronger in the flu analysis compared to the pertussis analysis. Within each analysis, flu tweets were more strongly correlated with ILI rates than influenza tweets, and whooping cough tweets correlated more strongly with pertussis incidence than pertussis tweets. Nonretweets correlated more with disease occurrence than retweets, and tweets without a URL Web address correlated better with actual incidence than those with a URL Web address primarily for the flu tweets. CONCLUSIONS: This study demonstrates that not only does keyword choice play an important role in how well tweets correlate with disease occurrence, but that the subgroup of tweets used for analysis is also important. This exploratory work shows potential in the use of tweets for infoveillance, but continued efforts are needed to further refine research methods in this field. JMIR Publications Inc. 2013-10-26 /pmc/articles/PMC3841359/ /pubmed/24158773 http://dx.doi.org/10.2196/jmir.2705 Text en ©Anna C Nagel, Ming-Hsiang Tsou, Brian H Spitzberg, Li An, J Mark Gawron, Dipak K Gupta, Jiue-An Yang, Su Han, K Michael Peddecord, Suzanne Lindsay, Mark H Sawyer. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 26.10.2013. http://creativecommons.org/licenses/by/2.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Nagel, Anna C Tsou, Ming-Hsiang Spitzberg, Brian H An, Li Gawron, J Mark Gupta, Dipak K Yang, Jiue-An Han, Su Peddecord, K Michael Lindsay, Suzanne Sawyer, Mark H The Complex Relationship of Realspace Events and Messages in Cyberspace: Case Study of Influenza and Pertussis Using Tweets |
title | The Complex Relationship of Realspace Events and Messages in Cyberspace: Case Study of Influenza and Pertussis Using Tweets |
title_full | The Complex Relationship of Realspace Events and Messages in Cyberspace: Case Study of Influenza and Pertussis Using Tweets |
title_fullStr | The Complex Relationship of Realspace Events and Messages in Cyberspace: Case Study of Influenza and Pertussis Using Tweets |
title_full_unstemmed | The Complex Relationship of Realspace Events and Messages in Cyberspace: Case Study of Influenza and Pertussis Using Tweets |
title_short | The Complex Relationship of Realspace Events and Messages in Cyberspace: Case Study of Influenza and Pertussis Using Tweets |
title_sort | complex relationship of realspace events and messages in cyberspace: case study of influenza and pertussis using tweets |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3841359/ https://www.ncbi.nlm.nih.gov/pubmed/24158773 http://dx.doi.org/10.2196/jmir.2705 |
work_keys_str_mv | AT nagelannac thecomplexrelationshipofrealspaceeventsandmessagesincyberspacecasestudyofinfluenzaandpertussisusingtweets AT tsouminghsiang thecomplexrelationshipofrealspaceeventsandmessagesincyberspacecasestudyofinfluenzaandpertussisusingtweets AT spitzbergbrianh thecomplexrelationshipofrealspaceeventsandmessagesincyberspacecasestudyofinfluenzaandpertussisusingtweets AT anli thecomplexrelationshipofrealspaceeventsandmessagesincyberspacecasestudyofinfluenzaandpertussisusingtweets AT gawronjmark thecomplexrelationshipofrealspaceeventsandmessagesincyberspacecasestudyofinfluenzaandpertussisusingtweets AT guptadipakk thecomplexrelationshipofrealspaceeventsandmessagesincyberspacecasestudyofinfluenzaandpertussisusingtweets AT yangjiuean thecomplexrelationshipofrealspaceeventsandmessagesincyberspacecasestudyofinfluenzaandpertussisusingtweets AT hansu thecomplexrelationshipofrealspaceeventsandmessagesincyberspacecasestudyofinfluenzaandpertussisusingtweets AT peddecordkmichael thecomplexrelationshipofrealspaceeventsandmessagesincyberspacecasestudyofinfluenzaandpertussisusingtweets AT lindsaysuzanne thecomplexrelationshipofrealspaceeventsandmessagesincyberspacecasestudyofinfluenzaandpertussisusingtweets AT sawyermarkh thecomplexrelationshipofrealspaceeventsandmessagesincyberspacecasestudyofinfluenzaandpertussisusingtweets AT nagelannac complexrelationshipofrealspaceeventsandmessagesincyberspacecasestudyofinfluenzaandpertussisusingtweets AT tsouminghsiang complexrelationshipofrealspaceeventsandmessagesincyberspacecasestudyofinfluenzaandpertussisusingtweets AT spitzbergbrianh complexrelationshipofrealspaceeventsandmessagesincyberspacecasestudyofinfluenzaandpertussisusingtweets AT anli complexrelationshipofrealspaceeventsandmessagesincyberspacecasestudyofinfluenzaandpertussisusingtweets AT gawronjmark complexrelationshipofrealspaceeventsandmessagesincyberspacecasestudyofinfluenzaandpertussisusingtweets AT guptadipakk complexrelationshipofrealspaceeventsandmessagesincyberspacecasestudyofinfluenzaandpertussisusingtweets AT yangjiuean complexrelationshipofrealspaceeventsandmessagesincyberspacecasestudyofinfluenzaandpertussisusingtweets AT hansu complexrelationshipofrealspaceeventsandmessagesincyberspacecasestudyofinfluenzaandpertussisusingtweets AT peddecordkmichael complexrelationshipofrealspaceeventsandmessagesincyberspacecasestudyofinfluenzaandpertussisusingtweets AT lindsaysuzanne complexrelationshipofrealspaceeventsandmessagesincyberspacecasestudyofinfluenzaandpertussisusingtweets AT sawyermarkh complexrelationshipofrealspaceeventsandmessagesincyberspacecasestudyofinfluenzaandpertussisusingtweets |