Cargando…

Detecting Disease Outbreaks in Mass Gatherings Using Internet Data

BACKGROUND: Mass gatherings, such as music festivals and religious events, pose a health care challenge because of the risk of transmission of communicable diseases. This is exacerbated by the fact that participants disperse soon after the gathering, potentially spreading disease within their commun...

Descripción completa

Detalles Bibliográficos
Autores principales: Yom-Tov, Elad, Borsa, Diana, Cox, Ingemar J, McKendry, Rachel A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications Inc. 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4090384/
https://www.ncbi.nlm.nih.gov/pubmed/24943128
http://dx.doi.org/10.2196/jmir.3156
_version_ 1782480622088880128
author Yom-Tov, Elad
Borsa, Diana
Cox, Ingemar J
McKendry, Rachel A
author_facet Yom-Tov, Elad
Borsa, Diana
Cox, Ingemar J
McKendry, Rachel A
author_sort Yom-Tov, Elad
collection PubMed
description BACKGROUND: Mass gatherings, such as music festivals and religious events, pose a health care challenge because of the risk of transmission of communicable diseases. This is exacerbated by the fact that participants disperse soon after the gathering, potentially spreading disease within their communities. The dispersion of participants also poses a challenge for traditional surveillance methods. The ubiquitous use of the Internet may enable the detection of disease outbreaks through analysis of data generated by users during events and shortly thereafter. OBJECTIVE: The intent of the study was to develop algorithms that can alert to possible outbreaks of communicable diseases from Internet data, specifically Twitter and search engine queries. METHODS: We extracted all Twitter postings and queries made to the Bing search engine by users who repeatedly mentioned one of nine major music festivals held in the United Kingdom and one religious event (the Hajj in Mecca) during 2012, for a period of 30 days and after each festival. We analyzed these data using three methods, two of which compared words associated with disease symptoms before and after the time of the festival, and one that compared the frequency of these words with those of other users in the United Kingdom in the days following the festivals. RESULTS: The data comprised, on average, 7.5 million tweets made by 12,163 users, and 32,143 queries made by 1756 users from each festival. Our methods indicated the statistically significant appearance of a disease symptom in two of the nine festivals. For example, cough was detected at higher than expected levels following the Wakestock festival. Statistically significant agreement (chi-square test, P<.01) between methods and across data sources was found where a statistically significant symptom was detected. Anecdotal evidence suggests that symptoms detected are indeed indicative of a disease that some users attributed to being at the festival. CONCLUSIONS: Our work shows the feasibility of creating a public health surveillance system for mass gatherings based on Internet data. The use of multiple data sources and analysis methods was found to be advantageous for rejecting false positives. Further studies are required in order to validate our findings with data from public health authorities.
format Online
Article
Text
id pubmed-4090384
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher JMIR Publications Inc.
record_format MEDLINE/PubMed
spelling pubmed-40903842014-07-10 Detecting Disease Outbreaks in Mass Gatherings Using Internet Data Yom-Tov, Elad Borsa, Diana Cox, Ingemar J McKendry, Rachel A J Med Internet Res Original Paper BACKGROUND: Mass gatherings, such as music festivals and religious events, pose a health care challenge because of the risk of transmission of communicable diseases. This is exacerbated by the fact that participants disperse soon after the gathering, potentially spreading disease within their communities. The dispersion of participants also poses a challenge for traditional surveillance methods. The ubiquitous use of the Internet may enable the detection of disease outbreaks through analysis of data generated by users during events and shortly thereafter. OBJECTIVE: The intent of the study was to develop algorithms that can alert to possible outbreaks of communicable diseases from Internet data, specifically Twitter and search engine queries. METHODS: We extracted all Twitter postings and queries made to the Bing search engine by users who repeatedly mentioned one of nine major music festivals held in the United Kingdom and one religious event (the Hajj in Mecca) during 2012, for a period of 30 days and after each festival. We analyzed these data using three methods, two of which compared words associated with disease symptoms before and after the time of the festival, and one that compared the frequency of these words with those of other users in the United Kingdom in the days following the festivals. RESULTS: The data comprised, on average, 7.5 million tweets made by 12,163 users, and 32,143 queries made by 1756 users from each festival. Our methods indicated the statistically significant appearance of a disease symptom in two of the nine festivals. For example, cough was detected at higher than expected levels following the Wakestock festival. Statistically significant agreement (chi-square test, P<.01) between methods and across data sources was found where a statistically significant symptom was detected. Anecdotal evidence suggests that symptoms detected are indeed indicative of a disease that some users attributed to being at the festival. CONCLUSIONS: Our work shows the feasibility of creating a public health surveillance system for mass gatherings based on Internet data. The use of multiple data sources and analysis methods was found to be advantageous for rejecting false positives. Further studies are required in order to validate our findings with data from public health authorities. JMIR Publications Inc. 2014-06-18 /pmc/articles/PMC4090384/ /pubmed/24943128 http://dx.doi.org/10.2196/jmir.3156 Text en ©Elad Yom-Tov, Diana Borsa, Ingemar J Cox, Rachel A McKendry. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 18.06.2014. http://creativecommons.org/licenses/by/2.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Yom-Tov, Elad
Borsa, Diana
Cox, Ingemar J
McKendry, Rachel A
Detecting Disease Outbreaks in Mass Gatherings Using Internet Data
title Detecting Disease Outbreaks in Mass Gatherings Using Internet Data
title_full Detecting Disease Outbreaks in Mass Gatherings Using Internet Data
title_fullStr Detecting Disease Outbreaks in Mass Gatherings Using Internet Data
title_full_unstemmed Detecting Disease Outbreaks in Mass Gatherings Using Internet Data
title_short Detecting Disease Outbreaks in Mass Gatherings Using Internet Data
title_sort detecting disease outbreaks in mass gatherings using internet data
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4090384/
https://www.ncbi.nlm.nih.gov/pubmed/24943128
http://dx.doi.org/10.2196/jmir.3156
work_keys_str_mv AT yomtovelad detectingdiseaseoutbreaksinmassgatheringsusinginternetdata
AT borsadiana detectingdiseaseoutbreaksinmassgatheringsusinginternetdata
AT coxingemarj detectingdiseaseoutbreaksinmassgatheringsusinginternetdata
AT mckendryrachela detectingdiseaseoutbreaksinmassgatheringsusinginternetdata