Cargando…
Applying GIS and Machine Learning Methods to Twitter Data for Multiscale Surveillance of Influenza
Traditional methods for monitoring influenza are haphazard and lack fine-grained details regarding the spatial and temporal dynamics of outbreaks. Twitter gives researchers and public health officials an opportunity to examine the spread of influenza in real-time and at multiple geographical scales....
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4959719/ https://www.ncbi.nlm.nih.gov/pubmed/27455108 http://dx.doi.org/10.1371/journal.pone.0157734 |
_version_ | 1782444437113143296 |
---|---|
author | Allen, Chris Tsou, Ming-Hsiang Aslam, Anoshe Nagel, Anna Gawron, Jean-Mark |
author_facet | Allen, Chris Tsou, Ming-Hsiang Aslam, Anoshe Nagel, Anna Gawron, Jean-Mark |
author_sort | Allen, Chris |
collection | PubMed |
description | Traditional methods for monitoring influenza are haphazard and lack fine-grained details regarding the spatial and temporal dynamics of outbreaks. Twitter gives researchers and public health officials an opportunity to examine the spread of influenza in real-time and at multiple geographical scales. In this paper, we introduce an improved framework for monitoring influenza outbreaks using the social media platform Twitter. Relying upon techniques from geographic information science (GIS) and data mining, Twitter messages were collected, filtered, and analyzed for the thirty most populated cities in the United States during the 2013–2014 flu season. The results of this procedure are compared with national, regional, and local flu outbreak reports, revealing a statistically significant correlation between the two data sources. The main contribution of this paper is to introduce a comprehensive data mining process that enhances previous attempts to accurately identify tweets related to influenza. Additionally, geographical information systems allow us to target, filter, and normalize Twitter messages. |
format | Online Article Text |
id | pubmed-4959719 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-49597192016-08-08 Applying GIS and Machine Learning Methods to Twitter Data for Multiscale Surveillance of Influenza Allen, Chris Tsou, Ming-Hsiang Aslam, Anoshe Nagel, Anna Gawron, Jean-Mark PLoS One Research Article Traditional methods for monitoring influenza are haphazard and lack fine-grained details regarding the spatial and temporal dynamics of outbreaks. Twitter gives researchers and public health officials an opportunity to examine the spread of influenza in real-time and at multiple geographical scales. In this paper, we introduce an improved framework for monitoring influenza outbreaks using the social media platform Twitter. Relying upon techniques from geographic information science (GIS) and data mining, Twitter messages were collected, filtered, and analyzed for the thirty most populated cities in the United States during the 2013–2014 flu season. The results of this procedure are compared with national, regional, and local flu outbreak reports, revealing a statistically significant correlation between the two data sources. The main contribution of this paper is to introduce a comprehensive data mining process that enhances previous attempts to accurately identify tweets related to influenza. Additionally, geographical information systems allow us to target, filter, and normalize Twitter messages. Public Library of Science 2016-07-25 /pmc/articles/PMC4959719/ /pubmed/27455108 http://dx.doi.org/10.1371/journal.pone.0157734 Text en © 2016 Allen et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Allen, Chris Tsou, Ming-Hsiang Aslam, Anoshe Nagel, Anna Gawron, Jean-Mark Applying GIS and Machine Learning Methods to Twitter Data for Multiscale Surveillance of Influenza |
title | Applying GIS and Machine Learning Methods to Twitter Data for Multiscale Surveillance of Influenza |
title_full | Applying GIS and Machine Learning Methods to Twitter Data for Multiscale Surveillance of Influenza |
title_fullStr | Applying GIS and Machine Learning Methods to Twitter Data for Multiscale Surveillance of Influenza |
title_full_unstemmed | Applying GIS and Machine Learning Methods to Twitter Data for Multiscale Surveillance of Influenza |
title_short | Applying GIS and Machine Learning Methods to Twitter Data for Multiscale Surveillance of Influenza |
title_sort | applying gis and machine learning methods to twitter data for multiscale surveillance of influenza |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4959719/ https://www.ncbi.nlm.nih.gov/pubmed/27455108 http://dx.doi.org/10.1371/journal.pone.0157734 |
work_keys_str_mv | AT allenchris applyinggisandmachinelearningmethodstotwitterdataformultiscalesurveillanceofinfluenza AT tsouminghsiang applyinggisandmachinelearningmethodstotwitterdataformultiscalesurveillanceofinfluenza AT aslamanoshe applyinggisandmachinelearningmethodstotwitterdataformultiscalesurveillanceofinfluenza AT nagelanna applyinggisandmachinelearningmethodstotwitterdataformultiscalesurveillanceofinfluenza AT gawronjeanmark applyinggisandmachinelearningmethodstotwitterdataformultiscalesurveillanceofinfluenza |