Cargando…

Applying GIS and Machine Learning Methods to Twitter Data for Multiscale Surveillance of Influenza

Traditional methods for monitoring influenza are haphazard and lack fine-grained details regarding the spatial and temporal dynamics of outbreaks. Twitter gives researchers and public health officials an opportunity to examine the spread of influenza in real-time and at multiple geographical scales....

Descripción completa

Detalles Bibliográficos
Autores principales: Allen, Chris, Tsou, Ming-Hsiang, Aslam, Anoshe, Nagel, Anna, Gawron, Jean-Mark
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4959719/
https://www.ncbi.nlm.nih.gov/pubmed/27455108
http://dx.doi.org/10.1371/journal.pone.0157734
_version_ 1782444437113143296
author Allen, Chris
Tsou, Ming-Hsiang
Aslam, Anoshe
Nagel, Anna
Gawron, Jean-Mark
author_facet Allen, Chris
Tsou, Ming-Hsiang
Aslam, Anoshe
Nagel, Anna
Gawron, Jean-Mark
author_sort Allen, Chris
collection PubMed
description Traditional methods for monitoring influenza are haphazard and lack fine-grained details regarding the spatial and temporal dynamics of outbreaks. Twitter gives researchers and public health officials an opportunity to examine the spread of influenza in real-time and at multiple geographical scales. In this paper, we introduce an improved framework for monitoring influenza outbreaks using the social media platform Twitter. Relying upon techniques from geographic information science (GIS) and data mining, Twitter messages were collected, filtered, and analyzed for the thirty most populated cities in the United States during the 2013–2014 flu season. The results of this procedure are compared with national, regional, and local flu outbreak reports, revealing a statistically significant correlation between the two data sources. The main contribution of this paper is to introduce a comprehensive data mining process that enhances previous attempts to accurately identify tweets related to influenza. Additionally, geographical information systems allow us to target, filter, and normalize Twitter messages.
format Online
Article
Text
id pubmed-4959719
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-49597192016-08-08 Applying GIS and Machine Learning Methods to Twitter Data for Multiscale Surveillance of Influenza Allen, Chris Tsou, Ming-Hsiang Aslam, Anoshe Nagel, Anna Gawron, Jean-Mark PLoS One Research Article Traditional methods for monitoring influenza are haphazard and lack fine-grained details regarding the spatial and temporal dynamics of outbreaks. Twitter gives researchers and public health officials an opportunity to examine the spread of influenza in real-time and at multiple geographical scales. In this paper, we introduce an improved framework for monitoring influenza outbreaks using the social media platform Twitter. Relying upon techniques from geographic information science (GIS) and data mining, Twitter messages were collected, filtered, and analyzed for the thirty most populated cities in the United States during the 2013–2014 flu season. The results of this procedure are compared with national, regional, and local flu outbreak reports, revealing a statistically significant correlation between the two data sources. The main contribution of this paper is to introduce a comprehensive data mining process that enhances previous attempts to accurately identify tweets related to influenza. Additionally, geographical information systems allow us to target, filter, and normalize Twitter messages. Public Library of Science 2016-07-25 /pmc/articles/PMC4959719/ /pubmed/27455108 http://dx.doi.org/10.1371/journal.pone.0157734 Text en © 2016 Allen et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Allen, Chris
Tsou, Ming-Hsiang
Aslam, Anoshe
Nagel, Anna
Gawron, Jean-Mark
Applying GIS and Machine Learning Methods to Twitter Data for Multiscale Surveillance of Influenza
title Applying GIS and Machine Learning Methods to Twitter Data for Multiscale Surveillance of Influenza
title_full Applying GIS and Machine Learning Methods to Twitter Data for Multiscale Surveillance of Influenza
title_fullStr Applying GIS and Machine Learning Methods to Twitter Data for Multiscale Surveillance of Influenza
title_full_unstemmed Applying GIS and Machine Learning Methods to Twitter Data for Multiscale Surveillance of Influenza
title_short Applying GIS and Machine Learning Methods to Twitter Data for Multiscale Surveillance of Influenza
title_sort applying gis and machine learning methods to twitter data for multiscale surveillance of influenza
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4959719/
https://www.ncbi.nlm.nih.gov/pubmed/27455108
http://dx.doi.org/10.1371/journal.pone.0157734
work_keys_str_mv AT allenchris applyinggisandmachinelearningmethodstotwitterdataformultiscalesurveillanceofinfluenza
AT tsouminghsiang applyinggisandmachinelearningmethodstotwitterdataformultiscalesurveillanceofinfluenza
AT aslamanoshe applyinggisandmachinelearningmethodstotwitterdataformultiscalesurveillanceofinfluenza
AT nagelanna applyinggisandmachinelearningmethodstotwitterdataformultiscalesurveillanceofinfluenza
AT gawronjeanmark applyinggisandmachinelearningmethodstotwitterdataformultiscalesurveillanceofinfluenza