Cargando…

Using Twitter Data to Monitor Natural Disaster Social Dynamics: A Recurrent Neural Network Approach with Word Embeddings and Kernel Density Estimation

In recent years, Online Social Networks (OSNs) have received a great deal of attention for their potential use in the spatial and temporal modeling of events owing to the information that can be extracted from these platforms. Within this context, one of the most latent applications is the monitorin...

Descripción completa

Detalles Bibliográficos
Autores principales: Hernandez-Suarez, Aldo, Sanchez-Perez, Gabriel, Toscano-Medina, Karina, Perez-Meana, Hector, Portillo-Portillo, Jose, Sanchez, Victor, García Villalba, Luis Javier
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6484392/
https://www.ncbi.nlm.nih.gov/pubmed/30979067
http://dx.doi.org/10.3390/s19071746
_version_ 1783414111619514368
author Hernandez-Suarez, Aldo
Sanchez-Perez, Gabriel
Toscano-Medina, Karina
Perez-Meana, Hector
Portillo-Portillo, Jose
Sanchez, Victor
García Villalba, Luis Javier
author_facet Hernandez-Suarez, Aldo
Sanchez-Perez, Gabriel
Toscano-Medina, Karina
Perez-Meana, Hector
Portillo-Portillo, Jose
Sanchez, Victor
García Villalba, Luis Javier
author_sort Hernandez-Suarez, Aldo
collection PubMed
description In recent years, Online Social Networks (OSNs) have received a great deal of attention for their potential use in the spatial and temporal modeling of events owing to the information that can be extracted from these platforms. Within this context, one of the most latent applications is the monitoring of natural disasters. Vital information posted by OSN users can contribute to relief efforts during and after a catastrophe. Although it is possible to retrieve data from OSNs using embedded geographic information provided by GPS systems, this feature is disabled by default in most cases. An alternative solution is to geoparse specific locations using language models based on Named Entity Recognition (NER) techniques. In this work, a sensor that uses Twitter is proposed to monitor natural disasters. The approach is intended to sense data by detecting toponyms (named places written within the text) in tweets with event-related information, e.g., a collapsed building on a specific avenue or the location at which a person was last seen. The proposed approach is carried out by transforming tokenized tweets into word embeddings: a rich linguistic and contextual vector representation of textual corpora. Pre-labeled word embeddings are employed to train a Recurrent Neural Network variant, known as a Bidirectional Long Short-Term Memory (biLSTM) network, that is capable of dealing with sequential data by analyzing information in both directions of a word (past and future entries). Moreover, a Conditional Random Field (CRF) output layer, which aims to maximize the transition from one NER tag to another, is used to increase the classification accuracy. The resulting labeled words are joined to coherently form a toponym, which is geocoded and scored by a Kernel Density Estimation function. At the end of the process, the scored data are presented graphically to depict areas in which the majority of tweets reporting topics related to a natural disaster are concentrated. A case study on Mexico’s 2017 Earthquake is presented, and the data extracted during and after the event are reported.
format Online
Article
Text
id pubmed-6484392
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-64843922019-04-29 Using Twitter Data to Monitor Natural Disaster Social Dynamics: A Recurrent Neural Network Approach with Word Embeddings and Kernel Density Estimation Hernandez-Suarez, Aldo Sanchez-Perez, Gabriel Toscano-Medina, Karina Perez-Meana, Hector Portillo-Portillo, Jose Sanchez, Victor García Villalba, Luis Javier Sensors (Basel) Article In recent years, Online Social Networks (OSNs) have received a great deal of attention for their potential use in the spatial and temporal modeling of events owing to the information that can be extracted from these platforms. Within this context, one of the most latent applications is the monitoring of natural disasters. Vital information posted by OSN users can contribute to relief efforts during and after a catastrophe. Although it is possible to retrieve data from OSNs using embedded geographic information provided by GPS systems, this feature is disabled by default in most cases. An alternative solution is to geoparse specific locations using language models based on Named Entity Recognition (NER) techniques. In this work, a sensor that uses Twitter is proposed to monitor natural disasters. The approach is intended to sense data by detecting toponyms (named places written within the text) in tweets with event-related information, e.g., a collapsed building on a specific avenue or the location at which a person was last seen. The proposed approach is carried out by transforming tokenized tweets into word embeddings: a rich linguistic and contextual vector representation of textual corpora. Pre-labeled word embeddings are employed to train a Recurrent Neural Network variant, known as a Bidirectional Long Short-Term Memory (biLSTM) network, that is capable of dealing with sequential data by analyzing information in both directions of a word (past and future entries). Moreover, a Conditional Random Field (CRF) output layer, which aims to maximize the transition from one NER tag to another, is used to increase the classification accuracy. The resulting labeled words are joined to coherently form a toponym, which is geocoded and scored by a Kernel Density Estimation function. At the end of the process, the scored data are presented graphically to depict areas in which the majority of tweets reporting topics related to a natural disaster are concentrated. A case study on Mexico’s 2017 Earthquake is presented, and the data extracted during and after the event are reported. MDPI 2019-04-11 /pmc/articles/PMC6484392/ /pubmed/30979067 http://dx.doi.org/10.3390/s19071746 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Hernandez-Suarez, Aldo
Sanchez-Perez, Gabriel
Toscano-Medina, Karina
Perez-Meana, Hector
Portillo-Portillo, Jose
Sanchez, Victor
García Villalba, Luis Javier
Using Twitter Data to Monitor Natural Disaster Social Dynamics: A Recurrent Neural Network Approach with Word Embeddings and Kernel Density Estimation
title Using Twitter Data to Monitor Natural Disaster Social Dynamics: A Recurrent Neural Network Approach with Word Embeddings and Kernel Density Estimation
title_full Using Twitter Data to Monitor Natural Disaster Social Dynamics: A Recurrent Neural Network Approach with Word Embeddings and Kernel Density Estimation
title_fullStr Using Twitter Data to Monitor Natural Disaster Social Dynamics: A Recurrent Neural Network Approach with Word Embeddings and Kernel Density Estimation
title_full_unstemmed Using Twitter Data to Monitor Natural Disaster Social Dynamics: A Recurrent Neural Network Approach with Word Embeddings and Kernel Density Estimation
title_short Using Twitter Data to Monitor Natural Disaster Social Dynamics: A Recurrent Neural Network Approach with Word Embeddings and Kernel Density Estimation
title_sort using twitter data to monitor natural disaster social dynamics: a recurrent neural network approach with word embeddings and kernel density estimation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6484392/
https://www.ncbi.nlm.nih.gov/pubmed/30979067
http://dx.doi.org/10.3390/s19071746
work_keys_str_mv AT hernandezsuarezaldo usingtwitterdatatomonitornaturaldisastersocialdynamicsarecurrentneuralnetworkapproachwithwordembeddingsandkerneldensityestimation
AT sanchezperezgabriel usingtwitterdatatomonitornaturaldisastersocialdynamicsarecurrentneuralnetworkapproachwithwordembeddingsandkerneldensityestimation
AT toscanomedinakarina usingtwitterdatatomonitornaturaldisastersocialdynamicsarecurrentneuralnetworkapproachwithwordembeddingsandkerneldensityestimation
AT perezmeanahector usingtwitterdatatomonitornaturaldisastersocialdynamicsarecurrentneuralnetworkapproachwithwordembeddingsandkerneldensityestimation
AT portilloportillojose usingtwitterdatatomonitornaturaldisastersocialdynamicsarecurrentneuralnetworkapproachwithwordembeddingsandkerneldensityestimation
AT sanchezvictor usingtwitterdatatomonitornaturaldisastersocialdynamicsarecurrentneuralnetworkapproachwithwordembeddingsandkerneldensityestimation
AT garciavillalbaluisjavier usingtwitterdatatomonitornaturaldisastersocialdynamicsarecurrentneuralnetworkapproachwithwordembeddingsandkerneldensityestimation