Cargando…

Disaster Impacts Surveillance from Social Media with Topic Modeling and Feature Extraction: Case of Hurricane Harvey

Twitter can supply useful information on infrastructure impacts to the emergency managers during major disasters, but it is time consuming to filter through many irrelevant tweets. Previous studies have identified the types of messages that can be found on social media during disasters, but few solu...

Descripción completa

Detalles Bibliográficos
Autores principales: Mihunov, Volodymyr V., Jafari, Navid H., Wang, Kejin, Lam, Nina S. N., Govender, Dylan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Nature Singapore 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9510182/
http://dx.doi.org/10.1007/s13753-022-00442-1
_version_ 1784797394505826304
author Mihunov, Volodymyr V.
Jafari, Navid H.
Wang, Kejin
Lam, Nina S. N.
Govender, Dylan
author_facet Mihunov, Volodymyr V.
Jafari, Navid H.
Wang, Kejin
Lam, Nina S. N.
Govender, Dylan
author_sort Mihunov, Volodymyr V.
collection PubMed
description Twitter can supply useful information on infrastructure impacts to the emergency managers during major disasters, but it is time consuming to filter through many irrelevant tweets. Previous studies have identified the types of messages that can be found on social media during disasters, but few solutions have been proposed to efficiently extract useful ones. We present a framework that can be applied in a timely manner to provide disaster impact information sourced from social media. The framework is tested on a well-studied and data-rich case of Hurricane Harvey. The procedures consist of filtering the raw Twitter data based on keywords, location, and tweet attributes, and then applying the latent Dirichlet allocation (LDA) to separate the tweets from the disaster affected area into categories (topics) useful to emergency managers. The LDA revealed that out of 24 topics found in the data, nine were directly related to disaster impacts—for example, outages, closures, flooded roads, and damaged infrastructure. Features such as frequent hashtags, mentions, URLs, and useful images were then extracted and analyzed. The relevant tweets, along with useful images, were correlated at the county level with flood depth, distributed disaster aid (damage), and population density. Significant correlations were found between the nine relevant topics and population density but not flood depth and damage, suggesting that more research into the suitability of social media data for disaster impacts modeling is needed. The results from this study provide baseline information for such efforts in the future.
format Online
Article
Text
id pubmed-9510182
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer Nature Singapore
record_format MEDLINE/PubMed
spelling pubmed-95101822022-09-26 Disaster Impacts Surveillance from Social Media with Topic Modeling and Feature Extraction: Case of Hurricane Harvey Mihunov, Volodymyr V. Jafari, Navid H. Wang, Kejin Lam, Nina S. N. Govender, Dylan Int J Disaster Risk Sci Article Twitter can supply useful information on infrastructure impacts to the emergency managers during major disasters, but it is time consuming to filter through many irrelevant tweets. Previous studies have identified the types of messages that can be found on social media during disasters, but few solutions have been proposed to efficiently extract useful ones. We present a framework that can be applied in a timely manner to provide disaster impact information sourced from social media. The framework is tested on a well-studied and data-rich case of Hurricane Harvey. The procedures consist of filtering the raw Twitter data based on keywords, location, and tweet attributes, and then applying the latent Dirichlet allocation (LDA) to separate the tweets from the disaster affected area into categories (topics) useful to emergency managers. The LDA revealed that out of 24 topics found in the data, nine were directly related to disaster impacts—for example, outages, closures, flooded roads, and damaged infrastructure. Features such as frequent hashtags, mentions, URLs, and useful images were then extracted and analyzed. The relevant tweets, along with useful images, were correlated at the county level with flood depth, distributed disaster aid (damage), and population density. Significant correlations were found between the nine relevant topics and population density but not flood depth and damage, suggesting that more research into the suitability of social media data for disaster impacts modeling is needed. The results from this study provide baseline information for such efforts in the future. Springer Nature Singapore 2022-09-23 2022 /pmc/articles/PMC9510182/ http://dx.doi.org/10.1007/s13753-022-00442-1 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Mihunov, Volodymyr V.
Jafari, Navid H.
Wang, Kejin
Lam, Nina S. N.
Govender, Dylan
Disaster Impacts Surveillance from Social Media with Topic Modeling and Feature Extraction: Case of Hurricane Harvey
title Disaster Impacts Surveillance from Social Media with Topic Modeling and Feature Extraction: Case of Hurricane Harvey
title_full Disaster Impacts Surveillance from Social Media with Topic Modeling and Feature Extraction: Case of Hurricane Harvey
title_fullStr Disaster Impacts Surveillance from Social Media with Topic Modeling and Feature Extraction: Case of Hurricane Harvey
title_full_unstemmed Disaster Impacts Surveillance from Social Media with Topic Modeling and Feature Extraction: Case of Hurricane Harvey
title_short Disaster Impacts Surveillance from Social Media with Topic Modeling and Feature Extraction: Case of Hurricane Harvey
title_sort disaster impacts surveillance from social media with topic modeling and feature extraction: case of hurricane harvey
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9510182/
http://dx.doi.org/10.1007/s13753-022-00442-1
work_keys_str_mv AT mihunovvolodymyrv disasterimpactssurveillancefromsocialmediawithtopicmodelingandfeatureextractioncaseofhurricaneharvey
AT jafarinavidh disasterimpactssurveillancefromsocialmediawithtopicmodelingandfeatureextractioncaseofhurricaneharvey
AT wangkejin disasterimpactssurveillancefromsocialmediawithtopicmodelingandfeatureextractioncaseofhurricaneharvey
AT lamninasn disasterimpactssurveillancefromsocialmediawithtopicmodelingandfeatureextractioncaseofhurricaneharvey
AT govenderdylan disasterimpactssurveillancefromsocialmediawithtopicmodelingandfeatureextractioncaseofhurricaneharvey