Cargando…
Disaster Impacts Surveillance from Social Media with Topic Modeling and Feature Extraction: Case of Hurricane Harvey
Twitter can supply useful information on infrastructure impacts to the emergency managers during major disasters, but it is time consuming to filter through many irrelevant tweets. Previous studies have identified the types of messages that can be found on social media during disasters, but few solu...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Nature Singapore
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9510182/ http://dx.doi.org/10.1007/s13753-022-00442-1 |
_version_ | 1784797394505826304 |
---|---|
author | Mihunov, Volodymyr V. Jafari, Navid H. Wang, Kejin Lam, Nina S. N. Govender, Dylan |
author_facet | Mihunov, Volodymyr V. Jafari, Navid H. Wang, Kejin Lam, Nina S. N. Govender, Dylan |
author_sort | Mihunov, Volodymyr V. |
collection | PubMed |
description | Twitter can supply useful information on infrastructure impacts to the emergency managers during major disasters, but it is time consuming to filter through many irrelevant tweets. Previous studies have identified the types of messages that can be found on social media during disasters, but few solutions have been proposed to efficiently extract useful ones. We present a framework that can be applied in a timely manner to provide disaster impact information sourced from social media. The framework is tested on a well-studied and data-rich case of Hurricane Harvey. The procedures consist of filtering the raw Twitter data based on keywords, location, and tweet attributes, and then applying the latent Dirichlet allocation (LDA) to separate the tweets from the disaster affected area into categories (topics) useful to emergency managers. The LDA revealed that out of 24 topics found in the data, nine were directly related to disaster impacts—for example, outages, closures, flooded roads, and damaged infrastructure. Features such as frequent hashtags, mentions, URLs, and useful images were then extracted and analyzed. The relevant tweets, along with useful images, were correlated at the county level with flood depth, distributed disaster aid (damage), and population density. Significant correlations were found between the nine relevant topics and population density but not flood depth and damage, suggesting that more research into the suitability of social media data for disaster impacts modeling is needed. The results from this study provide baseline information for such efforts in the future. |
format | Online Article Text |
id | pubmed-9510182 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Springer Nature Singapore |
record_format | MEDLINE/PubMed |
spelling | pubmed-95101822022-09-26 Disaster Impacts Surveillance from Social Media with Topic Modeling and Feature Extraction: Case of Hurricane Harvey Mihunov, Volodymyr V. Jafari, Navid H. Wang, Kejin Lam, Nina S. N. Govender, Dylan Int J Disaster Risk Sci Article Twitter can supply useful information on infrastructure impacts to the emergency managers during major disasters, but it is time consuming to filter through many irrelevant tweets. Previous studies have identified the types of messages that can be found on social media during disasters, but few solutions have been proposed to efficiently extract useful ones. We present a framework that can be applied in a timely manner to provide disaster impact information sourced from social media. The framework is tested on a well-studied and data-rich case of Hurricane Harvey. The procedures consist of filtering the raw Twitter data based on keywords, location, and tweet attributes, and then applying the latent Dirichlet allocation (LDA) to separate the tweets from the disaster affected area into categories (topics) useful to emergency managers. The LDA revealed that out of 24 topics found in the data, nine were directly related to disaster impacts—for example, outages, closures, flooded roads, and damaged infrastructure. Features such as frequent hashtags, mentions, URLs, and useful images were then extracted and analyzed. The relevant tweets, along with useful images, were correlated at the county level with flood depth, distributed disaster aid (damage), and population density. Significant correlations were found between the nine relevant topics and population density but not flood depth and damage, suggesting that more research into the suitability of social media data for disaster impacts modeling is needed. The results from this study provide baseline information for such efforts in the future. Springer Nature Singapore 2022-09-23 2022 /pmc/articles/PMC9510182/ http://dx.doi.org/10.1007/s13753-022-00442-1 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Mihunov, Volodymyr V. Jafari, Navid H. Wang, Kejin Lam, Nina S. N. Govender, Dylan Disaster Impacts Surveillance from Social Media with Topic Modeling and Feature Extraction: Case of Hurricane Harvey |
title | Disaster Impacts Surveillance from Social Media with Topic Modeling and Feature Extraction: Case of Hurricane Harvey |
title_full | Disaster Impacts Surveillance from Social Media with Topic Modeling and Feature Extraction: Case of Hurricane Harvey |
title_fullStr | Disaster Impacts Surveillance from Social Media with Topic Modeling and Feature Extraction: Case of Hurricane Harvey |
title_full_unstemmed | Disaster Impacts Surveillance from Social Media with Topic Modeling and Feature Extraction: Case of Hurricane Harvey |
title_short | Disaster Impacts Surveillance from Social Media with Topic Modeling and Feature Extraction: Case of Hurricane Harvey |
title_sort | disaster impacts surveillance from social media with topic modeling and feature extraction: case of hurricane harvey |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9510182/ http://dx.doi.org/10.1007/s13753-022-00442-1 |
work_keys_str_mv | AT mihunovvolodymyrv disasterimpactssurveillancefromsocialmediawithtopicmodelingandfeatureextractioncaseofhurricaneharvey AT jafarinavidh disasterimpactssurveillancefromsocialmediawithtopicmodelingandfeatureextractioncaseofhurricaneharvey AT wangkejin disasterimpactssurveillancefromsocialmediawithtopicmodelingandfeatureextractioncaseofhurricaneharvey AT lamninasn disasterimpactssurveillancefromsocialmediawithtopicmodelingandfeatureextractioncaseofhurricaneharvey AT govenderdylan disasterimpactssurveillancefromsocialmediawithtopicmodelingandfeatureextractioncaseofhurricaneharvey |