Cargando…

Image Analysis Enhanced Event Detection from Geo-Tagged Tweet Streams

Events detected from social media streams often include early signs of accidents, crimes or disasters. Therefore, they can be used by related parties for timely and efficient response. Although significant progress has been made on event detection from tweet streams, most existing methods have not c...

Descripción completa

Detalles Bibliográficos
Autores principales: Han, Yi, Karunasekera, Shanika, Leckie, Christopher
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206246/
http://dx.doi.org/10.1007/978-3-030-47426-3_31
_version_ 1783530376877047808
author Han, Yi
Karunasekera, Shanika
Leckie, Christopher
author_facet Han, Yi
Karunasekera, Shanika
Leckie, Christopher
author_sort Han, Yi
collection PubMed
description Events detected from social media streams often include early signs of accidents, crimes or disasters. Therefore, they can be used by related parties for timely and efficient response. Although significant progress has been made on event detection from tweet streams, most existing methods have not considered the posted images in tweets, which provide richer information than the text, and potentially can be a reliable indicator of whether an event occurs or not. In this paper, we design an event detection algorithm that combines textual, statistical and image information, following an unsupervised machine learning approach. Specifically, the algorithm starts with semantic and statistical analyses to obtain a list of tweet clusters, each of which corresponds to an event candidate, and then performs image analysis to separate events from non-events—a convolutional autoencoder is trained for each cluster as an anomaly detector, where a part of the images are used as the training data and the remaining images are used as the test instances. Our experiments on multiple datasets verify that when an event occurs, the mean reconstruction errors of the training and test images are much closer, compared with the case where the candidate is a non-event cluster. Based on this finding, the algorithm rejects a candidate if the difference is larger than a threshold. Experimental results over millions of tweets demonstrate that this image analysis enhanced approach can significantly increase the precision with minimum impact on the recall.
format Online
Article
Text
id pubmed-7206246
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-72062462020-05-08 Image Analysis Enhanced Event Detection from Geo-Tagged Tweet Streams Han, Yi Karunasekera, Shanika Leckie, Christopher Advances in Knowledge Discovery and Data Mining Article Events detected from social media streams often include early signs of accidents, crimes or disasters. Therefore, they can be used by related parties for timely and efficient response. Although significant progress has been made on event detection from tweet streams, most existing methods have not considered the posted images in tweets, which provide richer information than the text, and potentially can be a reliable indicator of whether an event occurs or not. In this paper, we design an event detection algorithm that combines textual, statistical and image information, following an unsupervised machine learning approach. Specifically, the algorithm starts with semantic and statistical analyses to obtain a list of tweet clusters, each of which corresponds to an event candidate, and then performs image analysis to separate events from non-events—a convolutional autoencoder is trained for each cluster as an anomaly detector, where a part of the images are used as the training data and the remaining images are used as the test instances. Our experiments on multiple datasets verify that when an event occurs, the mean reconstruction errors of the training and test images are much closer, compared with the case where the candidate is a non-event cluster. Based on this finding, the algorithm rejects a candidate if the difference is larger than a threshold. Experimental results over millions of tweets demonstrate that this image analysis enhanced approach can significantly increase the precision with minimum impact on the recall. 2020-04-17 /pmc/articles/PMC7206246/ http://dx.doi.org/10.1007/978-3-030-47426-3_31 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Han, Yi
Karunasekera, Shanika
Leckie, Christopher
Image Analysis Enhanced Event Detection from Geo-Tagged Tweet Streams
title Image Analysis Enhanced Event Detection from Geo-Tagged Tweet Streams
title_full Image Analysis Enhanced Event Detection from Geo-Tagged Tweet Streams
title_fullStr Image Analysis Enhanced Event Detection from Geo-Tagged Tweet Streams
title_full_unstemmed Image Analysis Enhanced Event Detection from Geo-Tagged Tweet Streams
title_short Image Analysis Enhanced Event Detection from Geo-Tagged Tweet Streams
title_sort image analysis enhanced event detection from geo-tagged tweet streams
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206246/
http://dx.doi.org/10.1007/978-3-030-47426-3_31
work_keys_str_mv AT hanyi imageanalysisenhancedeventdetectionfromgeotaggedtweetstreams
AT karunasekerashanika imageanalysisenhancedeventdetectionfromgeotaggedtweetstreams
AT leckiechristopher imageanalysisenhancedeventdetectionfromgeotaggedtweetstreams