Cargando…

Spatio-temporal Crime Analysis and Forecasting on Twitter Data Using Machine Learning Algorithms

The concept of social media began to gain popularity in the late 1990s and has played a significant role in connecting people across the globe. The constant addition of features to old social media platforms and the creation of new ones have helped amass and retain an extensive user base. Users coul...

Descripción completa

Detalles Bibliográficos
Autores principales: Vivek, Meghashyam, Prathap, Boppuru Rudra
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Nature Singapore 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10163854/
https://www.ncbi.nlm.nih.gov/pubmed/37193217
http://dx.doi.org/10.1007/s42979-023-01816-y
_version_ 1785037968939941888
author Vivek, Meghashyam
Prathap, Boppuru Rudra
author_facet Vivek, Meghashyam
Prathap, Boppuru Rudra
author_sort Vivek, Meghashyam
collection PubMed
description The concept of social media began to gain popularity in the late 1990s and has played a significant role in connecting people across the globe. The constant addition of features to old social media platforms and the creation of new ones have helped amass and retain an extensive user base. Users could now share their views and provide detailed accounts of events from worldwide to reach like-minded people. This led to the popularization of blogging and brought into focus the posts of the commoner. These posts began to be verified and included in mainstream news articles bringing about a revolution in journalism. This research aims to use a social media platform, Twitter, to classify, visualize, and forecast Indian crime tweet data and provide a spatio-temporal view of crime in the country using statistical and machine learning models. The Tweepy Python module's search function and '#crime' query have been used to scrape relevant tweets under geographical constraints, followed by substring-keyword classification using 318 unique crime keywords. The Bokeh and gmaps Python modules create analytical and geospatial visualizations, respectively. Time series forecasting of crime tweet count is performed by comparing the accuracy of Long Short-Term Memory (LSTM), Auto-Regressive Integrated Moving Average (ARIMA), and Seasonal Auto-Regressivee Integrated Moving Average (SARIMA) models to determine the best model.
format Online
Article
Text
id pubmed-10163854
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer Nature Singapore
record_format MEDLINE/PubMed
spelling pubmed-101638542023-05-09 Spatio-temporal Crime Analysis and Forecasting on Twitter Data Using Machine Learning Algorithms Vivek, Meghashyam Prathap, Boppuru Rudra SN Comput Sci Original Research The concept of social media began to gain popularity in the late 1990s and has played a significant role in connecting people across the globe. The constant addition of features to old social media platforms and the creation of new ones have helped amass and retain an extensive user base. Users could now share their views and provide detailed accounts of events from worldwide to reach like-minded people. This led to the popularization of blogging and brought into focus the posts of the commoner. These posts began to be verified and included in mainstream news articles bringing about a revolution in journalism. This research aims to use a social media platform, Twitter, to classify, visualize, and forecast Indian crime tweet data and provide a spatio-temporal view of crime in the country using statistical and machine learning models. The Tweepy Python module's search function and '#crime' query have been used to scrape relevant tweets under geographical constraints, followed by substring-keyword classification using 318 unique crime keywords. The Bokeh and gmaps Python modules create analytical and geospatial visualizations, respectively. Time series forecasting of crime tweet count is performed by comparing the accuracy of Long Short-Term Memory (LSTM), Auto-Regressive Integrated Moving Average (ARIMA), and Seasonal Auto-Regressivee Integrated Moving Average (SARIMA) models to determine the best model. Springer Nature Singapore 2023-05-06 2023 /pmc/articles/PMC10163854/ /pubmed/37193217 http://dx.doi.org/10.1007/s42979-023-01816-y Text en © The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd 2023, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Original Research
Vivek, Meghashyam
Prathap, Boppuru Rudra
Spatio-temporal Crime Analysis and Forecasting on Twitter Data Using Machine Learning Algorithms
title Spatio-temporal Crime Analysis and Forecasting on Twitter Data Using Machine Learning Algorithms
title_full Spatio-temporal Crime Analysis and Forecasting on Twitter Data Using Machine Learning Algorithms
title_fullStr Spatio-temporal Crime Analysis and Forecasting on Twitter Data Using Machine Learning Algorithms
title_full_unstemmed Spatio-temporal Crime Analysis and Forecasting on Twitter Data Using Machine Learning Algorithms
title_short Spatio-temporal Crime Analysis and Forecasting on Twitter Data Using Machine Learning Algorithms
title_sort spatio-temporal crime analysis and forecasting on twitter data using machine learning algorithms
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10163854/
https://www.ncbi.nlm.nih.gov/pubmed/37193217
http://dx.doi.org/10.1007/s42979-023-01816-y
work_keys_str_mv AT vivekmeghashyam spatiotemporalcrimeanalysisandforecastingontwitterdatausingmachinelearningalgorithms
AT prathapboppururudra spatiotemporalcrimeanalysisandforecastingontwitterdatausingmachinelearningalgorithms