Cargando…
Spatio-temporal Crime Analysis and Forecasting on Twitter Data Using Machine Learning Algorithms
The concept of social media began to gain popularity in the late 1990s and has played a significant role in connecting people across the globe. The constant addition of features to old social media platforms and the creation of new ones have helped amass and retain an extensive user base. Users coul...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Nature Singapore
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10163854/ https://www.ncbi.nlm.nih.gov/pubmed/37193217 http://dx.doi.org/10.1007/s42979-023-01816-y |
_version_ | 1785037968939941888 |
---|---|
author | Vivek, Meghashyam Prathap, Boppuru Rudra |
author_facet | Vivek, Meghashyam Prathap, Boppuru Rudra |
author_sort | Vivek, Meghashyam |
collection | PubMed |
description | The concept of social media began to gain popularity in the late 1990s and has played a significant role in connecting people across the globe. The constant addition of features to old social media platforms and the creation of new ones have helped amass and retain an extensive user base. Users could now share their views and provide detailed accounts of events from worldwide to reach like-minded people. This led to the popularization of blogging and brought into focus the posts of the commoner. These posts began to be verified and included in mainstream news articles bringing about a revolution in journalism. This research aims to use a social media platform, Twitter, to classify, visualize, and forecast Indian crime tweet data and provide a spatio-temporal view of crime in the country using statistical and machine learning models. The Tweepy Python module's search function and '#crime' query have been used to scrape relevant tweets under geographical constraints, followed by substring-keyword classification using 318 unique crime keywords. The Bokeh and gmaps Python modules create analytical and geospatial visualizations, respectively. Time series forecasting of crime tweet count is performed by comparing the accuracy of Long Short-Term Memory (LSTM), Auto-Regressive Integrated Moving Average (ARIMA), and Seasonal Auto-Regressivee Integrated Moving Average (SARIMA) models to determine the best model. |
format | Online Article Text |
id | pubmed-10163854 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Springer Nature Singapore |
record_format | MEDLINE/PubMed |
spelling | pubmed-101638542023-05-09 Spatio-temporal Crime Analysis and Forecasting on Twitter Data Using Machine Learning Algorithms Vivek, Meghashyam Prathap, Boppuru Rudra SN Comput Sci Original Research The concept of social media began to gain popularity in the late 1990s and has played a significant role in connecting people across the globe. The constant addition of features to old social media platforms and the creation of new ones have helped amass and retain an extensive user base. Users could now share their views and provide detailed accounts of events from worldwide to reach like-minded people. This led to the popularization of blogging and brought into focus the posts of the commoner. These posts began to be verified and included in mainstream news articles bringing about a revolution in journalism. This research aims to use a social media platform, Twitter, to classify, visualize, and forecast Indian crime tweet data and provide a spatio-temporal view of crime in the country using statistical and machine learning models. The Tweepy Python module's search function and '#crime' query have been used to scrape relevant tweets under geographical constraints, followed by substring-keyword classification using 318 unique crime keywords. The Bokeh and gmaps Python modules create analytical and geospatial visualizations, respectively. Time series forecasting of crime tweet count is performed by comparing the accuracy of Long Short-Term Memory (LSTM), Auto-Regressive Integrated Moving Average (ARIMA), and Seasonal Auto-Regressivee Integrated Moving Average (SARIMA) models to determine the best model. Springer Nature Singapore 2023-05-06 2023 /pmc/articles/PMC10163854/ /pubmed/37193217 http://dx.doi.org/10.1007/s42979-023-01816-y Text en © The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd 2023, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Original Research Vivek, Meghashyam Prathap, Boppuru Rudra Spatio-temporal Crime Analysis and Forecasting on Twitter Data Using Machine Learning Algorithms |
title | Spatio-temporal Crime Analysis and Forecasting on Twitter Data Using Machine Learning Algorithms |
title_full | Spatio-temporal Crime Analysis and Forecasting on Twitter Data Using Machine Learning Algorithms |
title_fullStr | Spatio-temporal Crime Analysis and Forecasting on Twitter Data Using Machine Learning Algorithms |
title_full_unstemmed | Spatio-temporal Crime Analysis and Forecasting on Twitter Data Using Machine Learning Algorithms |
title_short | Spatio-temporal Crime Analysis and Forecasting on Twitter Data Using Machine Learning Algorithms |
title_sort | spatio-temporal crime analysis and forecasting on twitter data using machine learning algorithms |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10163854/ https://www.ncbi.nlm.nih.gov/pubmed/37193217 http://dx.doi.org/10.1007/s42979-023-01816-y |
work_keys_str_mv | AT vivekmeghashyam spatiotemporalcrimeanalysisandforecastingontwitterdatausingmachinelearningalgorithms AT prathapboppururudra spatiotemporalcrimeanalysisandforecastingontwitterdatausingmachinelearningalgorithms |