Cargando…

Violence Detection Using Spatiotemporal Features with 3D Convolutional Neural Network

The worldwide utilization of surveillance cameras in smart cities has enabled researchers to analyze a gigantic volume of data to ensure automatic monitoring. An enhanced security system in smart cities, schools, hospitals, and other surveillance domains is mandatory for the detection of violent or...

Descripción completa

Detalles Bibliográficos
Autores principales: Ullah, Fath U Min, Ullah, Amin, Muhammad, Khan, Haq, Ijaz Ul, Baik, Sung Wook
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6603512/
https://www.ncbi.nlm.nih.gov/pubmed/31151184
http://dx.doi.org/10.3390/s19112472
_version_ 1783431522139766784
author Ullah, Fath U Min
Ullah, Amin
Muhammad, Khan
Haq, Ijaz Ul
Baik, Sung Wook
author_facet Ullah, Fath U Min
Ullah, Amin
Muhammad, Khan
Haq, Ijaz Ul
Baik, Sung Wook
author_sort Ullah, Fath U Min
collection PubMed
description The worldwide utilization of surveillance cameras in smart cities has enabled researchers to analyze a gigantic volume of data to ensure automatic monitoring. An enhanced security system in smart cities, schools, hospitals, and other surveillance domains is mandatory for the detection of violent or abnormal activities to avoid any casualties which could cause social, economic, and ecological damages. Automatic detection of violence for quick actions is very significant and can efficiently assist the concerned departments. In this paper, we propose a triple-staged end-to-end deep learning violence detection framework. First, persons are detected in the surveillance video stream using a light-weight convolutional neural network (CNN) model to reduce and overcome the voluminous processing of useless frames. Second, a sequence of 16 frames with detected persons is passed to 3D CNN, where the spatiotemporal features of these sequences are extracted and fed to the Softmax classifier. Furthermore, we optimized the 3D CNN model using an open visual inference and neural networks optimization toolkit developed by Intel, which converts the trained model into intermediate representation and adjusts it for optimal execution at the end platform for the final prediction of violent activity. After detection of a violent activity, an alert is transmitted to the nearest police station or security department to take prompt preventive actions. We found that our proposed method outperforms the existing state-of-the-art methods for different benchmark datasets.
format Online
Article
Text
id pubmed-6603512
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-66035122019-07-19 Violence Detection Using Spatiotemporal Features with 3D Convolutional Neural Network Ullah, Fath U Min Ullah, Amin Muhammad, Khan Haq, Ijaz Ul Baik, Sung Wook Sensors (Basel) Article The worldwide utilization of surveillance cameras in smart cities has enabled researchers to analyze a gigantic volume of data to ensure automatic monitoring. An enhanced security system in smart cities, schools, hospitals, and other surveillance domains is mandatory for the detection of violent or abnormal activities to avoid any casualties which could cause social, economic, and ecological damages. Automatic detection of violence for quick actions is very significant and can efficiently assist the concerned departments. In this paper, we propose a triple-staged end-to-end deep learning violence detection framework. First, persons are detected in the surveillance video stream using a light-weight convolutional neural network (CNN) model to reduce and overcome the voluminous processing of useless frames. Second, a sequence of 16 frames with detected persons is passed to 3D CNN, where the spatiotemporal features of these sequences are extracted and fed to the Softmax classifier. Furthermore, we optimized the 3D CNN model using an open visual inference and neural networks optimization toolkit developed by Intel, which converts the trained model into intermediate representation and adjusts it for optimal execution at the end platform for the final prediction of violent activity. After detection of a violent activity, an alert is transmitted to the nearest police station or security department to take prompt preventive actions. We found that our proposed method outperforms the existing state-of-the-art methods for different benchmark datasets. MDPI 2019-05-30 /pmc/articles/PMC6603512/ /pubmed/31151184 http://dx.doi.org/10.3390/s19112472 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Ullah, Fath U Min
Ullah, Amin
Muhammad, Khan
Haq, Ijaz Ul
Baik, Sung Wook
Violence Detection Using Spatiotemporal Features with 3D Convolutional Neural Network
title Violence Detection Using Spatiotemporal Features with 3D Convolutional Neural Network
title_full Violence Detection Using Spatiotemporal Features with 3D Convolutional Neural Network
title_fullStr Violence Detection Using Spatiotemporal Features with 3D Convolutional Neural Network
title_full_unstemmed Violence Detection Using Spatiotemporal Features with 3D Convolutional Neural Network
title_short Violence Detection Using Spatiotemporal Features with 3D Convolutional Neural Network
title_sort violence detection using spatiotemporal features with 3d convolutional neural network
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6603512/
https://www.ncbi.nlm.nih.gov/pubmed/31151184
http://dx.doi.org/10.3390/s19112472
work_keys_str_mv AT ullahfathumin violencedetectionusingspatiotemporalfeatureswith3dconvolutionalneuralnetwork
AT ullahamin violencedetectionusingspatiotemporalfeatureswith3dconvolutionalneuralnetwork
AT muhammadkhan violencedetectionusingspatiotemporalfeatureswith3dconvolutionalneuralnetwork
AT haqijazul violencedetectionusingspatiotemporalfeatureswith3dconvolutionalneuralnetwork
AT baiksungwook violencedetectionusingspatiotemporalfeatureswith3dconvolutionalneuralnetwork