Cargando…

Integrating Spatial and Temporal Information for Violent Activity Detection from Video Using Deep Spiking Neural Networks

Increasing violence in workplaces such as hospitals seriously challenges public safety. However, it is time- and labor-consuming to visually monitor masses of video data in real time. Therefore, automatic and timely violent activity detection from videos is vital, especially for small monitoring sys...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wang, Xiang, Yang, Jie, Kasabov, Nikola K.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10181528/ https://www.ncbi.nlm.nih.gov/pubmed/37177737 http://dx.doi.org/10.3390/s23094532

_version_	1785041595775582208
author	Wang, Xiang Yang, Jie Kasabov, Nikola K.
author_facet	Wang, Xiang Yang, Jie Kasabov, Nikola K.
author_sort	Wang, Xiang
collection	PubMed
description	Increasing violence in workplaces such as hospitals seriously challenges public safety. However, it is time- and labor-consuming to visually monitor masses of video data in real time. Therefore, automatic and timely violent activity detection from videos is vital, especially for small monitoring systems. This paper proposes a two-stream deep learning architecture for video violent activity detection named SpikeConvFlowNet. First, RGB frames and their optical flow data are used as inputs for each stream to extract the spatiotemporal features of videos. After that, the spatiotemporal features from the two streams are concatenated and fed to the classifier for the final decision. Each stream utilizes a supervised neural network consisting of multiple convolutional spiking and pooling layers. Convolutional layers are used to extract high-quality spatial features within frames, and spiking neurons can efficiently extract temporal features across frames by remembering historical information. The spiking neuron-based optical flow can strengthen the capability of extracting critical motion information. This method combines their advantages to enhance the performance and efficiency for recognizing violent actions. The experimental results on public datasets demonstrate that, compared with the latest methods, this approach greatly reduces parameters and achieves higher inference efficiency with limited accuracy loss. It is a potential solution for applications in embedded devices that provide low computing power but require fast processing speeds.
format	Online Article Text
id	pubmed-10181528
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-101815282023-05-13 Integrating Spatial and Temporal Information for Violent Activity Detection from Video Using Deep Spiking Neural Networks Wang, Xiang Yang, Jie Kasabov, Nikola K. Sensors (Basel) Article Increasing violence in workplaces such as hospitals seriously challenges public safety. However, it is time- and labor-consuming to visually monitor masses of video data in real time. Therefore, automatic and timely violent activity detection from videos is vital, especially for small monitoring systems. This paper proposes a two-stream deep learning architecture for video violent activity detection named SpikeConvFlowNet. First, RGB frames and their optical flow data are used as inputs for each stream to extract the spatiotemporal features of videos. After that, the spatiotemporal features from the two streams are concatenated and fed to the classifier for the final decision. Each stream utilizes a supervised neural network consisting of multiple convolutional spiking and pooling layers. Convolutional layers are used to extract high-quality spatial features within frames, and spiking neurons can efficiently extract temporal features across frames by remembering historical information. The spiking neuron-based optical flow can strengthen the capability of extracting critical motion information. This method combines their advantages to enhance the performance and efficiency for recognizing violent actions. The experimental results on public datasets demonstrate that, compared with the latest methods, this approach greatly reduces parameters and achieves higher inference efficiency with limited accuracy loss. It is a potential solution for applications in embedded devices that provide low computing power but require fast processing speeds. MDPI 2023-05-06 /pmc/articles/PMC10181528/ /pubmed/37177737 http://dx.doi.org/10.3390/s23094532 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Wang, Xiang Yang, Jie Kasabov, Nikola K. Integrating Spatial and Temporal Information for Violent Activity Detection from Video Using Deep Spiking Neural Networks
title	Integrating Spatial and Temporal Information for Violent Activity Detection from Video Using Deep Spiking Neural Networks
title_full	Integrating Spatial and Temporal Information for Violent Activity Detection from Video Using Deep Spiking Neural Networks
title_fullStr	Integrating Spatial and Temporal Information for Violent Activity Detection from Video Using Deep Spiking Neural Networks
title_full_unstemmed	Integrating Spatial and Temporal Information for Violent Activity Detection from Video Using Deep Spiking Neural Networks
title_short	Integrating Spatial and Temporal Information for Violent Activity Detection from Video Using Deep Spiking Neural Networks
title_sort	integrating spatial and temporal information for violent activity detection from video using deep spiking neural networks
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10181528/ https://www.ncbi.nlm.nih.gov/pubmed/37177737 http://dx.doi.org/10.3390/s23094532
work_keys_str_mv	AT wangxiang integratingspatialandtemporalinformationforviolentactivitydetectionfromvideousingdeepspikingneuralnetworks AT yangjie integratingspatialandtemporalinformationforviolentactivitydetectionfromvideousingdeepspikingneuralnetworks AT kasabovnikolak integratingspatialandtemporalinformationforviolentactivitydetectionfromvideousingdeepspikingneuralnetworks

Integrating Spatial and Temporal Information for Violent Activity Detection from Video Using Deep Spiking Neural Networks

Ejemplares similares