Cargando…

Data Collection, Modeling, and Classification for Gunshot and Gunshot-like Audio Events: A Case Study

Distinguishing between a dangerous audio event like a gun firing and other non-life-threatening events, such as a plastic bag bursting, can mean the difference between life and death and, therefore, the necessary and unnecessary deployment of public safety personnel. Sounds generated by plastic bag...

Descripción completa

Detalles Bibliográficos
Autores principales:	Baliram Singh, Rajesh, Zhuang, Hanqi, Pawani, Jeet Kiran
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Communication
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8587567/ https://www.ncbi.nlm.nih.gov/pubmed/34770635 http://dx.doi.org/10.3390/s21217320

_version_	1784598182027591680
author	Baliram Singh, Rajesh Zhuang, Hanqi Pawani, Jeet Kiran
author_facet	Baliram Singh, Rajesh Zhuang, Hanqi Pawani, Jeet Kiran
author_sort	Baliram Singh, Rajesh
collection	PubMed
description	Distinguishing between a dangerous audio event like a gun firing and other non-life-threatening events, such as a plastic bag bursting, can mean the difference between life and death and, therefore, the necessary and unnecessary deployment of public safety personnel. Sounds generated by plastic bag explosions are often confused with real gunshot sounds, by either humans or computer algorithms. As a case study, the research reported in this paper offers insight into sounds of plastic bag explosions and gunshots. An experimental study in this research reveals that a deep learning-based classification model trained with a popular urban sound dataset containing gunshot sounds cannot distinguish plastic bag pop sounds from gunshot sounds. This study further shows that the same deep learning model, if trained with a dataset containing plastic pop sounds, can effectively detect the non-life-threatening sounds. For this purpose, first, a collection of plastic bag-popping sounds was recorded in different environments with varying parameters, such as plastic bag size and distance from the recording microphones. The audio clips’ duration ranged from 400 ms to 600 ms. This collection of data was then used, together with a gunshot sound dataset, to train a classification model based on a convolutional neural network (CNN) to differentiate life-threatening gunshot events from non-life-threatening plastic bag explosion events. A comparison between two feature extraction methods, the Mel-frequency cepstral coefficients (MFCC) and Mel-spectrograms, was also done. Experimental studies conducted in this research show that once the plastic bag pop sounds are injected into model training, the CNN classification model performs well in distinguishing actual gunshot sounds from plastic bag sounds.
format	Online Article Text
id	pubmed-8587567
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-85875672021-11-13 Data Collection, Modeling, and Classification for Gunshot and Gunshot-like Audio Events: A Case Study Baliram Singh, Rajesh Zhuang, Hanqi Pawani, Jeet Kiran Sensors (Basel) Communication Distinguishing between a dangerous audio event like a gun firing and other non-life-threatening events, such as a plastic bag bursting, can mean the difference between life and death and, therefore, the necessary and unnecessary deployment of public safety personnel. Sounds generated by plastic bag explosions are often confused with real gunshot sounds, by either humans or computer algorithms. As a case study, the research reported in this paper offers insight into sounds of plastic bag explosions and gunshots. An experimental study in this research reveals that a deep learning-based classification model trained with a popular urban sound dataset containing gunshot sounds cannot distinguish plastic bag pop sounds from gunshot sounds. This study further shows that the same deep learning model, if trained with a dataset containing plastic pop sounds, can effectively detect the non-life-threatening sounds. For this purpose, first, a collection of plastic bag-popping sounds was recorded in different environments with varying parameters, such as plastic bag size and distance from the recording microphones. The audio clips’ duration ranged from 400 ms to 600 ms. This collection of data was then used, together with a gunshot sound dataset, to train a classification model based on a convolutional neural network (CNN) to differentiate life-threatening gunshot events from non-life-threatening plastic bag explosion events. A comparison between two feature extraction methods, the Mel-frequency cepstral coefficients (MFCC) and Mel-spectrograms, was also done. Experimental studies conducted in this research show that once the plastic bag pop sounds are injected into model training, the CNN classification model performs well in distinguishing actual gunshot sounds from plastic bag sounds. MDPI 2021-11-03 /pmc/articles/PMC8587567/ /pubmed/34770635 http://dx.doi.org/10.3390/s21217320 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Communication Baliram Singh, Rajesh Zhuang, Hanqi Pawani, Jeet Kiran Data Collection, Modeling, and Classification for Gunshot and Gunshot-like Audio Events: A Case Study
title	Data Collection, Modeling, and Classification for Gunshot and Gunshot-like Audio Events: A Case Study
title_full	Data Collection, Modeling, and Classification for Gunshot and Gunshot-like Audio Events: A Case Study
title_fullStr	Data Collection, Modeling, and Classification for Gunshot and Gunshot-like Audio Events: A Case Study
title_full_unstemmed	Data Collection, Modeling, and Classification for Gunshot and Gunshot-like Audio Events: A Case Study
title_short	Data Collection, Modeling, and Classification for Gunshot and Gunshot-like Audio Events: A Case Study
title_sort	data collection, modeling, and classification for gunshot and gunshot-like audio events: a case study
topic	Communication
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8587567/ https://www.ncbi.nlm.nih.gov/pubmed/34770635 http://dx.doi.org/10.3390/s21217320
work_keys_str_mv	AT baliramsinghrajesh datacollectionmodelingandclassificationforgunshotandgunshotlikeaudioeventsacasestudy AT zhuanghanqi datacollectionmodelingandclassificationforgunshotandgunshotlikeaudioeventsacasestudy AT pawanijeetkiran datacollectionmodelingandclassificationforgunshotandgunshotlikeaudioeventsacasestudy

Data Collection, Modeling, and Classification for Gunshot and Gunshot-like Audio Events: A Case Study

Ejemplares similares