Convolutional Recurrent Neural Network-Based Event Detection in Tunnels Using Multiple Microphones

This paper proposes a sound event detection (SED) method in tunnels to prevent further uncontrollable accidents. Tunnel accidents are accompanied by crashes and tire skids, which usually produce abnormal sounds. Since the tunnel environment always has a severe level of noise, the detection accuracy...

Bibliographic Details
Main authors:	Kim, Nam Kyun; Jeon, Kwang Myung; Kim, Hong Kook
Format:	Online Article Text
Language:	English
Published:	MDPI 2019
Subjects:	Article
Online access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6631336/ https://www.ncbi.nlm.nih.gov/pubmed/31208007 http://dx.doi.org/10.3390/s19122695

_version_	1783435496986247168
author	Kim, Nam Kyun; Jeon, Kwang Myung; Kim, Hong Kook
author_sort	Kim, Nam Kyun
collection	PubMed
description	This paper proposes a sound event detection (SED) method in tunnels to prevent further uncontrollable accidents. Tunnel accidents are accompanied by crashes and tire skids, which usually produce abnormal sounds. Since the tunnel environment always has a severe level of noise, the detection accuracy can be greatly reduced in the existing methods. To deal with the noise issue in the tunnel environment, the proposed method involves the preprocessing of tunnel acoustic signals and a classifier for detecting acoustic events in tunnels. For preprocessing, a non-negative tensor factorization (NTF) technique is used to separate the acoustic event signal from the noisy signal in the tunnel. In particular, the NTF technique developed in this paper consists of source separation and online noise learning. In other words, the noise basis is adapted by an online noise learning technique for enhancement in adverse noise conditions. Next, a convolutional recurrent neural network (CRNN) is extended to accommodate the contributions of the separated event signal and noise to the event detection; thus, the proposed CRNN is composed of event convolution layers and noise convolution layers in parallel followed by recurrent layers and the output layer. Here, a set of mel-filterbank feature parameters is used as the input features. Evaluations of the proposed method are conducted on two datasets: a publicly available road audio events dataset and a tunnel audio dataset recorded in a real traffic tunnel for six months. In the first evaluation where the background noise is low, the proposed CRNN-based SED method with online noise learning reduces the relative recognition error rate by 56.25% when compared to the conventional CRNN-based method with noise. In the second evaluation, where the tunnel background noise is more severe than in the first evaluation, the proposed CRNN-based SED method yields superior performance when compared to the conventional methods. 
In particular, among all of the compared methods, the proposed method with online noise learning provides the best recognition rate of 91.07% and reduces the recognition error rates by 47.40% and 28.56% when compared to the Gaussian mixture model (GMM)–hidden Markov model (HMM)-based and conventional CRNN-based SED methods, respectively. The computational complexity measurements also show that the proposed CRNN-based SED method requires a processing time of 599 ms for both the NTF-based source separation with online noise learning and CRNN classification when the noisy tunnel signal is one second long, which implies that the proposed method detects events in real time.
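The figures quoted in the description can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the recognition error rate is 100% minus the recognition rate, and that a relative reduction is measured against the baseline error rate. The function names are hypothetical, and the baseline error rates it derives are implied values, not figures reported in the record.

```python
def error_rate(recognition_rate_pct):
    """Error rate in percent, ASSUMING error = 100 - recognition rate."""
    return 100.0 - recognition_rate_pct


def baseline_error(proposed_error_pct, relative_reduction_pct):
    """Baseline error implied by a relative reduction.

    ASSUMES reduction = (baseline - proposed) / baseline, so
    baseline = proposed / (1 - reduction).
    """
    return proposed_error_pct / (1.0 - relative_reduction_pct / 100.0)


def real_time_factor(processing_ms, signal_ms):
    """Processing time divided by signal duration; below 1 means faster than real time."""
    return processing_ms / signal_ms


# Values taken from the description: 91.07% recognition rate, 47.40% and
# 28.56% relative error reductions, 599 ms of processing per 1 s of signal.
proposed_error = error_rate(91.07)
gmm_hmm_error = baseline_error(proposed_error, 47.40)
crnn_error = baseline_error(proposed_error, 28.56)
rtf = real_time_factor(599.0, 1000.0)

print(f"proposed error rate:        {proposed_error:.2f}%")
print(f"implied GMM-HMM error rate: {gmm_hmm_error:.2f}%")
print(f"implied CRNN error rate:    {crnn_error:.2f}%")
print(f"real-time factor:           {rtf:.3f}")
```

Under these assumptions the proposed method's error rate is 8.93%, the implied baseline error rates are roughly 16.98% (GMM-HMM) and 12.50% (conventional CRNN), and the real-time factor is 0.599, which is consistent with the real-time claim because one second of audio is processed in less than one second.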
format	Online Article Text
id	pubmed-6631336
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-6631336 2019-08-19 Convolutional Recurrent Neural Network-Based Event Detection in Tunnels Using Multiple Microphones Kim, Nam Kyun; Jeon, Kwang Myung; Kim, Hong Kook Sensors (Basel) Article MDPI 2019-06-14 /pmc/articles/PMC6631336/ /pubmed/31208007 http://dx.doi.org/10.3390/s19122695 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
title	Convolutional Recurrent Neural Network-Based Event Detection in Tunnels Using Multiple Microphones
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6631336/ https://www.ncbi.nlm.nih.gov/pubmed/31208007 http://dx.doi.org/10.3390/s19122695
