Cargando…

Car engine sounds recognition based on deformable feature map residual network

Aiming at the difficulty in extracting the features of time–frequency images for the recognition of car engine sounds, we propose a method to recognize them based on a deformable feature map residual network. A deformable feature map residual block includes offset and convolutional layers. The offse...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Zhuangwen, Wan, Zhiping, Ge, Dongdong, Pan, Ludan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8854583/
https://www.ncbi.nlm.nih.gov/pubmed/35177780
http://dx.doi.org/10.1038/s41598-022-06818-z
_version_ 1784653462240231424
author Wu, Zhuangwen
Wan, Zhiping
Ge, Dongdong
Pan, Ludan
author_facet Wu, Zhuangwen
Wan, Zhiping
Ge, Dongdong
Pan, Ludan
author_sort Wu, Zhuangwen
collection PubMed
description Aiming at the difficulty in extracting the features of time–frequency images for the recognition of car engine sounds, we propose a method to recognize them based on a deformable feature map residual network. A deformable feature map residual block includes offset and convolutional layers. The offset layers shift the pixels of the input feature map. The shifted feature map is superimposed on the feature map extracted by the convolutional layers through shortcut connections to concentrate the network to the sampling in the region of interest, and to transmit the information of the offset feature map to the lower network. Then, a deformable convolution residual network is designed, and the features extracted through this network are fused with the Mel frequency cepstral coefficients of car engine sounds. After recalibration by the squeeze and excitation block, the fused results are fed into the fully connected layer for classification. Experiments on a car engine sound dataset show that the accuracy of the proposed method is 84.28%. Compared with the existing state-of-the-art methods, in terms of the accuracy of recognizing car engine sounds under various operating conditions, the proposed method represents an improvement over the method based on dictionary learning and a convolutional neural network.
format Online
Article
Text
id pubmed-8854583
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-88545832022-02-18 Car engine sounds recognition based on deformable feature map residual network Wu, Zhuangwen Wan, Zhiping Ge, Dongdong Pan, Ludan Sci Rep Article Aiming at the difficulty in extracting the features of time–frequency images for the recognition of car engine sounds, we propose a method to recognize them based on a deformable feature map residual network. A deformable feature map residual block includes offset and convolutional layers. The offset layers shift the pixels of the input feature map. The shifted feature map is superimposed on the feature map extracted by the convolutional layers through shortcut connections to concentrate the network to the sampling in the region of interest, and to transmit the information of the offset feature map to the lower network. Then, a deformable convolution residual network is designed, and the features extracted through this network are fused with the Mel frequency cepstral coefficients of car engine sounds. After recalibration by the squeeze and excitation block, the fused results are fed into the fully connected layer for classification. Experiments on a car engine sound dataset show that the accuracy of the proposed method is 84.28%. Compared with the existing state-of-the-art methods, in terms of the accuracy of recognizing car engine sounds under various operating conditions, the proposed method represents an improvement over the method based on dictionary learning and a convolutional neural network. Nature Publishing Group UK 2022-02-17 /pmc/articles/PMC8854583/ /pubmed/35177780 http://dx.doi.org/10.1038/s41598-022-06818-z Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Wu, Zhuangwen
Wan, Zhiping
Ge, Dongdong
Pan, Ludan
Car engine sounds recognition based on deformable feature map residual network
title Car engine sounds recognition based on deformable feature map residual network
title_full Car engine sounds recognition based on deformable feature map residual network
title_fullStr Car engine sounds recognition based on deformable feature map residual network
title_full_unstemmed Car engine sounds recognition based on deformable feature map residual network
title_short Car engine sounds recognition based on deformable feature map residual network
title_sort car engine sounds recognition based on deformable feature map residual network
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8854583/
https://www.ncbi.nlm.nih.gov/pubmed/35177780
http://dx.doi.org/10.1038/s41598-022-06818-z
work_keys_str_mv AT wuzhuangwen carenginesoundsrecognitionbasedondeformablefeaturemapresidualnetwork
AT wanzhiping carenginesoundsrecognitionbasedondeformablefeaturemapresidualnetwork
AT gedongdong carenginesoundsrecognitionbasedondeformablefeaturemapresidualnetwork
AT panludan carenginesoundsrecognitionbasedondeformablefeaturemapresidualnetwork