Cargando…
Car engine sounds recognition based on deformable feature map residual network
Aiming at the difficulty in extracting the features of time–frequency images for the recognition of car engine sounds, we propose a method to recognize them based on a deformable feature map residual network. A deformable feature map residual block includes offset and convolutional layers. The offse...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8854583/ https://www.ncbi.nlm.nih.gov/pubmed/35177780 http://dx.doi.org/10.1038/s41598-022-06818-z |
_version_ | 1784653462240231424 |
---|---|
author | Wu, Zhuangwen Wan, Zhiping Ge, Dongdong Pan, Ludan |
author_facet | Wu, Zhuangwen Wan, Zhiping Ge, Dongdong Pan, Ludan |
author_sort | Wu, Zhuangwen |
collection | PubMed |
description | Aiming at the difficulty in extracting the features of time–frequency images for the recognition of car engine sounds, we propose a method to recognize them based on a deformable feature map residual network. A deformable feature map residual block includes offset and convolutional layers. The offset layers shift the pixels of the input feature map. The shifted feature map is superimposed on the feature map extracted by the convolutional layers through shortcut connections to concentrate the network to the sampling in the region of interest, and to transmit the information of the offset feature map to the lower network. Then, a deformable convolution residual network is designed, and the features extracted through this network are fused with the Mel frequency cepstral coefficients of car engine sounds. After recalibration by the squeeze and excitation block, the fused results are fed into the fully connected layer for classification. Experiments on a car engine sound dataset show that the accuracy of the proposed method is 84.28%. Compared with the existing state-of-the-art methods, in terms of the accuracy of recognizing car engine sounds under various operating conditions, the proposed method represents an improvement over the method based on dictionary learning and a convolutional neural network. |
format | Online Article Text |
id | pubmed-8854583 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-88545832022-02-18 Car engine sounds recognition based on deformable feature map residual network Wu, Zhuangwen Wan, Zhiping Ge, Dongdong Pan, Ludan Sci Rep Article Aiming at the difficulty in extracting the features of time–frequency images for the recognition of car engine sounds, we propose a method to recognize them based on a deformable feature map residual network. A deformable feature map residual block includes offset and convolutional layers. The offset layers shift the pixels of the input feature map. The shifted feature map is superimposed on the feature map extracted by the convolutional layers through shortcut connections to concentrate the network to the sampling in the region of interest, and to transmit the information of the offset feature map to the lower network. Then, a deformable convolution residual network is designed, and the features extracted through this network are fused with the Mel frequency cepstral coefficients of car engine sounds. After recalibration by the squeeze and excitation block, the fused results are fed into the fully connected layer for classification. Experiments on a car engine sound dataset show that the accuracy of the proposed method is 84.28%. Compared with the existing state-of-the-art methods, in terms of the accuracy of recognizing car engine sounds under various operating conditions, the proposed method represents an improvement over the method based on dictionary learning and a convolutional neural network. Nature Publishing Group UK 2022-02-17 /pmc/articles/PMC8854583/ /pubmed/35177780 http://dx.doi.org/10.1038/s41598-022-06818-z Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Wu, Zhuangwen Wan, Zhiping Ge, Dongdong Pan, Ludan Car engine sounds recognition based on deformable feature map residual network |
title | Car engine sounds recognition based on deformable feature map residual network |
title_full | Car engine sounds recognition based on deformable feature map residual network |
title_fullStr | Car engine sounds recognition based on deformable feature map residual network |
title_full_unstemmed | Car engine sounds recognition based on deformable feature map residual network |
title_short | Car engine sounds recognition based on deformable feature map residual network |
title_sort | car engine sounds recognition based on deformable feature map residual network |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8854583/ https://www.ncbi.nlm.nih.gov/pubmed/35177780 http://dx.doi.org/10.1038/s41598-022-06818-z |
work_keys_str_mv | AT wuzhuangwen carenginesoundsrecognitionbasedondeformablefeaturemapresidualnetwork AT wanzhiping carenginesoundsrecognitionbasedondeformablefeaturemapresidualnetwork AT gedongdong carenginesoundsrecognitionbasedondeformablefeaturemapresidualnetwork AT panludan carenginesoundsrecognitionbasedondeformablefeaturemapresidualnetwork |