Cargando…

An Automatic Classification System for Environmental Sound in Smart Cities

With the continuous promotion of “smart cities” worldwide, the approach to be used in combining smart cities with modern advanced technologies (Internet of Things, cloud computing, artificial intelligence) has become a hot topic. However, due to the non-stationary nature of environmental sound and t...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Dongping, Zhong, Ziyin, Xia, Yuejian, Wang, Zhutao, Xiong, Wenbo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10422509/
https://www.ncbi.nlm.nih.gov/pubmed/37571606
http://dx.doi.org/10.3390/s23156823
_version_ 1785089228017762304
author Zhang, Dongping
Zhong, Ziyin
Xia, Yuejian
Wang, Zhutao
Xiong, Wenbo
author_facet Zhang, Dongping
Zhong, Ziyin
Xia, Yuejian
Wang, Zhutao
Xiong, Wenbo
author_sort Zhang, Dongping
collection PubMed
description With the continuous promotion of “smart cities” worldwide, the approach to be used in combining smart cities with modern advanced technologies (Internet of Things, cloud computing, artificial intelligence) has become a hot topic. However, due to the non-stationary nature of environmental sound and the interference of urban noise, it is challenging to fully extract features from the model with a single input and achieve ideal classification results, even with deep learning methods. To improve the recognition accuracy of ESC (environmental sound classification), we propose a dual-branch residual network (dual-resnet) based on feature fusion. Furthermore, in terms of data pre-processing, a loop-padding method is proposed to patch shorter data, enabling it to obtain more useful information. At the same time, in order to prevent the occurrence of overfitting, we use the time-frequency data enhancement method to expand the dataset. After uniform pre-processing of all the original audio, the dual-branch residual network automatically extracts the frequency domain features of the log-Mel spectrogram and log-spectrogram. Then, the two different audio features are fused to make the representation of the audio features more comprehensive. The experimental results show that compared with other models, the classification accuracy of the UrbanSound8k dataset has been improved to different degrees.
format Online
Article
Text
id pubmed-10422509
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-104225092023-08-13 An Automatic Classification System for Environmental Sound in Smart Cities Zhang, Dongping Zhong, Ziyin Xia, Yuejian Wang, Zhutao Xiong, Wenbo Sensors (Basel) Article With the continuous promotion of “smart cities” worldwide, the approach to be used in combining smart cities with modern advanced technologies (Internet of Things, cloud computing, artificial intelligence) has become a hot topic. However, due to the non-stationary nature of environmental sound and the interference of urban noise, it is challenging to fully extract features from the model with a single input and achieve ideal classification results, even with deep learning methods. To improve the recognition accuracy of ESC (environmental sound classification), we propose a dual-branch residual network (dual-resnet) based on feature fusion. Furthermore, in terms of data pre-processing, a loop-padding method is proposed to patch shorter data, enabling it to obtain more useful information. At the same time, in order to prevent the occurrence of overfitting, we use the time-frequency data enhancement method to expand the dataset. After uniform pre-processing of all the original audio, the dual-branch residual network automatically extracts the frequency domain features of the log-Mel spectrogram and log-spectrogram. Then, the two different audio features are fused to make the representation of the audio features more comprehensive. The experimental results show that compared with other models, the classification accuracy of the UrbanSound8k dataset has been improved to different degrees. MDPI 2023-07-31 /pmc/articles/PMC10422509/ /pubmed/37571606 http://dx.doi.org/10.3390/s23156823 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhang, Dongping
Zhong, Ziyin
Xia, Yuejian
Wang, Zhutao
Xiong, Wenbo
An Automatic Classification System for Environmental Sound in Smart Cities
title An Automatic Classification System for Environmental Sound in Smart Cities
title_full An Automatic Classification System for Environmental Sound in Smart Cities
title_fullStr An Automatic Classification System for Environmental Sound in Smart Cities
title_full_unstemmed An Automatic Classification System for Environmental Sound in Smart Cities
title_short An Automatic Classification System for Environmental Sound in Smart Cities
title_sort automatic classification system for environmental sound in smart cities
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10422509/
https://www.ncbi.nlm.nih.gov/pubmed/37571606
http://dx.doi.org/10.3390/s23156823
work_keys_str_mv AT zhangdongping anautomaticclassificationsystemforenvironmentalsoundinsmartcities
AT zhongziyin anautomaticclassificationsystemforenvironmentalsoundinsmartcities
AT xiayuejian anautomaticclassificationsystemforenvironmentalsoundinsmartcities
AT wangzhutao anautomaticclassificationsystemforenvironmentalsoundinsmartcities
AT xiongwenbo anautomaticclassificationsystemforenvironmentalsoundinsmartcities
AT zhangdongping automaticclassificationsystemforenvironmentalsoundinsmartcities
AT zhongziyin automaticclassificationsystemforenvironmentalsoundinsmartcities
AT xiayuejian automaticclassificationsystemforenvironmentalsoundinsmartcities
AT wangzhutao automaticclassificationsystemforenvironmentalsoundinsmartcities
AT xiongwenbo automaticclassificationsystemforenvironmentalsoundinsmartcities