Cargando…
BSN-ESC: A Big–Small Network-Based Environmental Sound Classification Method for AIoT Applications
In recent years, environmental sound classification (ESC) has prevailed in many artificial intelligence Internet of Things (AIoT) applications, as environmental sound contains a wealth of information that can be used to detect particular events. However, existing ESC methods have high computational...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10422364/ https://www.ncbi.nlm.nih.gov/pubmed/37571550 http://dx.doi.org/10.3390/s23156767 |
_version_ | 1785089191307116544 |
---|---|
author | Peng, Lujie Yang, Junyu Yan, Longke Chen, Zhiyi Xiao, Jianbiao Zhou, Liang Zhou, Jun |
author_facet | Peng, Lujie Yang, Junyu Yan, Longke Chen, Zhiyi Xiao, Jianbiao Zhou, Liang Zhou, Jun |
author_sort | Peng, Lujie |
collection | PubMed |
description | In recent years, environmental sound classification (ESC) has prevailed in many artificial intelligence Internet of Things (AIoT) applications, as environmental sound contains a wealth of information that can be used to detect particular events. However, existing ESC methods have high computational complexity and are not suitable for deployment on AIoT devices with constrained computing resources. Therefore, it is of great importance to propose a model with both high classification accuracy and low computational complexity. In this work, a new ESC method named BSN-ESC is proposed, including a big–small network-based ESC model that can assess the classification difficulty level and adaptively activate a big or small network for classification as well as a pre-classification processing technique with logmel spectrogram refining, which prevents distortion in the frequency-domain characteristics of the sound clip at the joint part of two adjacent sound clips. With the proposed methods, the computational complexity is significantly reduced, while the classification accuracy is still high. The proposed BSN-ESC model is implemented on both CPU and FPGA to evaluate its performance on both PC and embedded systems with the dataset ESC-50, which is the most commonly used dataset. The proposed BSN-ESC model achieves the lowest computational complexity with the number of floating-point operations (FLOPs) of only 0.123G, which represents a reduction of up to 2309 times in computational complexity compared with state-of-the-art methods while delivering a high classification accuracy of 89.25%. This work can achieve the realization of ESC being applied to AIoT devices with constrained computational resources. |
format | Online Article Text |
id | pubmed-10422364 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-104223642023-08-13 BSN-ESC: A Big–Small Network-Based Environmental Sound Classification Method for AIoT Applications Peng, Lujie Yang, Junyu Yan, Longke Chen, Zhiyi Xiao, Jianbiao Zhou, Liang Zhou, Jun Sensors (Basel) Communication In recent years, environmental sound classification (ESC) has prevailed in many artificial intelligence Internet of Things (AIoT) applications, as environmental sound contains a wealth of information that can be used to detect particular events. However, existing ESC methods have high computational complexity and are not suitable for deployment on AIoT devices with constrained computing resources. Therefore, it is of great importance to propose a model with both high classification accuracy and low computational complexity. In this work, a new ESC method named BSN-ESC is proposed, including a big–small network-based ESC model that can assess the classification difficulty level and adaptively activate a big or small network for classification as well as a pre-classification processing technique with logmel spectrogram refining, which prevents distortion in the frequency-domain characteristics of the sound clip at the joint part of two adjacent sound clips. With the proposed methods, the computational complexity is significantly reduced, while the classification accuracy is still high. The proposed BSN-ESC model is implemented on both CPU and FPGA to evaluate its performance on both PC and embedded systems with the dataset ESC-50, which is the most commonly used dataset. The proposed BSN-ESC model achieves the lowest computational complexity with the number of floating-point operations (FLOPs) of only 0.123G, which represents a reduction of up to 2309 times in computational complexity compared with state-of-the-art methods while delivering a high classification accuracy of 89.25%. This work can achieve the realization of ESC being applied to AIoT devices with constrained computational resources. MDPI 2023-07-28 /pmc/articles/PMC10422364/ /pubmed/37571550 http://dx.doi.org/10.3390/s23156767 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Communication Peng, Lujie Yang, Junyu Yan, Longke Chen, Zhiyi Xiao, Jianbiao Zhou, Liang Zhou, Jun BSN-ESC: A Big–Small Network-Based Environmental Sound Classification Method for AIoT Applications |
title | BSN-ESC: A Big–Small Network-Based Environmental Sound Classification Method for AIoT Applications |
title_full | BSN-ESC: A Big–Small Network-Based Environmental Sound Classification Method for AIoT Applications |
title_fullStr | BSN-ESC: A Big–Small Network-Based Environmental Sound Classification Method for AIoT Applications |
title_full_unstemmed | BSN-ESC: A Big–Small Network-Based Environmental Sound Classification Method for AIoT Applications |
title_short | BSN-ESC: A Big–Small Network-Based Environmental Sound Classification Method for AIoT Applications |
title_sort | bsn-esc: a big–small network-based environmental sound classification method for aiot applications |
topic | Communication |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10422364/ https://www.ncbi.nlm.nih.gov/pubmed/37571550 http://dx.doi.org/10.3390/s23156767 |
work_keys_str_mv | AT penglujie bsnescabigsmallnetworkbasedenvironmentalsoundclassificationmethodforaiotapplications AT yangjunyu bsnescabigsmallnetworkbasedenvironmentalsoundclassificationmethodforaiotapplications AT yanlongke bsnescabigsmallnetworkbasedenvironmentalsoundclassificationmethodforaiotapplications AT chenzhiyi bsnescabigsmallnetworkbasedenvironmentalsoundclassificationmethodforaiotapplications AT xiaojianbiao bsnescabigsmallnetworkbasedenvironmentalsoundclassificationmethodforaiotapplications AT zhouliang bsnescabigsmallnetworkbasedenvironmentalsoundclassificationmethodforaiotapplications AT zhoujun bsnescabigsmallnetworkbasedenvironmentalsoundclassificationmethodforaiotapplications |