Cargando…

Binaural Acoustic Scene Classification Using Wavelet Scattering, Parallel Ensemble Classifiers and Nonlinear Fusion

The analysis of ambient sounds can be very useful when developing sound base intelligent systems. Acoustic scene classification (ASC) is defined as identifying the area of a recorded sound or clip among some predefined scenes. ASC has huge potential to be used in urban sound event classification sys...

Descripción completa

Detalles Bibliográficos
Autores principales: Hajihashemi, Vahid, Gharahbagh, Abdorreza Alavi, Cruz, Pedro Miguel, Ferreira, Marta Campos, Machado, José J. M., Tavares, João Manuel R. S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8878239/
https://www.ncbi.nlm.nih.gov/pubmed/35214436
http://dx.doi.org/10.3390/s22041535
_version_ 1784658614691037184
author Hajihashemi, Vahid
Gharahbagh, Abdorreza Alavi
Cruz, Pedro Miguel
Ferreira, Marta Campos
Machado, José J. M.
Tavares, João Manuel R. S.
author_facet Hajihashemi, Vahid
Gharahbagh, Abdorreza Alavi
Cruz, Pedro Miguel
Ferreira, Marta Campos
Machado, José J. M.
Tavares, João Manuel R. S.
author_sort Hajihashemi, Vahid
collection PubMed
description The analysis of ambient sounds can be very useful when developing sound base intelligent systems. Acoustic scene classification (ASC) is defined as identifying the area of a recorded sound or clip among some predefined scenes. ASC has huge potential to be used in urban sound event classification systems. This research presents a hybrid method that includes a novel mathematical fusion step which aims to tackle the challenges of ASC accuracy and adaptability of current state-of-the-art models. The proposed method uses a stereo signal, two ensemble classifiers (random subspace), and a novel mathematical fusion step. In the proposed method, a stable, invariant signal representation of the stereo signal is built using Wavelet Scattering Transform (WST). For each mono, i.e., left and right, channel, a different random subspace classifier is trained using WST. A novel mathematical formula for fusion step was developed, its parameters being found using a Genetic algorithm. The results on the DCASE 2017 dataset showed that the proposed method has higher classification accuracy (about 95%), pushing the boundaries of existing methods.
format Online
Article
Text
id pubmed-8878239
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-88782392022-02-26 Binaural Acoustic Scene Classification Using Wavelet Scattering, Parallel Ensemble Classifiers and Nonlinear Fusion Hajihashemi, Vahid Gharahbagh, Abdorreza Alavi Cruz, Pedro Miguel Ferreira, Marta Campos Machado, José J. M. Tavares, João Manuel R. S. Sensors (Basel) Article The analysis of ambient sounds can be very useful when developing sound base intelligent systems. Acoustic scene classification (ASC) is defined as identifying the area of a recorded sound or clip among some predefined scenes. ASC has huge potential to be used in urban sound event classification systems. This research presents a hybrid method that includes a novel mathematical fusion step which aims to tackle the challenges of ASC accuracy and adaptability of current state-of-the-art models. The proposed method uses a stereo signal, two ensemble classifiers (random subspace), and a novel mathematical fusion step. In the proposed method, a stable, invariant signal representation of the stereo signal is built using Wavelet Scattering Transform (WST). For each mono, i.e., left and right, channel, a different random subspace classifier is trained using WST. A novel mathematical formula for fusion step was developed, its parameters being found using a Genetic algorithm. The results on the DCASE 2017 dataset showed that the proposed method has higher classification accuracy (about 95%), pushing the boundaries of existing methods. MDPI 2022-02-16 /pmc/articles/PMC8878239/ /pubmed/35214436 http://dx.doi.org/10.3390/s22041535 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Hajihashemi, Vahid
Gharahbagh, Abdorreza Alavi
Cruz, Pedro Miguel
Ferreira, Marta Campos
Machado, José J. M.
Tavares, João Manuel R. S.
Binaural Acoustic Scene Classification Using Wavelet Scattering, Parallel Ensemble Classifiers and Nonlinear Fusion
title Binaural Acoustic Scene Classification Using Wavelet Scattering, Parallel Ensemble Classifiers and Nonlinear Fusion
title_full Binaural Acoustic Scene Classification Using Wavelet Scattering, Parallel Ensemble Classifiers and Nonlinear Fusion
title_fullStr Binaural Acoustic Scene Classification Using Wavelet Scattering, Parallel Ensemble Classifiers and Nonlinear Fusion
title_full_unstemmed Binaural Acoustic Scene Classification Using Wavelet Scattering, Parallel Ensemble Classifiers and Nonlinear Fusion
title_short Binaural Acoustic Scene Classification Using Wavelet Scattering, Parallel Ensemble Classifiers and Nonlinear Fusion
title_sort binaural acoustic scene classification using wavelet scattering, parallel ensemble classifiers and nonlinear fusion
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8878239/
https://www.ncbi.nlm.nih.gov/pubmed/35214436
http://dx.doi.org/10.3390/s22041535
work_keys_str_mv AT hajihashemivahid binauralacousticsceneclassificationusingwaveletscatteringparallelensembleclassifiersandnonlinearfusion
AT gharahbaghabdorrezaalavi binauralacousticsceneclassificationusingwaveletscatteringparallelensembleclassifiersandnonlinearfusion
AT cruzpedromiguel binauralacousticsceneclassificationusingwaveletscatteringparallelensembleclassifiersandnonlinearfusion
AT ferreiramartacampos binauralacousticsceneclassificationusingwaveletscatteringparallelensembleclassifiersandnonlinearfusion
AT machadojosejm binauralacousticsceneclassificationusingwaveletscatteringparallelensembleclassifiersandnonlinearfusion
AT tavaresjoaomanuelrs binauralacousticsceneclassificationusingwaveletscatteringparallelensembleclassifiersandnonlinearfusion