Cargando…

Empirical Mode Decomposition-Based Feature Extraction for Environmental Sound Classification

In environment sound classification, log Mel band energies (MBEs) are considered as the most successful and commonly used features for classification. The underlying algorithm, fast Fourier transform (FFT), is valid under certain restrictions. In this study, we address these limitations of Fourier t...

Descripción completa

Detalles Bibliográficos
Autores principales: Ahmed, Ammar, Serrestou, Youssef, Raoof, Kosai, Diouris, Jean-François
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9612378/
https://www.ncbi.nlm.nih.gov/pubmed/36298067
http://dx.doi.org/10.3390/s22207717
_version_ 1784819760684335104
author Ahmed, Ammar
Serrestou, Youssef
Raoof, Kosai
Diouris, Jean-François
author_facet Ahmed, Ammar
Serrestou, Youssef
Raoof, Kosai
Diouris, Jean-François
author_sort Ahmed, Ammar
collection PubMed
description In environment sound classification, log Mel band energies (MBEs) are considered as the most successful and commonly used features for classification. The underlying algorithm, fast Fourier transform (FFT), is valid under certain restrictions. In this study, we address these limitations of Fourier transform and propose a new method to extract log Mel band energies using amplitude modulation and frequency modulation. We present a comparative study between traditionally used log Mel band energy features extracted by Fourier transform and log Mel band energy features extracted by our new approach. This approach is based on extracting log Mel band energies from estimation of instantaneous frequency (IF) and instantaneous amplitude (IA), which are used to construct a spectrogram. The estimation of IA and IF is made by associating empirical mode decomposition (EMD) with the Teager–Kaiser energy operator (TKEO) and the discrete energy separation algorithm. Later, Mel filter bank is applied to the estimated spectrogram to generate EMD-TKEO-based MBEs, or simply, EMD-MBEs. In addition, we employ the EMD method to remove signal trends from the original signal and generate another type of MBE, called S-MBEs, using FFT and a Mel filter bank. Four different datasets were utilised and convolutional neural networks (CNN) were trained using features extracted from Fourier transform-based MBEs (FFT-MBEs), EMD-MBEs, and S-MBEs. In addition, CNNs were trained with an aggregation of all three feature extraction techniques and a combination of FFT-MBEs and EMD-MBEs. Individually, FFT-MBEs achieved higher accuracy compared to EMD-MBEs and S-MBEs. In general, the system trained with the combination of all three features performed slightly better compared to the system trained with the three features separately.
format Online
Article
Text
id pubmed-9612378
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96123782022-10-28 Empirical Mode Decomposition-Based Feature Extraction for Environmental Sound Classification Ahmed, Ammar Serrestou, Youssef Raoof, Kosai Diouris, Jean-François Sensors (Basel) Article In environment sound classification, log Mel band energies (MBEs) are considered as the most successful and commonly used features for classification. The underlying algorithm, fast Fourier transform (FFT), is valid under certain restrictions. In this study, we address these limitations of Fourier transform and propose a new method to extract log Mel band energies using amplitude modulation and frequency modulation. We present a comparative study between traditionally used log Mel band energy features extracted by Fourier transform and log Mel band energy features extracted by our new approach. This approach is based on extracting log Mel band energies from estimation of instantaneous frequency (IF) and instantaneous amplitude (IA), which are used to construct a spectrogram. The estimation of IA and IF is made by associating empirical mode decomposition (EMD) with the Teager–Kaiser energy operator (TKEO) and the discrete energy separation algorithm. Later, Mel filter bank is applied to the estimated spectrogram to generate EMD-TKEO-based MBEs, or simply, EMD-MBEs. In addition, we employ the EMD method to remove signal trends from the original signal and generate another type of MBE, called S-MBEs, using FFT and a Mel filter bank. Four different datasets were utilised and convolutional neural networks (CNN) were trained using features extracted from Fourier transform-based MBEs (FFT-MBEs), EMD-MBEs, and S-MBEs. In addition, CNNs were trained with an aggregation of all three feature extraction techniques and a combination of FFT-MBEs and EMD-MBEs. Individually, FFT-MBEs achieved higher accuracy compared to EMD-MBEs and S-MBEs. In general, the system trained with the combination of all three features performed slightly better compared to the system trained with the three features separately. MDPI 2022-10-11 /pmc/articles/PMC9612378/ /pubmed/36298067 http://dx.doi.org/10.3390/s22207717 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Ahmed, Ammar
Serrestou, Youssef
Raoof, Kosai
Diouris, Jean-François
Empirical Mode Decomposition-Based Feature Extraction for Environmental Sound Classification
title Empirical Mode Decomposition-Based Feature Extraction for Environmental Sound Classification
title_full Empirical Mode Decomposition-Based Feature Extraction for Environmental Sound Classification
title_fullStr Empirical Mode Decomposition-Based Feature Extraction for Environmental Sound Classification
title_full_unstemmed Empirical Mode Decomposition-Based Feature Extraction for Environmental Sound Classification
title_short Empirical Mode Decomposition-Based Feature Extraction for Environmental Sound Classification
title_sort empirical mode decomposition-based feature extraction for environmental sound classification
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9612378/
https://www.ncbi.nlm.nih.gov/pubmed/36298067
http://dx.doi.org/10.3390/s22207717
work_keys_str_mv AT ahmedammar empiricalmodedecompositionbasedfeatureextractionforenvironmentalsoundclassification
AT serrestouyoussef empiricalmodedecompositionbasedfeatureextractionforenvironmentalsoundclassification
AT raoofkosai empiricalmodedecompositionbasedfeatureextractionforenvironmentalsoundclassification
AT diourisjeanfrancois empiricalmodedecompositionbasedfeatureextractionforenvironmentalsoundclassification