Efficient Noisy Sound-Event Mixture Classification Using Adaptive-Sparse Complex-Valued Matrix Factorization and OvsO SVM
This paper proposes a solution for sound-event classification from a single noisy mixture that consists of two major steps: sound-event separation and sound-event classification. The traditional complex nonnegative matrix factorization (CMF) is extended by incorporating the optimal adaptive L1 spar...
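
The separation step summarized above rests on a sparsity-regularized matrix factorization. As a rough illustration only, the sketch below factorizes a nonnegative magnitude spectrogram with a fixed L1 penalty on the activations, using standard multiplicative updates. It is not the adaptive-sparsity complex-valued algorithm of the paper: it ignores phase, and the function names `sparse_nmf` and `objective`, the constant `sparsity` weight, and the iteration count are all assumptions made for this example.

```python
import numpy as np


def sparse_nmf(V, n_components, sparsity=0.1, n_iter=200, seed=0):
    """Approximate V (freq x frames, nonnegative) as W @ H with an L1 penalty on H.

    Simplified real-valued stand-in, NOT the paper's adaptive L1 CMF:
    the sparsity weight is a fixed scalar and phase is not modeled.
    """
    rng = np.random.default_rng(seed)
    n_freq, n_frames = V.shape
    W = rng.random((n_freq, n_components)) + 1e-3   # spectral bases
    H = rng.random((n_components, n_frames)) + 1e-3  # temporal activations
    eps = 1e-12  # guards against division by zero
    for _ in range(n_iter):
        # The L1 weight sits in the denominator, shrinking activations toward zero.
        H *= (W.T @ V) / (W.T @ W @ H + sparsity + eps)
        # Plain least-squares multiplicative update for the bases.
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H


def objective(V, W, H, sparsity):
    """Cost minimized above: squared reconstruction error plus the L1 term on H."""
    return 0.5 * np.sum((V - W @ H) ** 2) + sparsity * np.sum(H)
```

Calling `sparse_nmf(V, 3)` on a magnitude spectrogram `V` returns spectral bases `W` and activations `H`; raising `sparsity` trades reconstruction accuracy for sparser activations. An adaptive scheme, as the abstract describes, would adjust that weight during the decomposition rather than fixing it in advance.
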
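Main Authors: | Parathai, Phetcharat; Tengtrairat, Naruephorn; Woo, Wai Lok; Abdullah, Mohammed A. M.; Rafiee, Gholamreza; Alshabrawy, Ossama |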
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2020 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7472059/ https://www.ncbi.nlm.nih.gov/pubmed/32764362 http://dx.doi.org/10.3390/s20164368 |
_version_ | 1783578901848522752 |
---|---|
author | Parathai, Phetcharat Tengtrairat, Naruephorn Woo, Wai Lok Abdullah, Mohammed A. M. Rafiee, Gholamreza Alshabrawy, Ossama |
author_facet | Parathai, Phetcharat Tengtrairat, Naruephorn Woo, Wai Lok Abdullah, Mohammed A. M. Rafiee, Gholamreza Alshabrawy, Ossama |
author_sort | Parathai, Phetcharat |
collection | PubMed |
description | This paper proposes a solution for sound-event classification from a single noisy mixture that consists of two major steps: sound-event separation and sound-event classification. The traditional complex nonnegative matrix factorization (CMF) is extended by incorporating optimal adaptive L1 sparsity to decompose a noisy single-channel mixture. The proposed adaptive L1 sparsity CMF algorithm encodes the spectral patterns and estimates the phase of the original signals in the time-frequency representation. These features efficiently enhance the temporal decomposition process. A support vector machine (SVM) with a one-versus-one (OvsO) strategy is applied with a mean supervector to assign each demixed sound to its matching sound-event class. The first step of the multi-class SVM (MSVM) method segments the separated signal into blocks by sliding along the demixed signal and then encodes three features of each block. Mel-frequency cepstral coefficients, short-time energy, and short-time zero-crossing rate are learned for multiple sound-event classes by the SVM-based OvsO method. The mean supervector is encoded from the obtained features. The proposed method was evaluated in both separation and classification scenarios using real-world single-channel recorded signals and compared with state-of-the-art separation methods. Experimental results confirmed that the proposed method outperformed the state-of-the-art methods. |
format | Online Article Text |
id | pubmed-7472059 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-74720592020-09-04 Efficient Noisy Sound-Event Mixture Classification Using Adaptive-Sparse Complex-Valued Matrix Factorization and OvsO SVM Parathai, Phetcharat Tengtrairat, Naruephorn Woo, Wai Lok Abdullah, Mohammed A. M. Rafiee, Gholamreza Alshabrawy, Ossama Sensors (Basel) Article This paper proposes a solution for sound-event classification from a single noisy mixture that consists of two major steps: sound-event separation and sound-event classification. The traditional complex nonnegative matrix factorization (CMF) is extended by incorporating optimal adaptive L1 sparsity to decompose a noisy single-channel mixture. The proposed adaptive L1 sparsity CMF algorithm encodes the spectral patterns and estimates the phase of the original signals in the time-frequency representation. These features efficiently enhance the temporal decomposition process. A support vector machine (SVM) with a one-versus-one (OvsO) strategy is applied with a mean supervector to assign each demixed sound to its matching sound-event class. The first step of the multi-class SVM (MSVM) method segments the separated signal into blocks by sliding along the demixed signal and then encodes three features of each block. Mel-frequency cepstral coefficients, short-time energy, and short-time zero-crossing rate are learned for multiple sound-event classes by the SVM-based OvsO method. The mean supervector is encoded from the obtained features. The proposed method was evaluated in both separation and classification scenarios using real-world single-channel recorded signals and compared with state-of-the-art separation methods. Experimental results confirmed that the proposed method outperformed the state-of-the-art methods. MDPI 2020-08-05 /pmc/articles/PMC7472059/ /pubmed/32764362 http://dx.doi.org/10.3390/s20164368 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. 
This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Parathai, Phetcharat Tengtrairat, Naruephorn Woo, Wai Lok Abdullah, Mohammed A. M. Rafiee, Gholamreza Alshabrawy, Ossama Efficient Noisy Sound-Event Mixture Classification Using Adaptive-Sparse Complex-Valued Matrix Factorization and OvsO SVM |
title | Efficient Noisy Sound-Event Mixture Classification Using Adaptive-Sparse Complex-Valued Matrix Factorization and OvsO SVM |
title_full | Efficient Noisy Sound-Event Mixture Classification Using Adaptive-Sparse Complex-Valued Matrix Factorization and OvsO SVM |
title_fullStr | Efficient Noisy Sound-Event Mixture Classification Using Adaptive-Sparse Complex-Valued Matrix Factorization and OvsO SVM |
title_full_unstemmed | Efficient Noisy Sound-Event Mixture Classification Using Adaptive-Sparse Complex-Valued Matrix Factorization and OvsO SVM |
title_short | Efficient Noisy Sound-Event Mixture Classification Using Adaptive-Sparse Complex-Valued Matrix Factorization and OvsO SVM |
title_sort | efficient noisy sound-event mixture classification using adaptive-sparse complex-valued matrix factorization and ovso svm |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7472059/ https://www.ncbi.nlm.nih.gov/pubmed/32764362 http://dx.doi.org/10.3390/s20164368 |
work_keys_str_mv | AT parathaiphetcharat efficientnoisysoundeventmixtureclassificationusingadaptivesparsecomplexvaluedmatrixfactorizationandovsosvm AT tengtrairatnaruephorn efficientnoisysoundeventmixtureclassificationusingadaptivesparsecomplexvaluedmatrixfactorizationandovsosvm AT woowailok efficientnoisysoundeventmixtureclassificationusingadaptivesparsecomplexvaluedmatrixfactorizationandovsosvm AT abdullahmohammedam efficientnoisysoundeventmixtureclassificationusingadaptivesparsecomplexvaluedmatrixfactorizationandovsosvm AT rafieegholamreza efficientnoisysoundeventmixtureclassificationusingadaptivesparsecomplexvaluedmatrixfactorizationandovsosvm AT alshabrawyossama efficientnoisysoundeventmixtureclassificationusingadaptivesparsecomplexvaluedmatrixfactorizationandovsosvm |
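
The classification stage described in this record (block-wise features, a mean supervector, and one-versus-one voting) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it computes only short-time energy and zero-crossing rate (MFCCs are omitted), a nearest-centroid rule stands in for each pairwise SVM, and every function name, block length, and hop size here is hypothetical.

```python
from itertools import combinations

import numpy as np


def block_features(signal, block_len, hop):
    """Slide over the signal; return one row per block: [energy, zero-crossing rate]."""
    rows = []
    for start in range(0, len(signal) - block_len + 1, hop):
        block = signal[start:start + block_len]
        energy = float(np.mean(block ** 2))  # short-time energy
        # Fraction of adjacent sample pairs whose signs differ.
        zcr = float(np.mean(np.signbit(block[1:]) != np.signbit(block[:-1])))
        rows.append([energy, zcr])
    return np.array(rows)


def mean_supervector(features):
    """Average the block features into one fixed-length vector per signal."""
    return features.mean(axis=0)


def fit_pairwise_centroids(train):
    """Build one binary classifier per class pair (one-versus-one).

    `train` maps class name -> list of supervectors. A nearest-centroid rule
    is used here as a stand-in for the pairwise SVMs named in the abstract.
    """
    centroids = {c: np.mean(v, axis=0) for c, v in train.items()}

    def make(a, b):
        def classify(x):
            da = np.linalg.norm(x - centroids[a])
            db = np.linalg.norm(x - centroids[b])
            return a if da <= db else b
        return classify

    return {(a, b): make(a, b) for a, b in combinations(sorted(centroids), 2)}


def ovo_predict(x, pairwise):
    """Let every pairwise classifier vote; return the class with the most votes."""
    votes = {}
    for clf in pairwise.values():
        winner = clf(x)
        votes[winner] = votes.get(winner, 0) + 1
    return max(votes, key=votes.get)
```

With K classes, one-versus-one trains K(K-1)/2 binary classifiers, so three sound-event classes need three pairwise models. In practice each pairwise rule would be a trained SVM and the supervector would also include the MFCC statistics mentioned in the abstract.
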