Cargando…

Classification of Fricative Consonants for Speech Enhancement in Hearing Devices

OBJECTIVE: To investigate a set of acoustic features and classification methods for the classification of three groups of fricative consonants differing in place of articulation. METHOD: A support vector machine (SVM) algorithm was used to classify the fricatives extracted from the TIMIT database in...

Descripción completa

Detalles Bibliográficos
Autores principales: Kong, Ying-Yee, Mullangi, Ala, Kokkinakis, Kostas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3991644/
https://www.ncbi.nlm.nih.gov/pubmed/24747721
http://dx.doi.org/10.1371/journal.pone.0095001
_version_ 1782312475577810944
author Kong, Ying-Yee
Mullangi, Ala
Kokkinakis, Kostas
author_facet Kong, Ying-Yee
Mullangi, Ala
Kokkinakis, Kostas
author_sort Kong, Ying-Yee
collection PubMed
description OBJECTIVE: To investigate a set of acoustic features and classification methods for the classification of three groups of fricative consonants differing in place of articulation. METHOD: A support vector machine (SVM) algorithm was used to classify the fricatives extracted from the TIMIT database in quiet and also in speech babble noise at various signal-to-noise ratios (SNRs). Spectral features including four spectral moments, peak, slope, Mel-frequency cepstral coefficients (MFCC), Gammatone filters outputs, and magnitudes of fast Fourier Transform (FFT) spectrum were used for the classification. The analysis frame was restricted to only 8 msec. In addition, commonly-used linear and nonlinear principal component analysis dimensionality reduction techniques that project a high-dimensional feature vector onto a lower dimensional space were examined. RESULTS: With 13 MFCC coefficients, 14 or 24 Gammatone filter outputs, classification performance was greater than or equal to 85% in quiet and at +10 dB SNR. Using 14 Gammatone filter outputs above 1 kHz, classification accuracy remained high (greater than 80%) for a wide range of SNRs from +20 to +5 dB SNR. CONCLUSIONS: High levels of classification accuracy for fricative consonants in quiet and in noise could be achieved using only spectral features extracted from a short time window. Results of this work have a direct impact on the development of speech enhancement algorithms for hearing devices.
format Online
Article
Text
id pubmed-3991644
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-39916442014-04-21 Classification of Fricative Consonants for Speech Enhancement in Hearing Devices Kong, Ying-Yee Mullangi, Ala Kokkinakis, Kostas PLoS One Research Article OBJECTIVE: To investigate a set of acoustic features and classification methods for the classification of three groups of fricative consonants differing in place of articulation. METHOD: A support vector machine (SVM) algorithm was used to classify the fricatives extracted from the TIMIT database in quiet and also in speech babble noise at various signal-to-noise ratios (SNRs). Spectral features including four spectral moments, peak, slope, Mel-frequency cepstral coefficients (MFCC), Gammatone filters outputs, and magnitudes of fast Fourier Transform (FFT) spectrum were used for the classification. The analysis frame was restricted to only 8 msec. In addition, commonly-used linear and nonlinear principal component analysis dimensionality reduction techniques that project a high-dimensional feature vector onto a lower dimensional space were examined. RESULTS: With 13 MFCC coefficients, 14 or 24 Gammatone filter outputs, classification performance was greater than or equal to 85% in quiet and at +10 dB SNR. Using 14 Gammatone filter outputs above 1 kHz, classification accuracy remained high (greater than 80%) for a wide range of SNRs from +20 to +5 dB SNR. CONCLUSIONS: High levels of classification accuracy for fricative consonants in quiet and in noise could be achieved using only spectral features extracted from a short time window. Results of this work have a direct impact on the development of speech enhancement algorithms for hearing devices. Public Library of Science 2014-04-18 /pmc/articles/PMC3991644/ /pubmed/24747721 http://dx.doi.org/10.1371/journal.pone.0095001 Text en © 2014 Kong et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Kong, Ying-Yee
Mullangi, Ala
Kokkinakis, Kostas
Classification of Fricative Consonants for Speech Enhancement in Hearing Devices
title Classification of Fricative Consonants for Speech Enhancement in Hearing Devices
title_full Classification of Fricative Consonants for Speech Enhancement in Hearing Devices
title_fullStr Classification of Fricative Consonants for Speech Enhancement in Hearing Devices
title_full_unstemmed Classification of Fricative Consonants for Speech Enhancement in Hearing Devices
title_short Classification of Fricative Consonants for Speech Enhancement in Hearing Devices
title_sort classification of fricative consonants for speech enhancement in hearing devices
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3991644/
https://www.ncbi.nlm.nih.gov/pubmed/24747721
http://dx.doi.org/10.1371/journal.pone.0095001
work_keys_str_mv AT kongyingyee classificationoffricativeconsonantsforspeechenhancementinhearingdevices
AT mullangiala classificationoffricativeconsonantsforspeechenhancementinhearingdevices
AT kokkinakiskostas classificationoffricativeconsonantsforspeechenhancementinhearingdevices