Cargando…
Classification of Fricative Consonants for Speech Enhancement in Hearing Devices
OBJECTIVE: To investigate a set of acoustic features and classification methods for the classification of three groups of fricative consonants differing in place of articulation. METHOD: A support vector machine (SVM) algorithm was used to classify the fricatives extracted from the TIMIT database in...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3991644/ https://www.ncbi.nlm.nih.gov/pubmed/24747721 http://dx.doi.org/10.1371/journal.pone.0095001 |
_version_ | 1782312475577810944 |
---|---|
author | Kong, Ying-Yee Mullangi, Ala Kokkinakis, Kostas |
author_facet | Kong, Ying-Yee Mullangi, Ala Kokkinakis, Kostas |
author_sort | Kong, Ying-Yee |
collection | PubMed |
description | OBJECTIVE: To investigate a set of acoustic features and classification methods for the classification of three groups of fricative consonants differing in place of articulation. METHOD: A support vector machine (SVM) algorithm was used to classify the fricatives extracted from the TIMIT database in quiet and also in speech babble noise at various signal-to-noise ratios (SNRs). Spectral features including four spectral moments, peak, slope, Mel-frequency cepstral coefficients (MFCC), Gammatone filters outputs, and magnitudes of fast Fourier Transform (FFT) spectrum were used for the classification. The analysis frame was restricted to only 8 msec. In addition, commonly-used linear and nonlinear principal component analysis dimensionality reduction techniques that project a high-dimensional feature vector onto a lower dimensional space were examined. RESULTS: With 13 MFCC coefficients, 14 or 24 Gammatone filter outputs, classification performance was greater than or equal to 85% in quiet and at +10 dB SNR. Using 14 Gammatone filter outputs above 1 kHz, classification accuracy remained high (greater than 80%) for a wide range of SNRs from +20 to +5 dB SNR. CONCLUSIONS: High levels of classification accuracy for fricative consonants in quiet and in noise could be achieved using only spectral features extracted from a short time window. Results of this work have a direct impact on the development of speech enhancement algorithms for hearing devices. |
format | Online Article Text |
id | pubmed-3991644 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-39916442014-04-21 Classification of Fricative Consonants for Speech Enhancement in Hearing Devices Kong, Ying-Yee Mullangi, Ala Kokkinakis, Kostas PLoS One Research Article OBJECTIVE: To investigate a set of acoustic features and classification methods for the classification of three groups of fricative consonants differing in place of articulation. METHOD: A support vector machine (SVM) algorithm was used to classify the fricatives extracted from the TIMIT database in quiet and also in speech babble noise at various signal-to-noise ratios (SNRs). Spectral features including four spectral moments, peak, slope, Mel-frequency cepstral coefficients (MFCC), Gammatone filters outputs, and magnitudes of fast Fourier Transform (FFT) spectrum were used for the classification. The analysis frame was restricted to only 8 msec. In addition, commonly-used linear and nonlinear principal component analysis dimensionality reduction techniques that project a high-dimensional feature vector onto a lower dimensional space were examined. RESULTS: With 13 MFCC coefficients, 14 or 24 Gammatone filter outputs, classification performance was greater than or equal to 85% in quiet and at +10 dB SNR. Using 14 Gammatone filter outputs above 1 kHz, classification accuracy remained high (greater than 80%) for a wide range of SNRs from +20 to +5 dB SNR. CONCLUSIONS: High levels of classification accuracy for fricative consonants in quiet and in noise could be achieved using only spectral features extracted from a short time window. Results of this work have a direct impact on the development of speech enhancement algorithms for hearing devices. Public Library of Science 2014-04-18 /pmc/articles/PMC3991644/ /pubmed/24747721 http://dx.doi.org/10.1371/journal.pone.0095001 Text en © 2014 Kong et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Kong, Ying-Yee Mullangi, Ala Kokkinakis, Kostas Classification of Fricative Consonants for Speech Enhancement in Hearing Devices |
title | Classification of Fricative Consonants for Speech Enhancement in Hearing Devices |
title_full | Classification of Fricative Consonants for Speech Enhancement in Hearing Devices |
title_fullStr | Classification of Fricative Consonants for Speech Enhancement in Hearing Devices |
title_full_unstemmed | Classification of Fricative Consonants for Speech Enhancement in Hearing Devices |
title_short | Classification of Fricative Consonants for Speech Enhancement in Hearing Devices |
title_sort | classification of fricative consonants for speech enhancement in hearing devices |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3991644/ https://www.ncbi.nlm.nih.gov/pubmed/24747721 http://dx.doi.org/10.1371/journal.pone.0095001 |
work_keys_str_mv | AT kongyingyee classificationoffricativeconsonantsforspeechenhancementinhearingdevices AT mullangiala classificationoffricativeconsonantsforspeechenhancementinhearingdevices AT kokkinakiskostas classificationoffricativeconsonantsforspeechenhancementinhearingdevices |