Cargando…

Voice Disorder Classification Based on Multitaper Mel Frequency Cepstral Coefficients Features

The Mel Frequency Cepstral Coefficients (MFCCs) are widely used in order to extract essential information from a voice signal and became a popular feature extractor used in audio processing. However, MFCC features are usually calculated from a single window (taper) characterized by large variance. T...

Descripción completa

Detalles Bibliográficos
Autores principales: Eskidere, Ömer, Gürhanlı, Ahmet
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4670637/
https://www.ncbi.nlm.nih.gov/pubmed/26681977
http://dx.doi.org/10.1155/2015/956249
_version_ 1782404281302777856
author Eskidere, Ömer
Gürhanlı, Ahmet
author_facet Eskidere, Ömer
Gürhanlı, Ahmet
author_sort Eskidere, Ömer
collection PubMed
description The Mel Frequency Cepstral Coefficients (MFCCs) are widely used in order to extract essential information from a voice signal and became a popular feature extractor used in audio processing. However, MFCC features are usually calculated from a single window (taper) characterized by large variance. This study shows investigations on reducing variance for the classification of two different voice qualities (normal voice and disordered voice) using multitaper MFCC features. We also compare their performance by newly proposed windowing techniques and conventional single-taper technique. The results demonstrate that adapted weighted Thomson multitaper method could distinguish between normal voice and disordered voice better than the results done by the conventional single-taper (Hamming window) technique and two newly proposed windowing methods. The multitaper MFCC features may be helpful in identifying voices at risk for a real pathology that has to be proven later.
format Online
Article
Text
id pubmed-4670637
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-46706372015-12-17 Voice Disorder Classification Based on Multitaper Mel Frequency Cepstral Coefficients Features Eskidere, Ömer Gürhanlı, Ahmet Comput Math Methods Med Research Article The Mel Frequency Cepstral Coefficients (MFCCs) are widely used in order to extract essential information from a voice signal and became a popular feature extractor used in audio processing. However, MFCC features are usually calculated from a single window (taper) characterized by large variance. This study shows investigations on reducing variance for the classification of two different voice qualities (normal voice and disordered voice) using multitaper MFCC features. We also compare their performance by newly proposed windowing techniques and conventional single-taper technique. The results demonstrate that adapted weighted Thomson multitaper method could distinguish between normal voice and disordered voice better than the results done by the conventional single-taper (Hamming window) technique and two newly proposed windowing methods. The multitaper MFCC features may be helpful in identifying voices at risk for a real pathology that has to be proven later. Hindawi Publishing Corporation 2015 2015-11-22 /pmc/articles/PMC4670637/ /pubmed/26681977 http://dx.doi.org/10.1155/2015/956249 Text en Copyright © 2015 Ö. Eskidere and A. Gürhanlı. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Eskidere, Ömer
Gürhanlı, Ahmet
Voice Disorder Classification Based on Multitaper Mel Frequency Cepstral Coefficients Features
title Voice Disorder Classification Based on Multitaper Mel Frequency Cepstral Coefficients Features
title_full Voice Disorder Classification Based on Multitaper Mel Frequency Cepstral Coefficients Features
title_fullStr Voice Disorder Classification Based on Multitaper Mel Frequency Cepstral Coefficients Features
title_full_unstemmed Voice Disorder Classification Based on Multitaper Mel Frequency Cepstral Coefficients Features
title_short Voice Disorder Classification Based on Multitaper Mel Frequency Cepstral Coefficients Features
title_sort voice disorder classification based on multitaper mel frequency cepstral coefficients features
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4670637/
https://www.ncbi.nlm.nih.gov/pubmed/26681977
http://dx.doi.org/10.1155/2015/956249
work_keys_str_mv AT eskidereomer voicedisorderclassificationbasedonmultitapermelfrequencycepstralcoefficientsfeatures
AT gurhanlıahmet voicedisorderclassificationbasedonmultitapermelfrequencycepstralcoefficientsfeatures