Cargando…

An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection

Biometrics-based authentication has become the most well-established form of user recognition in systems that demand a certain level of security. For example, the most commonplace social activities stand out, such as access to the work environment or to one’s own bank account. Among all biometrics,...

Descripción completa

Detalles Bibliográficos
Autores principales: Contreras , Rodrigo Colnago, Viana , Monique Simplicio, Fonseca , Everthon Silva, dos Santos, Francisco Lledo, Zanin , Rodrigo Bruno, Guido , Rodrigo Capobianco
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10256083/
https://www.ncbi.nlm.nih.gov/pubmed/37299922
http://dx.doi.org/10.3390/s23115196
_version_ 1785057028418306048
author Contreras , Rodrigo Colnago
Viana , Monique Simplicio
Fonseca , Everthon Silva
dos Santos, Francisco Lledo
Zanin , Rodrigo Bruno
Guido , Rodrigo Capobianco
author_facet Contreras , Rodrigo Colnago
Viana , Monique Simplicio
Fonseca , Everthon Silva
dos Santos, Francisco Lledo
Zanin , Rodrigo Bruno
Guido , Rodrigo Capobianco
author_sort Contreras , Rodrigo Colnago
collection PubMed
description Biometrics-based authentication has become the most well-established form of user recognition in systems that demand a certain level of security. For example, the most commonplace social activities stand out, such as access to the work environment or to one’s own bank account. Among all biometrics, voice receives special attention due to factors such as ease of collection, the low cost of reading devices, and the high quantity of literature and software packages available for use. However, these biometrics may have the ability to represent the individual impaired by the phenomenon known as dysphonia, which consists of a change in the sound signal due to some disease that acts on the vocal apparatus. As a consequence, for example, a user with the flu may not be properly authenticated by the recognition system. Therefore, it is important that automatic voice dysphonia detection techniques be developed. In this work, we propose a new framework based on the representation of the voice signal by the multiple projection of cepstral coefficients to promote the detection of dysphonic alterations in the voice through machine learning techniques. Most of the best-known cepstral coefficient extraction techniques in the literature are mapped and analyzed separately and together with measures related to the fundamental frequency of the voice signal, and its representation capacity is evaluated on three classifiers. Finally, the experiments on a subset of the Saarbruecken Voice Database prove the effectiveness of the proposed material in detecting the presence of dysphonia in the voice.
format Online
Article
Text
id pubmed-10256083
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-102560832023-06-10 An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection Contreras , Rodrigo Colnago Viana , Monique Simplicio Fonseca , Everthon Silva dos Santos, Francisco Lledo Zanin , Rodrigo Bruno Guido , Rodrigo Capobianco Sensors (Basel) Article Biometrics-based authentication has become the most well-established form of user recognition in systems that demand a certain level of security. For example, the most commonplace social activities stand out, such as access to the work environment or to one’s own bank account. Among all biometrics, voice receives special attention due to factors such as ease of collection, the low cost of reading devices, and the high quantity of literature and software packages available for use. However, these biometrics may have the ability to represent the individual impaired by the phenomenon known as dysphonia, which consists of a change in the sound signal due to some disease that acts on the vocal apparatus. As a consequence, for example, a user with the flu may not be properly authenticated by the recognition system. Therefore, it is important that automatic voice dysphonia detection techniques be developed. In this work, we propose a new framework based on the representation of the voice signal by the multiple projection of cepstral coefficients to promote the detection of dysphonic alterations in the voice through machine learning techniques. Most of the best-known cepstral coefficient extraction techniques in the literature are mapped and analyzed separately and together with measures related to the fundamental frequency of the voice signal, and its representation capacity is evaluated on three classifiers. Finally, the experiments on a subset of the Saarbruecken Voice Database prove the effectiveness of the proposed material in detecting the presence of dysphonia in the voice. MDPI 2023-05-30 /pmc/articles/PMC10256083/ /pubmed/37299922 http://dx.doi.org/10.3390/s23115196 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Contreras , Rodrigo Colnago
Viana , Monique Simplicio
Fonseca , Everthon Silva
dos Santos, Francisco Lledo
Zanin , Rodrigo Bruno
Guido , Rodrigo Capobianco
An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection
title An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection
title_full An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection
title_fullStr An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection
title_full_unstemmed An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection
title_short An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection
title_sort experimental analysis on multicepstral projection representation strategies for dysphonia detection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10256083/
https://www.ncbi.nlm.nih.gov/pubmed/37299922
http://dx.doi.org/10.3390/s23115196
work_keys_str_mv AT contrerasrodrigocolnago anexperimentalanalysisonmulticepstralprojectionrepresentationstrategiesfordysphoniadetection
AT vianamoniquesimplicio anexperimentalanalysisonmulticepstralprojectionrepresentationstrategiesfordysphoniadetection
AT fonsecaeverthonsilva anexperimentalanalysisonmulticepstralprojectionrepresentationstrategiesfordysphoniadetection
AT dossantosfranciscolledo anexperimentalanalysisonmulticepstralprojectionrepresentationstrategiesfordysphoniadetection
AT zaninrodrigobruno anexperimentalanalysisonmulticepstralprojectionrepresentationstrategiesfordysphoniadetection
AT guidorodrigocapobianco anexperimentalanalysisonmulticepstralprojectionrepresentationstrategiesfordysphoniadetection
AT contrerasrodrigocolnago experimentalanalysisonmulticepstralprojectionrepresentationstrategiesfordysphoniadetection
AT vianamoniquesimplicio experimentalanalysisonmulticepstralprojectionrepresentationstrategiesfordysphoniadetection
AT fonsecaeverthonsilva experimentalanalysisonmulticepstralprojectionrepresentationstrategiesfordysphoniadetection
AT dossantosfranciscolledo experimentalanalysisonmulticepstralprojectionrepresentationstrategiesfordysphoniadetection
AT zaninrodrigobruno experimentalanalysisonmulticepstralprojectionrepresentationstrategiesfordysphoniadetection
AT guidorodrigocapobianco experimentalanalysisonmulticepstralprojectionrepresentationstrategiesfordysphoniadetection