Cargando…

Employing Energy and Statistical Features for Automatic Diagnosis of Voice Disorders

The presence of laryngeal disease affects vocal fold(s) dynamics and thus causes changes in pitch, loudness, and other characteristics of the human voice. Many frameworks based on the acoustic analysis of speech signals have been created in recent years; however, they are evaluated on just one or tw...

Descripción completa

Detalles Bibliográficos
Autores principales: Shrivas, Avinash, Deshpande, Shrinivas, Gidaye, Girish, Nirmal, Jagannath, Ezzine, Kadria, Frikha, Mondher, Desai, Kamalakar, Shinde, Sachin, Oza, Ankit D., Burduhos-Nergis, Dumitru Doru, Burduhos-Nergis, Diana Petronela
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9689977/
https://www.ncbi.nlm.nih.gov/pubmed/36428819
http://dx.doi.org/10.3390/diagnostics12112758
_version_ 1784836670653202432
author Shrivas, Avinash
Deshpande, Shrinivas
Gidaye, Girish
Nirmal, Jagannath
Ezzine, Kadria
Frikha, Mondher
Desai, Kamalakar
Shinde, Sachin
Oza, Ankit D.
Burduhos-Nergis, Dumitru Doru
Burduhos-Nergis, Diana Petronela
author_facet Shrivas, Avinash
Deshpande, Shrinivas
Gidaye, Girish
Nirmal, Jagannath
Ezzine, Kadria
Frikha, Mondher
Desai, Kamalakar
Shinde, Sachin
Oza, Ankit D.
Burduhos-Nergis, Dumitru Doru
Burduhos-Nergis, Diana Petronela
author_sort Shrivas, Avinash
collection PubMed
description The presence of laryngeal disease affects vocal fold(s) dynamics and thus causes changes in pitch, loudness, and other characteristics of the human voice. Many frameworks based on the acoustic analysis of speech signals have been created in recent years; however, they are evaluated on just one or two corpora and are not independent to voice illnesses and human bias. In this article, a unified wavelet-based paradigm for evaluating voice diseases is presented. This approach is independent of voice diseases, human bias, or dialect. The vocal folds’ dynamics are impacted by the voice disorder, and this further modifies the sound source. Therefore, inverse filtering is used to capture the modified voice source. Furthermore, the fundamental frequency independent statistical and energy metrics are derived from each spectral sub-band to characterize the retrieved voice source. Speech recordings of the sustained vowel /a/ were collected from four different datasets in German, Spanish, English, and Arabic to run the several intra and inter-dataset experiments. The classifiers’ achieved performance indicators show that energy and statistical features uncover vital information on a variety of clinical voices, and therefore the suggested approach can be used as a complementary means for the automatic medical assessment of voice diseases.
format Online
Article
Text
id pubmed-9689977
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96899772022-11-25 Employing Energy and Statistical Features for Automatic Diagnosis of Voice Disorders Shrivas, Avinash Deshpande, Shrinivas Gidaye, Girish Nirmal, Jagannath Ezzine, Kadria Frikha, Mondher Desai, Kamalakar Shinde, Sachin Oza, Ankit D. Burduhos-Nergis, Dumitru Doru Burduhos-Nergis, Diana Petronela Diagnostics (Basel) Article The presence of laryngeal disease affects vocal fold(s) dynamics and thus causes changes in pitch, loudness, and other characteristics of the human voice. Many frameworks based on the acoustic analysis of speech signals have been created in recent years; however, they are evaluated on just one or two corpora and are not independent to voice illnesses and human bias. In this article, a unified wavelet-based paradigm for evaluating voice diseases is presented. This approach is independent of voice diseases, human bias, or dialect. The vocal folds’ dynamics are impacted by the voice disorder, and this further modifies the sound source. Therefore, inverse filtering is used to capture the modified voice source. Furthermore, the fundamental frequency independent statistical and energy metrics are derived from each spectral sub-band to characterize the retrieved voice source. Speech recordings of the sustained vowel /a/ were collected from four different datasets in German, Spanish, English, and Arabic to run the several intra and inter-dataset experiments. The classifiers’ achieved performance indicators show that energy and statistical features uncover vital information on a variety of clinical voices, and therefore the suggested approach can be used as a complementary means for the automatic medical assessment of voice diseases. MDPI 2022-11-11 /pmc/articles/PMC9689977/ /pubmed/36428819 http://dx.doi.org/10.3390/diagnostics12112758 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Shrivas, Avinash
Deshpande, Shrinivas
Gidaye, Girish
Nirmal, Jagannath
Ezzine, Kadria
Frikha, Mondher
Desai, Kamalakar
Shinde, Sachin
Oza, Ankit D.
Burduhos-Nergis, Dumitru Doru
Burduhos-Nergis, Diana Petronela
Employing Energy and Statistical Features for Automatic Diagnosis of Voice Disorders
title Employing Energy and Statistical Features for Automatic Diagnosis of Voice Disorders
title_full Employing Energy and Statistical Features for Automatic Diagnosis of Voice Disorders
title_fullStr Employing Energy and Statistical Features for Automatic Diagnosis of Voice Disorders
title_full_unstemmed Employing Energy and Statistical Features for Automatic Diagnosis of Voice Disorders
title_short Employing Energy and Statistical Features for Automatic Diagnosis of Voice Disorders
title_sort employing energy and statistical features for automatic diagnosis of voice disorders
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9689977/
https://www.ncbi.nlm.nih.gov/pubmed/36428819
http://dx.doi.org/10.3390/diagnostics12112758
work_keys_str_mv AT shrivasavinash employingenergyandstatisticalfeaturesforautomaticdiagnosisofvoicedisorders
AT deshpandeshrinivas employingenergyandstatisticalfeaturesforautomaticdiagnosisofvoicedisorders
AT gidayegirish employingenergyandstatisticalfeaturesforautomaticdiagnosisofvoicedisorders
AT nirmaljagannath employingenergyandstatisticalfeaturesforautomaticdiagnosisofvoicedisorders
AT ezzinekadria employingenergyandstatisticalfeaturesforautomaticdiagnosisofvoicedisorders
AT frikhamondher employingenergyandstatisticalfeaturesforautomaticdiagnosisofvoicedisorders
AT desaikamalakar employingenergyandstatisticalfeaturesforautomaticdiagnosisofvoicedisorders
AT shindesachin employingenergyandstatisticalfeaturesforautomaticdiagnosisofvoicedisorders
AT ozaankitd employingenergyandstatisticalfeaturesforautomaticdiagnosisofvoicedisorders
AT burduhosnergisdumitrudoru employingenergyandstatisticalfeaturesforautomaticdiagnosisofvoicedisorders
AT burduhosnergisdianapetronela employingenergyandstatisticalfeaturesforautomaticdiagnosisofvoicedisorders