Cargando…

Neurogenerative Disease Diagnosis in Cepstral Domain Using MFCC with Deep Learning

Because underlying cognitive and neuromuscular activities regulate speech signals, biomarkers in the human voice can provide insight into neurological illnesses. Multiple motor and nonmotor aspects of neurologic voice disorders arise from an underlying neurologic condition such as Parkinson's d...

Descripción completa

Detalles Bibliográficos
Autores principales: Alghamdi, Norah Saleh, Zakariah, Mohammed, Hoang, Vinh Truong, Elahi, Mohammad Mamun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9001083/
https://www.ncbi.nlm.nih.gov/pubmed/35419079
http://dx.doi.org/10.1155/2022/4364186
_version_ 1784685588252721152
author Alghamdi, Norah Saleh
Zakariah, Mohammed
Hoang, Vinh Truong
Elahi, Mohammad Mamun
author_facet Alghamdi, Norah Saleh
Zakariah, Mohammed
Hoang, Vinh Truong
Elahi, Mohammad Mamun
author_sort Alghamdi, Norah Saleh
collection PubMed
description Because underlying cognitive and neuromuscular activities regulate speech signals, biomarkers in the human voice can provide insight into neurological illnesses. Multiple motor and nonmotor aspects of neurologic voice disorders arise from an underlying neurologic condition such as Parkinson's disease, multiple sclerosis, myasthenia gravis, or ALS. Voice problems can be caused by disorders that affect the corticospinal system, cerebellum, basal ganglia, and upper or lower motoneurons. According to a new study, voice pathology detection technologies can successfully aid in the assessment of voice irregularities and enable the early diagnosis of voice pathology. In this paper, we offer two deep-learning-based computational models, 1-dimensional convolutional neural network (1D CNN) and 2-dimensional convolutional neural network (2D CNN), that simultaneously detect voice pathologies caused by neurological illnesses or other causes. From the German corpus Saarbruecken Voice Database (SVD), we used voice recordings of sustained vowel /a/ generated at normal pitch. The collected voice signals are padded and segmented to maintain homogeneity and increase the number of samples. Convolutional layers are applied to raw data, and MFCC features are extracted in this project. Although the 1D CNN had the maximum accuracy of 93.11% on test data, model training produced overfitting and 2D CNN, which generalized the data better and had lower train and validation loss despite having an accuracy of 84.17% on test data. Also, 2D CNN outperforms state-of-the-art studies in the field, implying that a model trained on handcrafted features is better for speech processing than a model that extracts features directly.
format Online
Article
Text
id pubmed-9001083
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-90010832022-04-12 Neurogenerative Disease Diagnosis in Cepstral Domain Using MFCC with Deep Learning Alghamdi, Norah Saleh Zakariah, Mohammed Hoang, Vinh Truong Elahi, Mohammad Mamun Comput Math Methods Med Research Article Because underlying cognitive and neuromuscular activities regulate speech signals, biomarkers in the human voice can provide insight into neurological illnesses. Multiple motor and nonmotor aspects of neurologic voice disorders arise from an underlying neurologic condition such as Parkinson's disease, multiple sclerosis, myasthenia gravis, or ALS. Voice problems can be caused by disorders that affect the corticospinal system, cerebellum, basal ganglia, and upper or lower motoneurons. According to a new study, voice pathology detection technologies can successfully aid in the assessment of voice irregularities and enable the early diagnosis of voice pathology. In this paper, we offer two deep-learning-based computational models, 1-dimensional convolutional neural network (1D CNN) and 2-dimensional convolutional neural network (2D CNN), that simultaneously detect voice pathologies caused by neurological illnesses or other causes. From the German corpus Saarbruecken Voice Database (SVD), we used voice recordings of sustained vowel /a/ generated at normal pitch. The collected voice signals are padded and segmented to maintain homogeneity and increase the number of samples. Convolutional layers are applied to raw data, and MFCC features are extracted in this project. Although the 1D CNN had the maximum accuracy of 93.11% on test data, model training produced overfitting and 2D CNN, which generalized the data better and had lower train and validation loss despite having an accuracy of 84.17% on test data. Also, 2D CNN outperforms state-of-the-art studies in the field, implying that a model trained on handcrafted features is better for speech processing than a model that extracts features directly. Hindawi 2022-04-04 /pmc/articles/PMC9001083/ /pubmed/35419079 http://dx.doi.org/10.1155/2022/4364186 Text en Copyright © 2022 Norah Saleh Alghamdi et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Alghamdi, Norah Saleh
Zakariah, Mohammed
Hoang, Vinh Truong
Elahi, Mohammad Mamun
Neurogenerative Disease Diagnosis in Cepstral Domain Using MFCC with Deep Learning
title Neurogenerative Disease Diagnosis in Cepstral Domain Using MFCC with Deep Learning
title_full Neurogenerative Disease Diagnosis in Cepstral Domain Using MFCC with Deep Learning
title_fullStr Neurogenerative Disease Diagnosis in Cepstral Domain Using MFCC with Deep Learning
title_full_unstemmed Neurogenerative Disease Diagnosis in Cepstral Domain Using MFCC with Deep Learning
title_short Neurogenerative Disease Diagnosis in Cepstral Domain Using MFCC with Deep Learning
title_sort neurogenerative disease diagnosis in cepstral domain using mfcc with deep learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9001083/
https://www.ncbi.nlm.nih.gov/pubmed/35419079
http://dx.doi.org/10.1155/2022/4364186
work_keys_str_mv AT alghamdinorahsaleh neurogenerativediseasediagnosisincepstraldomainusingmfccwithdeeplearning
AT zakariahmohammed neurogenerativediseasediagnosisincepstraldomainusingmfccwithdeeplearning
AT hoangvinhtruong neurogenerativediseasediagnosisincepstraldomainusingmfccwithdeeplearning
AT elahimohammadmamun neurogenerativediseasediagnosisincepstraldomainusingmfccwithdeeplearning