Cargando…
Neurogenerative Disease Diagnosis in Cepstral Domain Using MFCC with Deep Learning
Because underlying cognitive and neuromuscular activities regulate speech signals, biomarkers in the human voice can provide insight into neurological illnesses. Multiple motor and nonmotor aspects of neurologic voice disorders arise from an underlying neurologic condition such as Parkinson's d...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9001083/ https://www.ncbi.nlm.nih.gov/pubmed/35419079 http://dx.doi.org/10.1155/2022/4364186 |
_version_ | 1784685588252721152 |
---|---|
author | Alghamdi, Norah Saleh Zakariah, Mohammed Hoang, Vinh Truong Elahi, Mohammad Mamun |
author_facet | Alghamdi, Norah Saleh Zakariah, Mohammed Hoang, Vinh Truong Elahi, Mohammad Mamun |
author_sort | Alghamdi, Norah Saleh |
collection | PubMed |
description | Because underlying cognitive and neuromuscular activities regulate speech signals, biomarkers in the human voice can provide insight into neurological illnesses. Multiple motor and nonmotor aspects of neurologic voice disorders arise from an underlying neurologic condition such as Parkinson's disease, multiple sclerosis, myasthenia gravis, or ALS. Voice problems can be caused by disorders that affect the corticospinal system, cerebellum, basal ganglia, and upper or lower motoneurons. According to a new study, voice pathology detection technologies can successfully aid in the assessment of voice irregularities and enable the early diagnosis of voice pathology. In this paper, we offer two deep-learning-based computational models, 1-dimensional convolutional neural network (1D CNN) and 2-dimensional convolutional neural network (2D CNN), that simultaneously detect voice pathologies caused by neurological illnesses or other causes. From the German corpus Saarbruecken Voice Database (SVD), we used voice recordings of sustained vowel /a/ generated at normal pitch. The collected voice signals are padded and segmented to maintain homogeneity and increase the number of samples. Convolutional layers are applied to raw data, and MFCC features are extracted in this project. Although the 1D CNN had the maximum accuracy of 93.11% on test data, model training produced overfitting and 2D CNN, which generalized the data better and had lower train and validation loss despite having an accuracy of 84.17% on test data. Also, 2D CNN outperforms state-of-the-art studies in the field, implying that a model trained on handcrafted features is better for speech processing than a model that extracts features directly. |
format | Online Article Text |
id | pubmed-9001083 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-90010832022-04-12 Neurogenerative Disease Diagnosis in Cepstral Domain Using MFCC with Deep Learning Alghamdi, Norah Saleh Zakariah, Mohammed Hoang, Vinh Truong Elahi, Mohammad Mamun Comput Math Methods Med Research Article Because underlying cognitive and neuromuscular activities regulate speech signals, biomarkers in the human voice can provide insight into neurological illnesses. Multiple motor and nonmotor aspects of neurologic voice disorders arise from an underlying neurologic condition such as Parkinson's disease, multiple sclerosis, myasthenia gravis, or ALS. Voice problems can be caused by disorders that affect the corticospinal system, cerebellum, basal ganglia, and upper or lower motoneurons. According to a new study, voice pathology detection technologies can successfully aid in the assessment of voice irregularities and enable the early diagnosis of voice pathology. In this paper, we offer two deep-learning-based computational models, 1-dimensional convolutional neural network (1D CNN) and 2-dimensional convolutional neural network (2D CNN), that simultaneously detect voice pathologies caused by neurological illnesses or other causes. From the German corpus Saarbruecken Voice Database (SVD), we used voice recordings of sustained vowel /a/ generated at normal pitch. The collected voice signals are padded and segmented to maintain homogeneity and increase the number of samples. Convolutional layers are applied to raw data, and MFCC features are extracted in this project. Although the 1D CNN had the maximum accuracy of 93.11% on test data, model training produced overfitting and 2D CNN, which generalized the data better and had lower train and validation loss despite having an accuracy of 84.17% on test data. Also, 2D CNN outperforms state-of-the-art studies in the field, implying that a model trained on handcrafted features is better for speech processing than a model that extracts features directly. Hindawi 2022-04-04 /pmc/articles/PMC9001083/ /pubmed/35419079 http://dx.doi.org/10.1155/2022/4364186 Text en Copyright © 2022 Norah Saleh Alghamdi et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Alghamdi, Norah Saleh Zakariah, Mohammed Hoang, Vinh Truong Elahi, Mohammad Mamun Neurogenerative Disease Diagnosis in Cepstral Domain Using MFCC with Deep Learning |
title | Neurogenerative Disease Diagnosis in Cepstral Domain Using MFCC with Deep Learning |
title_full | Neurogenerative Disease Diagnosis in Cepstral Domain Using MFCC with Deep Learning |
title_fullStr | Neurogenerative Disease Diagnosis in Cepstral Domain Using MFCC with Deep Learning |
title_full_unstemmed | Neurogenerative Disease Diagnosis in Cepstral Domain Using MFCC with Deep Learning |
title_short | Neurogenerative Disease Diagnosis in Cepstral Domain Using MFCC with Deep Learning |
title_sort | neurogenerative disease diagnosis in cepstral domain using mfcc with deep learning |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9001083/ https://www.ncbi.nlm.nih.gov/pubmed/35419079 http://dx.doi.org/10.1155/2022/4364186 |
work_keys_str_mv | AT alghamdinorahsaleh neurogenerativediseasediagnosisincepstraldomainusingmfccwithdeeplearning AT zakariahmohammed neurogenerativediseasediagnosisincepstraldomainusingmfccwithdeeplearning AT hoangvinhtruong neurogenerativediseasediagnosisincepstraldomainusingmfccwithdeeplearning AT elahimohammadmamun neurogenerativediseasediagnosisincepstraldomainusingmfccwithdeeplearning |