Cargando…

Lightweight Deep Learning Model for Assessment of Substitution Voicing and Speech after Laryngeal Carcinoma Surgery

SIMPLE SUMMARY: A total laryngectomy involves the full and permanent separation of the upper and lower airways, resulting in the loss of voice and inability to interact vocally. To identify, extract, and evaluate replacement voicing following laryngeal oncosurgery, we propose employing convolutional...

Descripción completa

Detalles Bibliográficos
Autores principales: Maskeliūnas, Rytis, Kulikajevas, Audrius, Damaševičius, Robertas, Pribuišis, Kipras, Ulozaitė-Stanienė, Nora, Uloza, Virgilijus
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9139213/
https://www.ncbi.nlm.nih.gov/pubmed/35625971
http://dx.doi.org/10.3390/cancers14102366
_version_ 1784714806486368256
author Maskeliūnas, Rytis
Kulikajevas, Audrius
Damaševičius, Robertas
Pribuišis, Kipras
Ulozaitė-Stanienė, Nora
Uloza, Virgilijus
author_facet Maskeliūnas, Rytis
Kulikajevas, Audrius
Damaševičius, Robertas
Pribuišis, Kipras
Ulozaitė-Stanienė, Nora
Uloza, Virgilijus
author_sort Maskeliūnas, Rytis
collection PubMed
description SIMPLE SUMMARY: A total laryngectomy involves the full and permanent separation of the upper and lower airways, resulting in the loss of voice and inability to interact vocally. To identify, extract, and evaluate replacement voicing following laryngeal oncosurgery, we propose employing convolutional neural networks for categorization of speech representations (spectrograms). With an overall accuracy of 89.47 percent, our technique has the greatest true-positive rate of any of the tested state-of-the-art methodologies. ABSTRACT: Laryngeal carcinoma is the most common malignant tumor of the upper respiratory tract. Total laryngectomy provides complete and permanent detachment of the upper and lower airways that causes the loss of voice, leading to a patient’s inability to verbally communicate in the postoperative period. This paper aims to exploit modern areas of deep learning research to objectively classify, extract and measure the substitution voicing after laryngeal oncosurgery from the audio signal. We propose using well-known convolutional neural networks (CNNs) applied for image classification for the analysis of voice audio signal. Our approach takes an input of Mel-frequency spectrogram (MFCC) as an input of deep neural network architecture. A database of digital speech recordings of 367 male subjects (279 normal speech samples and 88 pathological speech samples) was used. Our approach has shown the best true-positive rate of any of the compared state-of-the-art approaches, achieving an overall accuracy of 89.47%.
format Online
Article
Text
id pubmed-9139213
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-91392132022-05-28 Lightweight Deep Learning Model for Assessment of Substitution Voicing and Speech after Laryngeal Carcinoma Surgery Maskeliūnas, Rytis Kulikajevas, Audrius Damaševičius, Robertas Pribuišis, Kipras Ulozaitė-Stanienė, Nora Uloza, Virgilijus Cancers (Basel) Article SIMPLE SUMMARY: A total laryngectomy involves the full and permanent separation of the upper and lower airways, resulting in the loss of voice and inability to interact vocally. To identify, extract, and evaluate replacement voicing following laryngeal oncosurgery, we propose employing convolutional neural networks for categorization of speech representations (spectrograms). With an overall accuracy of 89.47 percent, our technique has the greatest true-positive rate of any of the tested state-of-the-art methodologies. ABSTRACT: Laryngeal carcinoma is the most common malignant tumor of the upper respiratory tract. Total laryngectomy provides complete and permanent detachment of the upper and lower airways that causes the loss of voice, leading to a patient’s inability to verbally communicate in the postoperative period. This paper aims to exploit modern areas of deep learning research to objectively classify, extract and measure the substitution voicing after laryngeal oncosurgery from the audio signal. We propose using well-known convolutional neural networks (CNNs) applied for image classification for the analysis of voice audio signal. Our approach takes an input of Mel-frequency spectrogram (MFCC) as an input of deep neural network architecture. A database of digital speech recordings of 367 male subjects (279 normal speech samples and 88 pathological speech samples) was used. Our approach has shown the best true-positive rate of any of the compared state-of-the-art approaches, achieving an overall accuracy of 89.47%. MDPI 2022-05-11 /pmc/articles/PMC9139213/ /pubmed/35625971 http://dx.doi.org/10.3390/cancers14102366 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Maskeliūnas, Rytis
Kulikajevas, Audrius
Damaševičius, Robertas
Pribuišis, Kipras
Ulozaitė-Stanienė, Nora
Uloza, Virgilijus
Lightweight Deep Learning Model for Assessment of Substitution Voicing and Speech after Laryngeal Carcinoma Surgery
title Lightweight Deep Learning Model for Assessment of Substitution Voicing and Speech after Laryngeal Carcinoma Surgery
title_full Lightweight Deep Learning Model for Assessment of Substitution Voicing and Speech after Laryngeal Carcinoma Surgery
title_fullStr Lightweight Deep Learning Model for Assessment of Substitution Voicing and Speech after Laryngeal Carcinoma Surgery
title_full_unstemmed Lightweight Deep Learning Model for Assessment of Substitution Voicing and Speech after Laryngeal Carcinoma Surgery
title_short Lightweight Deep Learning Model for Assessment of Substitution Voicing and Speech after Laryngeal Carcinoma Surgery
title_sort lightweight deep learning model for assessment of substitution voicing and speech after laryngeal carcinoma surgery
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9139213/
https://www.ncbi.nlm.nih.gov/pubmed/35625971
http://dx.doi.org/10.3390/cancers14102366
work_keys_str_mv AT maskeliunasrytis lightweightdeeplearningmodelforassessmentofsubstitutionvoicingandspeechafterlaryngealcarcinomasurgery
AT kulikajevasaudrius lightweightdeeplearningmodelforassessmentofsubstitutionvoicingandspeechafterlaryngealcarcinomasurgery
AT damaseviciusrobertas lightweightdeeplearningmodelforassessmentofsubstitutionvoicingandspeechafterlaryngealcarcinomasurgery
AT pribuisiskipras lightweightdeeplearningmodelforassessmentofsubstitutionvoicingandspeechafterlaryngealcarcinomasurgery
AT ulozaitestanienenora lightweightdeeplearningmodelforassessmentofsubstitutionvoicingandspeechafterlaryngealcarcinomasurgery
AT ulozavirgilijus lightweightdeeplearningmodelforassessmentofsubstitutionvoicingandspeechafterlaryngealcarcinomasurgery