
Automatic Detection System for Velopharyngeal Insufficiency Based on Acoustic Signals from Nasal and Oral Channels

Velopharyngeal insufficiency (VPI) is a type of pharyngeal dysfunction that causes speech impairment and swallowing disorders. Speech therapists play a key role in the diagnosis and treatment of speech disorders. However, there is a worldwide shortage of experienced speech therapists. Artifi...


Bibliographic Details
Main authors: Zhang, Yu; Zhang, Jing; Li, Wen; Yin, Heng; He, Ling
Format: Online Article Text
Language: English
Published: MDPI 2023
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10453249/
https://www.ncbi.nlm.nih.gov/pubmed/37627973
http://dx.doi.org/10.3390/diagnostics13162714
author Zhang, Yu
Zhang, Jing
Li, Wen
Yin, Heng
He, Ling
collection PubMed
description Velopharyngeal insufficiency (VPI) is a type of pharyngeal dysfunction that causes speech impairment and swallowing disorders. Speech therapists play a key role in the diagnosis and treatment of speech disorders. However, there is a worldwide shortage of experienced speech therapists. Artificial intelligence-based computer-aided diagnosis technology could help address this shortage. This paper proposes an automatic system for VPI detection at the subject level. It is a non-invasive and convenient approach to VPI diagnosis. Because VPI impairs articulation, nasal- and oral-channel acoustic signals are collected as raw data. The system integrates the symptom discrimination results obtained at the phoneme level. For consonants, relative prominent frequency description and relative frequency distribution features are proposed to detect nasal air emission caused by VPI. For hypernasality-sensitive vowels, a cross-attention residual Siamese network (CARS-Net) is proposed to perform automatic VPI/non-VPI classification at the phoneme level. CARS-Net embeds a cross-attention module between its two branches to improve VPI/non-VPI classification for vowels. We validate the proposed system on a self-built dataset, where the accuracy reaches 98.52%. This result points to the feasibility of automatic VPI diagnosis.
format Online
Article
Text
id pubmed-10453249
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-10453249 2023-08-26 Diagnostics (Basel) Article MDPI 2023-08-21 /pmc/articles/PMC10453249/ /pubmed/37627973 http://dx.doi.org/10.3390/diagnostics13162714 Text en © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
title Automatic Detection System for Velopharyngeal Insufficiency Based on Acoustic Signals from Nasal and Oral Channels
topic Article