Cargando…

Laryngeal Pressure Estimation With a Recurrent Neural Network

Quantifying the physical parameters of voice production is essential for understanding the process of phonation and can aid in voice research and diagnosis. As an alternative to invasive measurements, they can be estimated by formulating an inverse problem using a numerical forward model. However, h...

Descripción completa

Detalles Bibliográficos
Formato: Online Artículo Texto
Lenguaje:English
Publicado: IEEE 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6331197/
https://www.ncbi.nlm.nih.gov/pubmed/30680252
http://dx.doi.org/10.1109/JTEHM.2018.2886021
_version_ 1783387103833358336
collection PubMed
description Quantifying the physical parameters of voice production is essential for understanding the process of phonation and can aid in voice research and diagnosis. As an alternative to invasive measurements, they can be estimated by formulating an inverse problem using a numerical forward model. However, high-fidelity numerical models are often computationally too expensive for this. This paper presents a novel approach to train a long short-term memory network to estimate the subglottal pressure in the larynx at massively reduced computational cost using solely synthetic training data. We train the network on synthetic data from a numerical two-mass model and validate it on experimental data from 288 high-speed ex vivo video recordings of porcine vocal folds from a previous study. The training requires significantly fewer model evaluations compared with the previous optimization approach. On the test set, we maintain a comparable performance of 21.2% versus previous 17.7% mean absolute percentage error in estimating the subglottal pressure. The evaluation of one sample requires a vanishingly small amount of computation time. The presented approach is able to maintain estimation accuracy of the subglottal pressure at significantly reduced computational cost. The methodology is likely transferable to estimate other parameters and training with other numerical models. This improvement should allow the adoption of more sophisticated, high-fidelity numerical models of the larynx. The vast speedup is a critical step to enable a future clinical application and knowledge of parameters such as the subglottal pressure will aid in diagnosis and treatment selection.
format Online
Article
Text
id pubmed-6331197
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher IEEE
record_format MEDLINE/PubMed
spelling pubmed-63311972019-01-24 Laryngeal Pressure Estimation With a Recurrent Neural Network IEEE J Transl Eng Health Med Article Quantifying the physical parameters of voice production is essential for understanding the process of phonation and can aid in voice research and diagnosis. As an alternative to invasive measurements, they can be estimated by formulating an inverse problem using a numerical forward model. However, high-fidelity numerical models are often computationally too expensive for this. This paper presents a novel approach to train a long short-term memory network to estimate the subglottal pressure in the larynx at massively reduced computational cost using solely synthetic training data. We train the network on synthetic data from a numerical two-mass model and validate it on experimental data from 288 high-speed ex vivo video recordings of porcine vocal folds from a previous study. The training requires significantly fewer model evaluations compared with the previous optimization approach. On the test set, we maintain a comparable performance of 21.2% versus previous 17.7% mean absolute percentage error in estimating the subglottal pressure. The evaluation of one sample requires a vanishingly small amount of computation time. The presented approach is able to maintain estimation accuracy of the subglottal pressure at significantly reduced computational cost. The methodology is likely transferable to estimate other parameters and training with other numerical models. This improvement should allow the adoption of more sophisticated, high-fidelity numerical models of the larynx. The vast speedup is a critical step to enable a future clinical application and knowledge of parameters such as the subglottal pressure will aid in diagnosis and treatment selection. IEEE 2018-12-27 /pmc/articles/PMC6331197/ /pubmed/30680252 http://dx.doi.org/10.1109/JTEHM.2018.2886021 Text en This work is licensed under a Creative Commons Attribution 3.0 License. For more information, see http://creativecommons.org/licenses/by/3.0/
spellingShingle Article
Laryngeal Pressure Estimation With a Recurrent Neural Network
title Laryngeal Pressure Estimation With a Recurrent Neural Network
title_full Laryngeal Pressure Estimation With a Recurrent Neural Network
title_fullStr Laryngeal Pressure Estimation With a Recurrent Neural Network
title_full_unstemmed Laryngeal Pressure Estimation With a Recurrent Neural Network
title_short Laryngeal Pressure Estimation With a Recurrent Neural Network
title_sort laryngeal pressure estimation with a recurrent neural network
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6331197/
https://www.ncbi.nlm.nih.gov/pubmed/30680252
http://dx.doi.org/10.1109/JTEHM.2018.2886021
work_keys_str_mv AT laryngealpressureestimationwitharecurrentneuralnetwork
AT laryngealpressureestimationwitharecurrentneuralnetwork
AT laryngealpressureestimationwitharecurrentneuralnetwork
AT laryngealpressureestimationwitharecurrentneuralnetwork