Cargando…

GlottisNetV2: Temporal Glottal Midline Detection Using Deep Convolutional Neural Networks

High-speed videoendoscopy is a major tool for quantitative laryngology. Glottis segmentation and glottal midline detection are crucial for computing vocal fold-specific, quantitative parameters. However, fully automated solutions show limited clinical applicability. Especially unbiased glottal midli...

Descripción completa

Detalles Bibliográficos
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	IEEE 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9933989/ https://www.ncbi.nlm.nih.gov/pubmed/36816097 http://dx.doi.org/10.1109/JTEHM.2023.3237859

_version_	1784889785856294912
collection	PubMed
description	High-speed videoendoscopy is a major tool for quantitative laryngology. Glottis segmentation and glottal midline detection are crucial for computing vocal fold-specific, quantitative parameters. However, fully automated solutions show limited clinical applicability. Especially unbiased glottal midline detection remains a challenging problem. We developed a multitask deep neural network for glottis segmentation and glottal midline detection. We used techniques from pose estimation to estimate the anterior and posterior points in endoscopy images. Neural networks were set up in TensorFlow/Keras and trained and evaluated with the BAGLS dataset. We found that a dual decoder deep neural network termed GlottisNetV2 outperforms the previously proposed GlottisNet in terms of MAPE on the test dataset (1.85% to 6.3%) while converging faster. Using various hyperparameter tunings, we allow fast and directed training. Using temporal variant data on an additional data set designed for this task, we can improve the median prediction accuracy from 2.1% to 1.76% when using 12 consecutive frames and additional temporal filtering. We found that temporal glottal midline detection using a dual decoder architecture together with keypoint estimation allows accurate midline prediction. We show that our proposed architecture allows stable and reliable glottal midline predictions ready for clinical use and analysis of symmetry measures.
format	Online Article Text
id	pubmed-9933989
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	IEEE
record_format	MEDLINE/PubMed
spelling	pubmed-99339892023-02-17 GlottisNetV2: Temporal Glottal Midline Detection Using Deep Convolutional Neural Networks IEEE J Transl Eng Health Med Article High-speed videoendoscopy is a major tool for quantitative laryngology. Glottis segmentation and glottal midline detection are crucial for computing vocal fold-specific, quantitative parameters. However, fully automated solutions show limited clinical applicability. Especially unbiased glottal midline detection remains a challenging problem. We developed a multitask deep neural network for glottis segmentation and glottal midline detection. We used techniques from pose estimation to estimate the anterior and posterior points in endoscopy images. Neural networks were set up in TensorFlow/Keras and trained and evaluated with the BAGLS dataset. We found that a dual decoder deep neural network termed GlottisNetV2 outperforms the previously proposed GlottisNet in terms of MAPE on the test dataset (1.85% to 6.3%) while converging faster. Using various hyperparameter tunings, we allow fast and directed training. Using temporal variant data on an additional data set designed for this task, we can improve the median prediction accuracy from 2.1% to 1.76% when using 12 consecutive frames and additional temporal filtering. We found that temporal glottal midline detection using a dual decoder architecture together with keypoint estimation allows accurate midline prediction. We show that our proposed architecture allows stable and reliable glottal midline predictions ready for clinical use and analysis of symmetry measures. IEEE 2023-01-19 /pmc/articles/PMC9933989/ /pubmed/36816097 http://dx.doi.org/10.1109/JTEHM.2023.3237859 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
spellingShingle	Article GlottisNetV2: Temporal Glottal Midline Detection Using Deep Convolutional Neural Networks
title	GlottisNetV2: Temporal Glottal Midline Detection Using Deep Convolutional Neural Networks
title_full	GlottisNetV2: Temporal Glottal Midline Detection Using Deep Convolutional Neural Networks
title_fullStr	GlottisNetV2: Temporal Glottal Midline Detection Using Deep Convolutional Neural Networks
title_full_unstemmed	GlottisNetV2: Temporal Glottal Midline Detection Using Deep Convolutional Neural Networks
title_short	GlottisNetV2: Temporal Glottal Midline Detection Using Deep Convolutional Neural Networks
title_sort	glottisnetv2: temporal glottal midline detection using deep convolutional neural networks
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9933989/ https://www.ncbi.nlm.nih.gov/pubmed/36816097 http://dx.doi.org/10.1109/JTEHM.2023.3237859
work_keys_str_mv	AT glottisnetv2temporalglottalmidlinedetectionusingdeepconvolutionalneuralnetworks AT glottisnetv2temporalglottalmidlinedetectionusingdeepconvolutionalneuralnetworks AT glottisnetv2temporalglottalmidlinedetectionusingdeepconvolutionalneuralnetworks AT glottisnetv2temporalglottalmidlinedetectionusingdeepconvolutionalneuralnetworks

GlottisNetV2: Temporal Glottal Midline Detection Using Deep Convolutional Neural Networks

Ejemplares similares