Cargando…

3D VOSNet: Segmentation of endoscopic images of the larynx with subsequent generation of indicators

Video laryngoscope is available for visualizing the motion of vocal cords and aid in the assessment of analyzing the larynx-related lesion preliminarily. Laryngeal Electromyography (EMG) needs to be performed to diagnose the factors of vocal cord paralysis, which may cause patient feeling unwell. Th...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, I-Miao, Yeh, Pin-Yu, Hsieh, Ya-Chu, Chang, Ting-Chi, Shih, Samantha, Shen, Wen-Fang, Chin, Chiun-Li
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10009724/
https://www.ncbi.nlm.nih.gov/pubmed/36923825
http://dx.doi.org/10.1016/j.heliyon.2023.e14242
_version_ 1784906047380520960
author Chen, I-Miao
Yeh, Pin-Yu
Hsieh, Ya-Chu
Chang, Ting-Chi
Shih, Samantha
Shen, Wen-Fang
Chin, Chiun-Li
author_facet Chen, I-Miao
Yeh, Pin-Yu
Hsieh, Ya-Chu
Chang, Ting-Chi
Shih, Samantha
Shen, Wen-Fang
Chin, Chiun-Li
author_sort Chen, I-Miao
collection PubMed
description Video laryngoscope is available for visualizing the motion of vocal cords and aid in the assessment of analyzing the larynx-related lesion preliminarily. Laryngeal Electromyography (EMG) needs to be performed to diagnose the factors of vocal cord paralysis, which may cause patient feeling unwell. Thus, the problem is the lack of credible larynx indicators to evaluate larynx-related diseases in the department of otolaryngology. Therefore, this paper aims to propose a 3D VOSNet model, which has the characteristics of sequence segmentation to extract the time-series features in the video laryngoscope. The 3D VOSNet model can keep the time-series features of three images before and after of the specific image to achieve translation and occlusion invariance, which explicitly signifies that our model can segment and classify each item in the video of laryngoscopy not affected by extrinsic causes such as shaking or occlusion during laryngoscope. Numerical results revealed that the testing accuracy rates of the glottal, right vocal cord, and the left vocal cord are 89.91%, 94.63%, and 93.48%, respectively. Our proposed model can segment glottal and vocal cords from the sequence of laryngoscopy. Finally, using the proposed algorithm computes six larynx indicators, which are the area of the glottal, area of vocal cords, length of vocal cords, deviation of length of vocal cords, and symmetry of the vocal cords. In order to assist otolaryngologists in staying credible and objective when making decisions without any doubt during diagnosis and also explaining the clinical symptoms of the larynx such as vocal cord paralysis to patients after diagnosis, our proposed algorithm provides otolaryngologists with explainable indicators (X-indicators).
format Online
Article
Text
id pubmed-10009724
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-100097242023-03-14 3D VOSNet: Segmentation of endoscopic images of the larynx with subsequent generation of indicators Chen, I-Miao Yeh, Pin-Yu Hsieh, Ya-Chu Chang, Ting-Chi Shih, Samantha Shen, Wen-Fang Chin, Chiun-Li Heliyon Research Article Video laryngoscope is available for visualizing the motion of vocal cords and aid in the assessment of analyzing the larynx-related lesion preliminarily. Laryngeal Electromyography (EMG) needs to be performed to diagnose the factors of vocal cord paralysis, which may cause patient feeling unwell. Thus, the problem is the lack of credible larynx indicators to evaluate larynx-related diseases in the department of otolaryngology. Therefore, this paper aims to propose a 3D VOSNet model, which has the characteristics of sequence segmentation to extract the time-series features in the video laryngoscope. The 3D VOSNet model can keep the time-series features of three images before and after of the specific image to achieve translation and occlusion invariance, which explicitly signifies that our model can segment and classify each item in the video of laryngoscopy not affected by extrinsic causes such as shaking or occlusion during laryngoscope. Numerical results revealed that the testing accuracy rates of the glottal, right vocal cord, and the left vocal cord are 89.91%, 94.63%, and 93.48%, respectively. Our proposed model can segment glottal and vocal cords from the sequence of laryngoscopy. Finally, using the proposed algorithm computes six larynx indicators, which are the area of the glottal, area of vocal cords, length of vocal cords, deviation of length of vocal cords, and symmetry of the vocal cords. In order to assist otolaryngologists in staying credible and objective when making decisions without any doubt during diagnosis and also explaining the clinical symptoms of the larynx such as vocal cord paralysis to patients after diagnosis, our proposed algorithm provides otolaryngologists with explainable indicators (X-indicators). Elsevier 2023-03-03 /pmc/articles/PMC10009724/ /pubmed/36923825 http://dx.doi.org/10.1016/j.heliyon.2023.e14242 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research Article
Chen, I-Miao
Yeh, Pin-Yu
Hsieh, Ya-Chu
Chang, Ting-Chi
Shih, Samantha
Shen, Wen-Fang
Chin, Chiun-Li
3D VOSNet: Segmentation of endoscopic images of the larynx with subsequent generation of indicators
title 3D VOSNet: Segmentation of endoscopic images of the larynx with subsequent generation of indicators
title_full 3D VOSNet: Segmentation of endoscopic images of the larynx with subsequent generation of indicators
title_fullStr 3D VOSNet: Segmentation of endoscopic images of the larynx with subsequent generation of indicators
title_full_unstemmed 3D VOSNet: Segmentation of endoscopic images of the larynx with subsequent generation of indicators
title_short 3D VOSNet: Segmentation of endoscopic images of the larynx with subsequent generation of indicators
title_sort 3d vosnet: segmentation of endoscopic images of the larynx with subsequent generation of indicators
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10009724/
https://www.ncbi.nlm.nih.gov/pubmed/36923825
http://dx.doi.org/10.1016/j.heliyon.2023.e14242
work_keys_str_mv AT chenimiao 3dvosnetsegmentationofendoscopicimagesofthelarynxwithsubsequentgenerationofindicators
AT yehpinyu 3dvosnetsegmentationofendoscopicimagesofthelarynxwithsubsequentgenerationofindicators
AT hsiehyachu 3dvosnetsegmentationofendoscopicimagesofthelarynxwithsubsequentgenerationofindicators
AT changtingchi 3dvosnetsegmentationofendoscopicimagesofthelarynxwithsubsequentgenerationofindicators
AT shihsamantha 3dvosnetsegmentationofendoscopicimagesofthelarynxwithsubsequentgenerationofindicators
AT shenwenfang 3dvosnetsegmentationofendoscopicimagesofthelarynxwithsubsequentgenerationofindicators
AT chinchiunli 3dvosnetsegmentationofendoscopicimagesofthelarynxwithsubsequentgenerationofindicators