Cargando…
3D VOSNet: Segmentation of endoscopic images of the larynx with subsequent generation of indicators
Video laryngoscope is available for visualizing the motion of vocal cords and aid in the assessment of analyzing the larynx-related lesion preliminarily. Laryngeal Electromyography (EMG) needs to be performed to diagnose the factors of vocal cord paralysis, which may cause patient feeling unwell. Th...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10009724/ https://www.ncbi.nlm.nih.gov/pubmed/36923825 http://dx.doi.org/10.1016/j.heliyon.2023.e14242 |
_version_ | 1784906047380520960 |
---|---|
author | Chen, I-Miao Yeh, Pin-Yu Hsieh, Ya-Chu Chang, Ting-Chi Shih, Samantha Shen, Wen-Fang Chin, Chiun-Li |
author_facet | Chen, I-Miao Yeh, Pin-Yu Hsieh, Ya-Chu Chang, Ting-Chi Shih, Samantha Shen, Wen-Fang Chin, Chiun-Li |
author_sort | Chen, I-Miao |
collection | PubMed |
description | Video laryngoscope is available for visualizing the motion of vocal cords and aid in the assessment of analyzing the larynx-related lesion preliminarily. Laryngeal Electromyography (EMG) needs to be performed to diagnose the factors of vocal cord paralysis, which may cause patient feeling unwell. Thus, the problem is the lack of credible larynx indicators to evaluate larynx-related diseases in the department of otolaryngology. Therefore, this paper aims to propose a 3D VOSNet model, which has the characteristics of sequence segmentation to extract the time-series features in the video laryngoscope. The 3D VOSNet model can keep the time-series features of three images before and after of the specific image to achieve translation and occlusion invariance, which explicitly signifies that our model can segment and classify each item in the video of laryngoscopy not affected by extrinsic causes such as shaking or occlusion during laryngoscope. Numerical results revealed that the testing accuracy rates of the glottal, right vocal cord, and the left vocal cord are 89.91%, 94.63%, and 93.48%, respectively. Our proposed model can segment glottal and vocal cords from the sequence of laryngoscopy. Finally, using the proposed algorithm computes six larynx indicators, which are the area of the glottal, area of vocal cords, length of vocal cords, deviation of length of vocal cords, and symmetry of the vocal cords. In order to assist otolaryngologists in staying credible and objective when making decisions without any doubt during diagnosis and also explaining the clinical symptoms of the larynx such as vocal cord paralysis to patients after diagnosis, our proposed algorithm provides otolaryngologists with explainable indicators (X-indicators). |
format | Online Article Text |
id | pubmed-10009724 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-100097242023-03-14 3D VOSNet: Segmentation of endoscopic images of the larynx with subsequent generation of indicators Chen, I-Miao Yeh, Pin-Yu Hsieh, Ya-Chu Chang, Ting-Chi Shih, Samantha Shen, Wen-Fang Chin, Chiun-Li Heliyon Research Article Video laryngoscope is available for visualizing the motion of vocal cords and aid in the assessment of analyzing the larynx-related lesion preliminarily. Laryngeal Electromyography (EMG) needs to be performed to diagnose the factors of vocal cord paralysis, which may cause patient feeling unwell. Thus, the problem is the lack of credible larynx indicators to evaluate larynx-related diseases in the department of otolaryngology. Therefore, this paper aims to propose a 3D VOSNet model, which has the characteristics of sequence segmentation to extract the time-series features in the video laryngoscope. The 3D VOSNet model can keep the time-series features of three images before and after of the specific image to achieve translation and occlusion invariance, which explicitly signifies that our model can segment and classify each item in the video of laryngoscopy not affected by extrinsic causes such as shaking or occlusion during laryngoscope. Numerical results revealed that the testing accuracy rates of the glottal, right vocal cord, and the left vocal cord are 89.91%, 94.63%, and 93.48%, respectively. Our proposed model can segment glottal and vocal cords from the sequence of laryngoscopy. Finally, using the proposed algorithm computes six larynx indicators, which are the area of the glottal, area of vocal cords, length of vocal cords, deviation of length of vocal cords, and symmetry of the vocal cords. In order to assist otolaryngologists in staying credible and objective when making decisions without any doubt during diagnosis and also explaining the clinical symptoms of the larynx such as vocal cord paralysis to patients after diagnosis, our proposed algorithm provides otolaryngologists with explainable indicators (X-indicators). Elsevier 2023-03-03 /pmc/articles/PMC10009724/ /pubmed/36923825 http://dx.doi.org/10.1016/j.heliyon.2023.e14242 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Research Article Chen, I-Miao Yeh, Pin-Yu Hsieh, Ya-Chu Chang, Ting-Chi Shih, Samantha Shen, Wen-Fang Chin, Chiun-Li 3D VOSNet: Segmentation of endoscopic images of the larynx with subsequent generation of indicators |
title | 3D VOSNet: Segmentation of endoscopic images of the larynx with subsequent generation of indicators |
title_full | 3D VOSNet: Segmentation of endoscopic images of the larynx with subsequent generation of indicators |
title_fullStr | 3D VOSNet: Segmentation of endoscopic images of the larynx with subsequent generation of indicators |
title_full_unstemmed | 3D VOSNet: Segmentation of endoscopic images of the larynx with subsequent generation of indicators |
title_short | 3D VOSNet: Segmentation of endoscopic images of the larynx with subsequent generation of indicators |
title_sort | 3d vosnet: segmentation of endoscopic images of the larynx with subsequent generation of indicators |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10009724/ https://www.ncbi.nlm.nih.gov/pubmed/36923825 http://dx.doi.org/10.1016/j.heliyon.2023.e14242 |
work_keys_str_mv | AT chenimiao 3dvosnetsegmentationofendoscopicimagesofthelarynxwithsubsequentgenerationofindicators AT yehpinyu 3dvosnetsegmentationofendoscopicimagesofthelarynxwithsubsequentgenerationofindicators AT hsiehyachu 3dvosnetsegmentationofendoscopicimagesofthelarynxwithsubsequentgenerationofindicators AT changtingchi 3dvosnetsegmentationofendoscopicimagesofthelarynxwithsubsequentgenerationofindicators AT shihsamantha 3dvosnetsegmentationofendoscopicimagesofthelarynxwithsubsequentgenerationofindicators AT shenwenfang 3dvosnetsegmentationofendoscopicimagesofthelarynxwithsubsequentgenerationofindicators AT chinchiunli 3dvosnetsegmentationofendoscopicimagesofthelarynxwithsubsequentgenerationofindicators |