Cargando…

Mask R-CNN based multiclass segmentation model for endotracheal intubation using video laryngoscope

OBJECTIVE: Endotracheal intubation (ETI) is critical to secure the airway in emergent situations. Although artificial intelligence algorithms are frequently used to analyze medical images, their application to evaluating intraoral structures based on images captured during emergent ETI remains limit...

Descripción completa

Detalles Bibliográficos
Autores principales: Choi, Seung Jae, Kim, Dae Kon, Kim, Byeong Soo, Cho, Minwoo, Jeong, Joo, Jo, You Hwan, Song, Kyoung Jun, Kim, Yu Jin, Kim, Sungwan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10631336/
https://www.ncbi.nlm.nih.gov/pubmed/38025115
http://dx.doi.org/10.1177/20552076231211547
_version_ 1785132352910917632
author Choi, Seung Jae
Kim, Dae Kon
Kim, Byeong Soo
Cho, Minwoo
Jeong, Joo
Jo, You Hwan
Song, Kyoung Jun
Kim, Yu Jin
Kim, Sungwan
author_facet Choi, Seung Jae
Kim, Dae Kon
Kim, Byeong Soo
Cho, Minwoo
Jeong, Joo
Jo, You Hwan
Song, Kyoung Jun
Kim, Yu Jin
Kim, Sungwan
author_sort Choi, Seung Jae
collection PubMed
description OBJECTIVE: Endotracheal intubation (ETI) is critical to secure the airway in emergent situations. Although artificial intelligence algorithms are frequently used to analyze medical images, their application to evaluating intraoral structures based on images captured during emergent ETI remains limited. The aim of this study is to develop an artificial intelligence model for segmenting structures in the oral cavity using video laryngoscope (VL) images. METHODS: From 54 VL videos, clinicians manually labeled images that include motion blur, foggy vision, blood, mucus, and vomitus. Anatomical structures of interest included the tongue, epiglottis, vocal cord, and corniculate cartilage. EfficientNet-B5 with DeepLabv3+, EffecientNet-B5 with U-Net, and Configured Mask R-Convolution Neural Network (CNN) were used; EffecientNet-B5 was pretrained on ImageNet. Dice similarity coefficient (DSC) was used to measure the segmentation performance of the model. Accuracy, recall, specificity, and F1 score were used to evaluate the model's performance in targeting the structure from the value of the intersection over union between the ground truth and prediction mask. RESULTS: The DSC of tongue, epiglottis, vocal cord, and corniculate cartilage obtained from the EfficientNet-B5 with DeepLabv3+, EfficientNet-B5 with U-Net, and Configured Mask R-CNN model were 0.3351/0.7675/0.766/0.6539, 0.0/0.7581/0.7395/0.6906, and 0.1167/0.7677/0.7207/0.57, respectively. Furthermore, the processing speeds (frames per second) of the three models stood at 3, 24, and 32, respectively. CONCLUSIONS: The algorithm developed in this study can assist medical providers performing ETI in emergent situations.
format Online
Article
Text
id pubmed-10631336
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-106313362023-11-06 Mask R-CNN based multiclass segmentation model for endotracheal intubation using video laryngoscope Choi, Seung Jae Kim, Dae Kon Kim, Byeong Soo Cho, Minwoo Jeong, Joo Jo, You Hwan Song, Kyoung Jun Kim, Yu Jin Kim, Sungwan Digit Health Original Research OBJECTIVE: Endotracheal intubation (ETI) is critical to secure the airway in emergent situations. Although artificial intelligence algorithms are frequently used to analyze medical images, their application to evaluating intraoral structures based on images captured during emergent ETI remains limited. The aim of this study is to develop an artificial intelligence model for segmenting structures in the oral cavity using video laryngoscope (VL) images. METHODS: From 54 VL videos, clinicians manually labeled images that include motion blur, foggy vision, blood, mucus, and vomitus. Anatomical structures of interest included the tongue, epiglottis, vocal cord, and corniculate cartilage. EfficientNet-B5 with DeepLabv3+, EffecientNet-B5 with U-Net, and Configured Mask R-Convolution Neural Network (CNN) were used; EffecientNet-B5 was pretrained on ImageNet. Dice similarity coefficient (DSC) was used to measure the segmentation performance of the model. Accuracy, recall, specificity, and F1 score were used to evaluate the model's performance in targeting the structure from the value of the intersection over union between the ground truth and prediction mask. RESULTS: The DSC of tongue, epiglottis, vocal cord, and corniculate cartilage obtained from the EfficientNet-B5 with DeepLabv3+, EfficientNet-B5 with U-Net, and Configured Mask R-CNN model were 0.3351/0.7675/0.766/0.6539, 0.0/0.7581/0.7395/0.6906, and 0.1167/0.7677/0.7207/0.57, respectively. Furthermore, the processing speeds (frames per second) of the three models stood at 3, 24, and 32, respectively. CONCLUSIONS: The algorithm developed in this study can assist medical providers performing ETI in emergent situations. SAGE Publications 2023-11-06 /pmc/articles/PMC10631336/ /pubmed/38025115 http://dx.doi.org/10.1177/20552076231211547 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by-nc-nd/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 License (https://creativecommons.org/licenses/by-nc-nd/4.0/) which permits non-commercial use, reproduction and distribution of the work as published without adaptation or alteration, without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Research
Choi, Seung Jae
Kim, Dae Kon
Kim, Byeong Soo
Cho, Minwoo
Jeong, Joo
Jo, You Hwan
Song, Kyoung Jun
Kim, Yu Jin
Kim, Sungwan
Mask R-CNN based multiclass segmentation model for endotracheal intubation using video laryngoscope
title Mask R-CNN based multiclass segmentation model for endotracheal intubation using video laryngoscope
title_full Mask R-CNN based multiclass segmentation model for endotracheal intubation using video laryngoscope
title_fullStr Mask R-CNN based multiclass segmentation model for endotracheal intubation using video laryngoscope
title_full_unstemmed Mask R-CNN based multiclass segmentation model for endotracheal intubation using video laryngoscope
title_short Mask R-CNN based multiclass segmentation model for endotracheal intubation using video laryngoscope
title_sort mask r-cnn based multiclass segmentation model for endotracheal intubation using video laryngoscope
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10631336/
https://www.ncbi.nlm.nih.gov/pubmed/38025115
http://dx.doi.org/10.1177/20552076231211547
work_keys_str_mv AT choiseungjae maskrcnnbasedmulticlasssegmentationmodelforendotrachealintubationusingvideolaryngoscope
AT kimdaekon maskrcnnbasedmulticlasssegmentationmodelforendotrachealintubationusingvideolaryngoscope
AT kimbyeongsoo maskrcnnbasedmulticlasssegmentationmodelforendotrachealintubationusingvideolaryngoscope
AT chominwoo maskrcnnbasedmulticlasssegmentationmodelforendotrachealintubationusingvideolaryngoscope
AT jeongjoo maskrcnnbasedmulticlasssegmentationmodelforendotrachealintubationusingvideolaryngoscope
AT joyouhwan maskrcnnbasedmulticlasssegmentationmodelforendotrachealintubationusingvideolaryngoscope
AT songkyoungjun maskrcnnbasedmulticlasssegmentationmodelforendotrachealintubationusingvideolaryngoscope
AT kimyujin maskrcnnbasedmulticlasssegmentationmodelforendotrachealintubationusingvideolaryngoscope
AT kimsungwan maskrcnnbasedmulticlasssegmentationmodelforendotrachealintubationusingvideolaryngoscope