Cargando…
Mask R-CNN based multiclass segmentation model for endotracheal intubation using video laryngoscope
OBJECTIVE: Endotracheal intubation (ETI) is critical to secure the airway in emergent situations. Although artificial intelligence algorithms are frequently used to analyze medical images, their application to evaluating intraoral structures based on images captured during emergent ETI remains limit...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
SAGE Publications
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10631336/ https://www.ncbi.nlm.nih.gov/pubmed/38025115 http://dx.doi.org/10.1177/20552076231211547 |
_version_ | 1785132352910917632 |
---|---|
author | Choi, Seung Jae Kim, Dae Kon Kim, Byeong Soo Cho, Minwoo Jeong, Joo Jo, You Hwan Song, Kyoung Jun Kim, Yu Jin Kim, Sungwan |
author_facet | Choi, Seung Jae Kim, Dae Kon Kim, Byeong Soo Cho, Minwoo Jeong, Joo Jo, You Hwan Song, Kyoung Jun Kim, Yu Jin Kim, Sungwan |
author_sort | Choi, Seung Jae |
collection | PubMed |
description | OBJECTIVE: Endotracheal intubation (ETI) is critical to secure the airway in emergent situations. Although artificial intelligence algorithms are frequently used to analyze medical images, their application to evaluating intraoral structures based on images captured during emergent ETI remains limited. The aim of this study is to develop an artificial intelligence model for segmenting structures in the oral cavity using video laryngoscope (VL) images. METHODS: From 54 VL videos, clinicians manually labeled images that include motion blur, foggy vision, blood, mucus, and vomitus. Anatomical structures of interest included the tongue, epiglottis, vocal cord, and corniculate cartilage. EfficientNet-B5 with DeepLabv3+, EffecientNet-B5 with U-Net, and Configured Mask R-Convolution Neural Network (CNN) were used; EffecientNet-B5 was pretrained on ImageNet. Dice similarity coefficient (DSC) was used to measure the segmentation performance of the model. Accuracy, recall, specificity, and F1 score were used to evaluate the model's performance in targeting the structure from the value of the intersection over union between the ground truth and prediction mask. RESULTS: The DSC of tongue, epiglottis, vocal cord, and corniculate cartilage obtained from the EfficientNet-B5 with DeepLabv3+, EfficientNet-B5 with U-Net, and Configured Mask R-CNN model were 0.3351/0.7675/0.766/0.6539, 0.0/0.7581/0.7395/0.6906, and 0.1167/0.7677/0.7207/0.57, respectively. Furthermore, the processing speeds (frames per second) of the three models stood at 3, 24, and 32, respectively. CONCLUSIONS: The algorithm developed in this study can assist medical providers performing ETI in emergent situations. |
format | Online Article Text |
id | pubmed-10631336 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | SAGE Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-106313362023-11-06 Mask R-CNN based multiclass segmentation model for endotracheal intubation using video laryngoscope Choi, Seung Jae Kim, Dae Kon Kim, Byeong Soo Cho, Minwoo Jeong, Joo Jo, You Hwan Song, Kyoung Jun Kim, Yu Jin Kim, Sungwan Digit Health Original Research OBJECTIVE: Endotracheal intubation (ETI) is critical to secure the airway in emergent situations. Although artificial intelligence algorithms are frequently used to analyze medical images, their application to evaluating intraoral structures based on images captured during emergent ETI remains limited. The aim of this study is to develop an artificial intelligence model for segmenting structures in the oral cavity using video laryngoscope (VL) images. METHODS: From 54 VL videos, clinicians manually labeled images that include motion blur, foggy vision, blood, mucus, and vomitus. Anatomical structures of interest included the tongue, epiglottis, vocal cord, and corniculate cartilage. EfficientNet-B5 with DeepLabv3+, EffecientNet-B5 with U-Net, and Configured Mask R-Convolution Neural Network (CNN) were used; EffecientNet-B5 was pretrained on ImageNet. Dice similarity coefficient (DSC) was used to measure the segmentation performance of the model. Accuracy, recall, specificity, and F1 score were used to evaluate the model's performance in targeting the structure from the value of the intersection over union between the ground truth and prediction mask. RESULTS: The DSC of tongue, epiglottis, vocal cord, and corniculate cartilage obtained from the EfficientNet-B5 with DeepLabv3+, EfficientNet-B5 with U-Net, and Configured Mask R-CNN model were 0.3351/0.7675/0.766/0.6539, 0.0/0.7581/0.7395/0.6906, and 0.1167/0.7677/0.7207/0.57, respectively. Furthermore, the processing speeds (frames per second) of the three models stood at 3, 24, and 32, respectively. CONCLUSIONS: The algorithm developed in this study can assist medical providers performing ETI in emergent situations. SAGE Publications 2023-11-06 /pmc/articles/PMC10631336/ /pubmed/38025115 http://dx.doi.org/10.1177/20552076231211547 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by-nc-nd/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 License (https://creativecommons.org/licenses/by-nc-nd/4.0/) which permits non-commercial use, reproduction and distribution of the work as published without adaptation or alteration, without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage). |
spellingShingle | Original Research Choi, Seung Jae Kim, Dae Kon Kim, Byeong Soo Cho, Minwoo Jeong, Joo Jo, You Hwan Song, Kyoung Jun Kim, Yu Jin Kim, Sungwan Mask R-CNN based multiclass segmentation model for endotracheal intubation using video laryngoscope |
title | Mask R-CNN based multiclass segmentation model for endotracheal intubation using video laryngoscope |
title_full | Mask R-CNN based multiclass segmentation model for endotracheal intubation using video laryngoscope |
title_fullStr | Mask R-CNN based multiclass segmentation model for endotracheal intubation using video laryngoscope |
title_full_unstemmed | Mask R-CNN based multiclass segmentation model for endotracheal intubation using video laryngoscope |
title_short | Mask R-CNN based multiclass segmentation model for endotracheal intubation using video laryngoscope |
title_sort | mask r-cnn based multiclass segmentation model for endotracheal intubation using video laryngoscope |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10631336/ https://www.ncbi.nlm.nih.gov/pubmed/38025115 http://dx.doi.org/10.1177/20552076231211547 |
work_keys_str_mv | AT choiseungjae maskrcnnbasedmulticlasssegmentationmodelforendotrachealintubationusingvideolaryngoscope AT kimdaekon maskrcnnbasedmulticlasssegmentationmodelforendotrachealintubationusingvideolaryngoscope AT kimbyeongsoo maskrcnnbasedmulticlasssegmentationmodelforendotrachealintubationusingvideolaryngoscope AT chominwoo maskrcnnbasedmulticlasssegmentationmodelforendotrachealintubationusingvideolaryngoscope AT jeongjoo maskrcnnbasedmulticlasssegmentationmodelforendotrachealintubationusingvideolaryngoscope AT joyouhwan maskrcnnbasedmulticlasssegmentationmodelforendotrachealintubationusingvideolaryngoscope AT songkyoungjun maskrcnnbasedmulticlasssegmentationmodelforendotrachealintubationusingvideolaryngoscope AT kimyujin maskrcnnbasedmulticlasssegmentationmodelforendotrachealintubationusingvideolaryngoscope AT kimsungwan maskrcnnbasedmulticlasssegmentationmodelforendotrachealintubationusingvideolaryngoscope |