Cargando…

Multi-Stage Temporal Convolutional Network with Moment Loss and Positional Encoding for Surgical Phase Recognition

In recent times, many studies concerning surgical video analysis are being conducted due to its growing importance in many medical applications. In particular, it is very important to be able to recognize the current surgical phase because the phase information can be utilized in various ways both d...

Descripción completa

Detalles Bibliográficos
Autores principales:	Park, Minyoung, Oh, Seungtaek, Jeong, Taikyeong, Yu, Sungwook
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9818879/ https://www.ncbi.nlm.nih.gov/pubmed/36611399 http://dx.doi.org/10.3390/diagnostics13010107

_version_	1784865094049464320
author	Park, Minyoung Oh, Seungtaek Jeong, Taikyeong Yu, Sungwook
author_facet	Park, Minyoung Oh, Seungtaek Jeong, Taikyeong Yu, Sungwook
author_sort	Park, Minyoung
collection	PubMed
description	In recent times, many studies concerning surgical video analysis are being conducted due to its growing importance in many medical applications. In particular, it is very important to be able to recognize the current surgical phase because the phase information can be utilized in various ways both during and after surgery. This paper proposes an efficient phase recognition network, called MomentNet, for cholecystectomy endoscopic videos. Unlike LSTM-based network, MomentNet is based on a multi-stage temporal convolutional network. Besides, to improve the phase prediction accuracy, the proposed method adopts a new loss function to supplement the general cross entropy loss function. The new loss function significantly improves the performance of the phase recognition network by constraining un-desirable phase transition and preventing over-segmentation. In addition, MomnetNet effectively applies positional encoding techniques, which are commonly applied in transformer architectures, to the multi-stage temporal convolution network. By using the positional encoding techniques, MomentNet can provide important temporal context, resulting in higher phase prediction accuracy. Furthermore, the MomentNet applies label smoothing technique to suppress overfitting and replaces the backbone network for feature extraction to further improve the network performance. As a result, the MomentNet achieves 92.31% accuracy in the phase recognition task with the Cholec80 dataset, which is 4.55% higher than that of the baseline architecture.
format	Online Article Text
id	pubmed-9818879
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-98188792023-01-07 Multi-Stage Temporal Convolutional Network with Moment Loss and Positional Encoding for Surgical Phase Recognition Park, Minyoung Oh, Seungtaek Jeong, Taikyeong Yu, Sungwook Diagnostics (Basel) Article In recent times, many studies concerning surgical video analysis are being conducted due to its growing importance in many medical applications. In particular, it is very important to be able to recognize the current surgical phase because the phase information can be utilized in various ways both during and after surgery. This paper proposes an efficient phase recognition network, called MomentNet, for cholecystectomy endoscopic videos. Unlike LSTM-based network, MomentNet is based on a multi-stage temporal convolutional network. Besides, to improve the phase prediction accuracy, the proposed method adopts a new loss function to supplement the general cross entropy loss function. The new loss function significantly improves the performance of the phase recognition network by constraining un-desirable phase transition and preventing over-segmentation. In addition, MomnetNet effectively applies positional encoding techniques, which are commonly applied in transformer architectures, to the multi-stage temporal convolution network. By using the positional encoding techniques, MomentNet can provide important temporal context, resulting in higher phase prediction accuracy. Furthermore, the MomentNet applies label smoothing technique to suppress overfitting and replaces the backbone network for feature extraction to further improve the network performance. As a result, the MomentNet achieves 92.31% accuracy in the phase recognition task with the Cholec80 dataset, which is 4.55% higher than that of the baseline architecture. MDPI 2022-12-29 /pmc/articles/PMC9818879/ /pubmed/36611399 http://dx.doi.org/10.3390/diagnostics13010107 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Park, Minyoung Oh, Seungtaek Jeong, Taikyeong Yu, Sungwook Multi-Stage Temporal Convolutional Network with Moment Loss and Positional Encoding for Surgical Phase Recognition
title	Multi-Stage Temporal Convolutional Network with Moment Loss and Positional Encoding for Surgical Phase Recognition
title_full	Multi-Stage Temporal Convolutional Network with Moment Loss and Positional Encoding for Surgical Phase Recognition
title_fullStr	Multi-Stage Temporal Convolutional Network with Moment Loss and Positional Encoding for Surgical Phase Recognition
title_full_unstemmed	Multi-Stage Temporal Convolutional Network with Moment Loss and Positional Encoding for Surgical Phase Recognition
title_short	Multi-Stage Temporal Convolutional Network with Moment Loss and Positional Encoding for Surgical Phase Recognition
title_sort	multi-stage temporal convolutional network with moment loss and positional encoding for surgical phase recognition
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9818879/ https://www.ncbi.nlm.nih.gov/pubmed/36611399 http://dx.doi.org/10.3390/diagnostics13010107
work_keys_str_mv	AT parkminyoung multistagetemporalconvolutionalnetworkwithmomentlossandpositionalencodingforsurgicalphaserecognition AT ohseungtaek multistagetemporalconvolutionalnetworkwithmomentlossandpositionalencodingforsurgicalphaserecognition AT jeongtaikyeong multistagetemporalconvolutionalnetworkwithmomentlossandpositionalencodingforsurgicalphaserecognition AT yusungwook multistagetemporalconvolutionalnetworkwithmomentlossandpositionalencodingforsurgicalphaserecognition

Multi-Stage Temporal Convolutional Network with Moment Loss and Positional Encoding for Surgical Phase Recognition

Ejemplares similares