Cargando…

Multi-task temporal convolutional networks for joint recognition of surgical phases and steps in gastric bypass procedures

PURPOSE: Automatic segmentation and classification of surgical activity is crucial for providing advanced support in computer-assisted interventions and autonomous functionalities in robot-assisted surgeries. Prior works have focused on recognizing either coarse activities, such as phases, or fine-g...

Descripción completa

Detalles Bibliográficos
Autores principales: Ramesh, Sanat, Dall’Alba, Diego, Gonzalez, Cristians, Yu, Tong, Mascagni, Pietro, Mutter, Didier, Marescaux, Jacques, Fiorini, Paolo, Padoy, Nicolas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8260406/
https://www.ncbi.nlm.nih.gov/pubmed/34013464
http://dx.doi.org/10.1007/s11548-021-02388-z
_version_ 1783718806619684864
author Ramesh, Sanat
Dall’Alba, Diego
Gonzalez, Cristians
Yu, Tong
Mascagni, Pietro
Mutter, Didier
Marescaux, Jacques
Fiorini, Paolo
Padoy, Nicolas
author_facet Ramesh, Sanat
Dall’Alba, Diego
Gonzalez, Cristians
Yu, Tong
Mascagni, Pietro
Mutter, Didier
Marescaux, Jacques
Fiorini, Paolo
Padoy, Nicolas
author_sort Ramesh, Sanat
collection PubMed
description PURPOSE: Automatic segmentation and classification of surgical activity is crucial for providing advanced support in computer-assisted interventions and autonomous functionalities in robot-assisted surgeries. Prior works have focused on recognizing either coarse activities, such as phases, or fine-grained activities, such as gestures. This work aims at jointly recognizing two complementary levels of granularity directly from videos, namely phases and steps. METHODS: We introduce two correlated surgical activities, phases and steps, for the laparoscopic gastric bypass procedure. We propose a multi-task multi-stage temporal convolutional network (MTMS-TCN) along with a multi-task convolutional neural network (CNN) training setup to jointly predict the phases and steps and benefit from their complementarity to better evaluate the execution of the procedure. We evaluate the proposed method on a large video dataset consisting of 40 surgical procedures (Bypass40). RESULTS: We present experimental results from several baseline models for both phase and step recognition on the Bypass40. The proposed MTMS-TCN method outperforms single-task methods in both phase and step recognition by 1-2% in accuracy, precision and recall. Furthermore, for step recognition, MTMS-TCN achieves a superior performance of 3-6% compared to LSTM-based models on all metrics. CONCLUSION: In this work, we present a multi-task multi-stage temporal convolutional network for surgical activity recognition, which shows improved results compared to single-task models on a gastric bypass dataset with multi-level annotations. The proposed method shows that the joint modeling of phases and steps is beneficial to improve the overall recognition of each type of activity.
format Online
Article
Text
id pubmed-8260406
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-82604062021-07-20 Multi-task temporal convolutional networks for joint recognition of surgical phases and steps in gastric bypass procedures Ramesh, Sanat Dall’Alba, Diego Gonzalez, Cristians Yu, Tong Mascagni, Pietro Mutter, Didier Marescaux, Jacques Fiorini, Paolo Padoy, Nicolas Int J Comput Assist Radiol Surg Original Article PURPOSE: Automatic segmentation and classification of surgical activity is crucial for providing advanced support in computer-assisted interventions and autonomous functionalities in robot-assisted surgeries. Prior works have focused on recognizing either coarse activities, such as phases, or fine-grained activities, such as gestures. This work aims at jointly recognizing two complementary levels of granularity directly from videos, namely phases and steps. METHODS: We introduce two correlated surgical activities, phases and steps, for the laparoscopic gastric bypass procedure. We propose a multi-task multi-stage temporal convolutional network (MTMS-TCN) along with a multi-task convolutional neural network (CNN) training setup to jointly predict the phases and steps and benefit from their complementarity to better evaluate the execution of the procedure. We evaluate the proposed method on a large video dataset consisting of 40 surgical procedures (Bypass40). RESULTS: We present experimental results from several baseline models for both phase and step recognition on the Bypass40. The proposed MTMS-TCN method outperforms single-task methods in both phase and step recognition by 1-2% in accuracy, precision and recall. Furthermore, for step recognition, MTMS-TCN achieves a superior performance of 3-6% compared to LSTM-based models on all metrics. CONCLUSION: In this work, we present a multi-task multi-stage temporal convolutional network for surgical activity recognition, which shows improved results compared to single-task models on a gastric bypass dataset with multi-level annotations. The proposed method shows that the joint modeling of phases and steps is beneficial to improve the overall recognition of each type of activity. Springer International Publishing 2021-05-19 2021 /pmc/articles/PMC8260406/ /pubmed/34013464 http://dx.doi.org/10.1007/s11548-021-02388-z Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Original Article
Ramesh, Sanat
Dall’Alba, Diego
Gonzalez, Cristians
Yu, Tong
Mascagni, Pietro
Mutter, Didier
Marescaux, Jacques
Fiorini, Paolo
Padoy, Nicolas
Multi-task temporal convolutional networks for joint recognition of surgical phases and steps in gastric bypass procedures
title Multi-task temporal convolutional networks for joint recognition of surgical phases and steps in gastric bypass procedures
title_full Multi-task temporal convolutional networks for joint recognition of surgical phases and steps in gastric bypass procedures
title_fullStr Multi-task temporal convolutional networks for joint recognition of surgical phases and steps in gastric bypass procedures
title_full_unstemmed Multi-task temporal convolutional networks for joint recognition of surgical phases and steps in gastric bypass procedures
title_short Multi-task temporal convolutional networks for joint recognition of surgical phases and steps in gastric bypass procedures
title_sort multi-task temporal convolutional networks for joint recognition of surgical phases and steps in gastric bypass procedures
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8260406/
https://www.ncbi.nlm.nih.gov/pubmed/34013464
http://dx.doi.org/10.1007/s11548-021-02388-z
work_keys_str_mv AT rameshsanat multitasktemporalconvolutionalnetworksforjointrecognitionofsurgicalphasesandstepsingastricbypassprocedures
AT dallalbadiego multitasktemporalconvolutionalnetworksforjointrecognitionofsurgicalphasesandstepsingastricbypassprocedures
AT gonzalezcristians multitasktemporalconvolutionalnetworksforjointrecognitionofsurgicalphasesandstepsingastricbypassprocedures
AT yutong multitasktemporalconvolutionalnetworksforjointrecognitionofsurgicalphasesandstepsingastricbypassprocedures
AT mascagnipietro multitasktemporalconvolutionalnetworksforjointrecognitionofsurgicalphasesandstepsingastricbypassprocedures
AT mutterdidier multitasktemporalconvolutionalnetworksforjointrecognitionofsurgicalphasesandstepsingastricbypassprocedures
AT marescauxjacques multitasktemporalconvolutionalnetworksforjointrecognitionofsurgicalphasesandstepsingastricbypassprocedures
AT fiorinipaolo multitasktemporalconvolutionalnetworksforjointrecognitionofsurgicalphasesandstepsingastricbypassprocedures
AT padoynicolas multitasktemporalconvolutionalnetworksforjointrecognitionofsurgicalphasesandstepsingastricbypassprocedures