Cargando…
Multi-task temporal convolutional networks for joint recognition of surgical phases and steps in gastric bypass procedures
PURPOSE: Automatic segmentation and classification of surgical activity is crucial for providing advanced support in computer-assisted interventions and autonomous functionalities in robot-assisted surgeries. Prior works have focused on recognizing either coarse activities, such as phases, or fine-g...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8260406/ https://www.ncbi.nlm.nih.gov/pubmed/34013464 http://dx.doi.org/10.1007/s11548-021-02388-z |
_version_ | 1783718806619684864 |
---|---|
author | Ramesh, Sanat Dall’Alba, Diego Gonzalez, Cristians Yu, Tong Mascagni, Pietro Mutter, Didier Marescaux, Jacques Fiorini, Paolo Padoy, Nicolas |
author_facet | Ramesh, Sanat Dall’Alba, Diego Gonzalez, Cristians Yu, Tong Mascagni, Pietro Mutter, Didier Marescaux, Jacques Fiorini, Paolo Padoy, Nicolas |
author_sort | Ramesh, Sanat |
collection | PubMed |
description | PURPOSE: Automatic segmentation and classification of surgical activity is crucial for providing advanced support in computer-assisted interventions and autonomous functionalities in robot-assisted surgeries. Prior works have focused on recognizing either coarse activities, such as phases, or fine-grained activities, such as gestures. This work aims at jointly recognizing two complementary levels of granularity directly from videos, namely phases and steps. METHODS: We introduce two correlated surgical activities, phases and steps, for the laparoscopic gastric bypass procedure. We propose a multi-task multi-stage temporal convolutional network (MTMS-TCN) along with a multi-task convolutional neural network (CNN) training setup to jointly predict the phases and steps and benefit from their complementarity to better evaluate the execution of the procedure. We evaluate the proposed method on a large video dataset consisting of 40 surgical procedures (Bypass40). RESULTS: We present experimental results from several baseline models for both phase and step recognition on the Bypass40. The proposed MTMS-TCN method outperforms single-task methods in both phase and step recognition by 1-2% in accuracy, precision and recall. Furthermore, for step recognition, MTMS-TCN achieves a superior performance of 3-6% compared to LSTM-based models on all metrics. CONCLUSION: In this work, we present a multi-task multi-stage temporal convolutional network for surgical activity recognition, which shows improved results compared to single-task models on a gastric bypass dataset with multi-level annotations. The proposed method shows that the joint modeling of phases and steps is beneficial to improve the overall recognition of each type of activity. |
format | Online Article Text |
id | pubmed-8260406 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-82604062021-07-20 Multi-task temporal convolutional networks for joint recognition of surgical phases and steps in gastric bypass procedures Ramesh, Sanat Dall’Alba, Diego Gonzalez, Cristians Yu, Tong Mascagni, Pietro Mutter, Didier Marescaux, Jacques Fiorini, Paolo Padoy, Nicolas Int J Comput Assist Radiol Surg Original Article PURPOSE: Automatic segmentation and classification of surgical activity is crucial for providing advanced support in computer-assisted interventions and autonomous functionalities in robot-assisted surgeries. Prior works have focused on recognizing either coarse activities, such as phases, or fine-grained activities, such as gestures. This work aims at jointly recognizing two complementary levels of granularity directly from videos, namely phases and steps. METHODS: We introduce two correlated surgical activities, phases and steps, for the laparoscopic gastric bypass procedure. We propose a multi-task multi-stage temporal convolutional network (MTMS-TCN) along with a multi-task convolutional neural network (CNN) training setup to jointly predict the phases and steps and benefit from their complementarity to better evaluate the execution of the procedure. We evaluate the proposed method on a large video dataset consisting of 40 surgical procedures (Bypass40). RESULTS: We present experimental results from several baseline models for both phase and step recognition on the Bypass40. The proposed MTMS-TCN method outperforms single-task methods in both phase and step recognition by 1-2% in accuracy, precision and recall. Furthermore, for step recognition, MTMS-TCN achieves a superior performance of 3-6% compared to LSTM-based models on all metrics. CONCLUSION: In this work, we present a multi-task multi-stage temporal convolutional network for surgical activity recognition, which shows improved results compared to single-task models on a gastric bypass dataset with multi-level annotations. The proposed method shows that the joint modeling of phases and steps is beneficial to improve the overall recognition of each type of activity. Springer International Publishing 2021-05-19 2021 /pmc/articles/PMC8260406/ /pubmed/34013464 http://dx.doi.org/10.1007/s11548-021-02388-z Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Original Article Ramesh, Sanat Dall’Alba, Diego Gonzalez, Cristians Yu, Tong Mascagni, Pietro Mutter, Didier Marescaux, Jacques Fiorini, Paolo Padoy, Nicolas Multi-task temporal convolutional networks for joint recognition of surgical phases and steps in gastric bypass procedures |
title | Multi-task temporal convolutional networks for joint recognition of surgical phases and steps in gastric bypass procedures |
title_full | Multi-task temporal convolutional networks for joint recognition of surgical phases and steps in gastric bypass procedures |
title_fullStr | Multi-task temporal convolutional networks for joint recognition of surgical phases and steps in gastric bypass procedures |
title_full_unstemmed | Multi-task temporal convolutional networks for joint recognition of surgical phases and steps in gastric bypass procedures |
title_short | Multi-task temporal convolutional networks for joint recognition of surgical phases and steps in gastric bypass procedures |
title_sort | multi-task temporal convolutional networks for joint recognition of surgical phases and steps in gastric bypass procedures |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8260406/ https://www.ncbi.nlm.nih.gov/pubmed/34013464 http://dx.doi.org/10.1007/s11548-021-02388-z |
work_keys_str_mv | AT rameshsanat multitasktemporalconvolutionalnetworksforjointrecognitionofsurgicalphasesandstepsingastricbypassprocedures AT dallalbadiego multitasktemporalconvolutionalnetworksforjointrecognitionofsurgicalphasesandstepsingastricbypassprocedures AT gonzalezcristians multitasktemporalconvolutionalnetworksforjointrecognitionofsurgicalphasesandstepsingastricbypassprocedures AT yutong multitasktemporalconvolutionalnetworksforjointrecognitionofsurgicalphasesandstepsingastricbypassprocedures AT mascagnipietro multitasktemporalconvolutionalnetworksforjointrecognitionofsurgicalphasesandstepsingastricbypassprocedures AT mutterdidier multitasktemporalconvolutionalnetworksforjointrecognitionofsurgicalphasesandstepsingastricbypassprocedures AT marescauxjacques multitasktemporalconvolutionalnetworksforjointrecognitionofsurgicalphasesandstepsingastricbypassprocedures AT fiorinipaolo multitasktemporalconvolutionalnetworksforjointrecognitionofsurgicalphasesandstepsingastricbypassprocedures AT padoynicolas multitasktemporalconvolutionalnetworksforjointrecognitionofsurgicalphasesandstepsingastricbypassprocedures |