Cargando…

TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos

PURPOSE: Automatic recognition of surgical activities from intraoperative surgical videos is crucial for developing intelligent support systems for computer-assisted interventions. Current state-of-the-art recognition methods are based on deep learning where data augmentation has shown the potential...

Descripción completa

Detalles Bibliográficos
Autores principales: Ramesh, Sanat, Dall’Alba, Diego, Gonzalez, Cristians, Yu, Tong, Mascagni, Pietro, Mutter, Didier, Marescaux, Jacques, Fiorini, Paolo, Padoy, Nicolas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491694/
https://www.ncbi.nlm.nih.gov/pubmed/36944845
http://dx.doi.org/10.1007/s11548-023-02864-8
_version_ 1785104114589368320
author Ramesh, Sanat
Dall’Alba, Diego
Gonzalez, Cristians
Yu, Tong
Mascagni, Pietro
Mutter, Didier
Marescaux, Jacques
Fiorini, Paolo
Padoy, Nicolas
author_facet Ramesh, Sanat
Dall’Alba, Diego
Gonzalez, Cristians
Yu, Tong
Mascagni, Pietro
Mutter, Didier
Marescaux, Jacques
Fiorini, Paolo
Padoy, Nicolas
author_sort Ramesh, Sanat
collection PubMed
description PURPOSE: Automatic recognition of surgical activities from intraoperative surgical videos is crucial for developing intelligent support systems for computer-assisted interventions. Current state-of-the-art recognition methods are based on deep learning where data augmentation has shown the potential to improve the generalization of these methods. This has spurred work on automated and simplified augmentation strategies for image classification and object detection on datasets of still images. Extending such augmentation methods to videos is not straightforward, as the temporal dimension needs to be considered. Furthermore, surgical videos pose additional challenges as they are composed of multiple, interconnected, and long-duration activities. METHODS: This work proposes a new simplified augmentation method, called TRandAugment, specifically designed for long surgical videos, that treats each video as an assemble of temporal segments and applies consistent but random transformations to each segment. The proposed augmentation method is used to train an end-to-end spatiotemporal model consisting of a CNN (ResNet50) followed by a TCN. RESULTS: The effectiveness of the proposed method is demonstrated on two surgical video datasets, namely Bypass40 and CATARACTS, and two tasks, surgical phase and step recognition. TRandAugment adds a performance boost of 1–6% over previous state-of-the-art methods, that uses manually designed augmentations. CONCLUSION: This work presents a simplified and automated augmentation method for long surgical videos. The proposed method has been validated on different datasets and tasks indicating the importance of devising temporal augmentation methods for long surgical videos.
format Online
Article
Text
id pubmed-10491694
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-104916942023-09-10 TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos Ramesh, Sanat Dall’Alba, Diego Gonzalez, Cristians Yu, Tong Mascagni, Pietro Mutter, Didier Marescaux, Jacques Fiorini, Paolo Padoy, Nicolas Int J Comput Assist Radiol Surg Original Article PURPOSE: Automatic recognition of surgical activities from intraoperative surgical videos is crucial for developing intelligent support systems for computer-assisted interventions. Current state-of-the-art recognition methods are based on deep learning where data augmentation has shown the potential to improve the generalization of these methods. This has spurred work on automated and simplified augmentation strategies for image classification and object detection on datasets of still images. Extending such augmentation methods to videos is not straightforward, as the temporal dimension needs to be considered. Furthermore, surgical videos pose additional challenges as they are composed of multiple, interconnected, and long-duration activities. METHODS: This work proposes a new simplified augmentation method, called TRandAugment, specifically designed for long surgical videos, that treats each video as an assemble of temporal segments and applies consistent but random transformations to each segment. The proposed augmentation method is used to train an end-to-end spatiotemporal model consisting of a CNN (ResNet50) followed by a TCN. RESULTS: The effectiveness of the proposed method is demonstrated on two surgical video datasets, namely Bypass40 and CATARACTS, and two tasks, surgical phase and step recognition. TRandAugment adds a performance boost of 1–6% over previous state-of-the-art methods, that uses manually designed augmentations. CONCLUSION: This work presents a simplified and automated augmentation method for long surgical videos. The proposed method has been validated on different datasets and tasks indicating the importance of devising temporal augmentation methods for long surgical videos. Springer International Publishing 2023-03-22 2023 /pmc/articles/PMC10491694/ /pubmed/36944845 http://dx.doi.org/10.1007/s11548-023-02864-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Original Article
Ramesh, Sanat
Dall’Alba, Diego
Gonzalez, Cristians
Yu, Tong
Mascagni, Pietro
Mutter, Didier
Marescaux, Jacques
Fiorini, Paolo
Padoy, Nicolas
TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos
title TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos
title_full TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos
title_fullStr TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos
title_full_unstemmed TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos
title_short TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos
title_sort trandaugment: temporal random augmentation strategy for surgical activity recognition from videos
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491694/
https://www.ncbi.nlm.nih.gov/pubmed/36944845
http://dx.doi.org/10.1007/s11548-023-02864-8
work_keys_str_mv AT rameshsanat trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos
AT dallalbadiego trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos
AT gonzalezcristians trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos
AT yutong trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos
AT mascagnipietro trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos
AT mutterdidier trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos
AT marescauxjacques trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos
AT fiorinipaolo trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos
AT padoynicolas trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos