Cargando…

TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos

PURPOSE: Automatic recognition of surgical activities from intraoperative surgical videos is crucial for developing intelligent support systems for computer-assisted interventions. Current state-of-the-art recognition methods are based on deep learning where data augmentation has shown the potential...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ramesh, Sanat, Dall’Alba, Diego, Gonzalez, Cristians, Yu, Tong, Mascagni, Pietro, Mutter, Didier, Marescaux, Jacques, Fiorini, Paolo, Padoy, Nicolas
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer International Publishing 2023
Materias:	Original Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491694/ https://www.ncbi.nlm.nih.gov/pubmed/36944845 http://dx.doi.org/10.1007/s11548-023-02864-8

_version_	1785104114589368320
author	Ramesh, Sanat Dall’Alba, Diego Gonzalez, Cristians Yu, Tong Mascagni, Pietro Mutter, Didier Marescaux, Jacques Fiorini, Paolo Padoy, Nicolas
author_facet	Ramesh, Sanat Dall’Alba, Diego Gonzalez, Cristians Yu, Tong Mascagni, Pietro Mutter, Didier Marescaux, Jacques Fiorini, Paolo Padoy, Nicolas
author_sort	Ramesh, Sanat
collection	PubMed
description	PURPOSE: Automatic recognition of surgical activities from intraoperative surgical videos is crucial for developing intelligent support systems for computer-assisted interventions. Current state-of-the-art recognition methods are based on deep learning where data augmentation has shown the potential to improve the generalization of these methods. This has spurred work on automated and simplified augmentation strategies for image classification and object detection on datasets of still images. Extending such augmentation methods to videos is not straightforward, as the temporal dimension needs to be considered. Furthermore, surgical videos pose additional challenges as they are composed of multiple, interconnected, and long-duration activities. METHODS: This work proposes a new simplified augmentation method, called TRandAugment, specifically designed for long surgical videos, that treats each video as an assemble of temporal segments and applies consistent but random transformations to each segment. The proposed augmentation method is used to train an end-to-end spatiotemporal model consisting of a CNN (ResNet50) followed by a TCN. RESULTS: The effectiveness of the proposed method is demonstrated on two surgical video datasets, namely Bypass40 and CATARACTS, and two tasks, surgical phase and step recognition. TRandAugment adds a performance boost of 1–6% over previous state-of-the-art methods, that uses manually designed augmentations. CONCLUSION: This work presents a simplified and automated augmentation method for long surgical videos. The proposed method has been validated on different datasets and tasks indicating the importance of devising temporal augmentation methods for long surgical videos.
format	Online Article Text
id	pubmed-10491694
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Springer International Publishing
record_format	MEDLINE/PubMed
spelling	pubmed-104916942023-09-10 TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos Ramesh, Sanat Dall’Alba, Diego Gonzalez, Cristians Yu, Tong Mascagni, Pietro Mutter, Didier Marescaux, Jacques Fiorini, Paolo Padoy, Nicolas Int J Comput Assist Radiol Surg Original Article PURPOSE: Automatic recognition of surgical activities from intraoperative surgical videos is crucial for developing intelligent support systems for computer-assisted interventions. Current state-of-the-art recognition methods are based on deep learning where data augmentation has shown the potential to improve the generalization of these methods. This has spurred work on automated and simplified augmentation strategies for image classification and object detection on datasets of still images. Extending such augmentation methods to videos is not straightforward, as the temporal dimension needs to be considered. Furthermore, surgical videos pose additional challenges as they are composed of multiple, interconnected, and long-duration activities. METHODS: This work proposes a new simplified augmentation method, called TRandAugment, specifically designed for long surgical videos, that treats each video as an assemble of temporal segments and applies consistent but random transformations to each segment. The proposed augmentation method is used to train an end-to-end spatiotemporal model consisting of a CNN (ResNet50) followed by a TCN. RESULTS: The effectiveness of the proposed method is demonstrated on two surgical video datasets, namely Bypass40 and CATARACTS, and two tasks, surgical phase and step recognition. TRandAugment adds a performance boost of 1–6% over previous state-of-the-art methods, that uses manually designed augmentations. CONCLUSION: This work presents a simplified and automated augmentation method for long surgical videos. The proposed method has been validated on different datasets and tasks indicating the importance of devising temporal augmentation methods for long surgical videos. Springer International Publishing 2023-03-22 2023 /pmc/articles/PMC10491694/ /pubmed/36944845 http://dx.doi.org/10.1007/s11548-023-02864-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Original Article Ramesh, Sanat Dall’Alba, Diego Gonzalez, Cristians Yu, Tong Mascagni, Pietro Mutter, Didier Marescaux, Jacques Fiorini, Paolo Padoy, Nicolas TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos
title	TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos
title_full	TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos
title_fullStr	TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos
title_full_unstemmed	TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos
title_short	TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos
title_sort	trandaugment: temporal random augmentation strategy for surgical activity recognition from videos
topic	Original Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491694/ https://www.ncbi.nlm.nih.gov/pubmed/36944845 http://dx.doi.org/10.1007/s11548-023-02864-8
work_keys_str_mv	AT rameshsanat trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos AT dallalbadiego trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos AT gonzalezcristians trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos AT yutong trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos AT mascagnipietro trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos AT mutterdidier trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos AT marescauxjacques trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos AT fiorinipaolo trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos AT padoynicolas trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos

TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos

Ejemplares similares