Cargando…
TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos
PURPOSE: Automatic recognition of surgical activities from intraoperative surgical videos is crucial for developing intelligent support systems for computer-assisted interventions. Current state-of-the-art recognition methods are based on deep learning where data augmentation has shown the potential...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491694/ https://www.ncbi.nlm.nih.gov/pubmed/36944845 http://dx.doi.org/10.1007/s11548-023-02864-8 |
_version_ | 1785104114589368320 |
---|---|
author | Ramesh, Sanat Dall’Alba, Diego Gonzalez, Cristians Yu, Tong Mascagni, Pietro Mutter, Didier Marescaux, Jacques Fiorini, Paolo Padoy, Nicolas |
author_facet | Ramesh, Sanat Dall’Alba, Diego Gonzalez, Cristians Yu, Tong Mascagni, Pietro Mutter, Didier Marescaux, Jacques Fiorini, Paolo Padoy, Nicolas |
author_sort | Ramesh, Sanat |
collection | PubMed |
description | PURPOSE: Automatic recognition of surgical activities from intraoperative surgical videos is crucial for developing intelligent support systems for computer-assisted interventions. Current state-of-the-art recognition methods are based on deep learning where data augmentation has shown the potential to improve the generalization of these methods. This has spurred work on automated and simplified augmentation strategies for image classification and object detection on datasets of still images. Extending such augmentation methods to videos is not straightforward, as the temporal dimension needs to be considered. Furthermore, surgical videos pose additional challenges as they are composed of multiple, interconnected, and long-duration activities. METHODS: This work proposes a new simplified augmentation method, called TRandAugment, specifically designed for long surgical videos, that treats each video as an assemble of temporal segments and applies consistent but random transformations to each segment. The proposed augmentation method is used to train an end-to-end spatiotemporal model consisting of a CNN (ResNet50) followed by a TCN. RESULTS: The effectiveness of the proposed method is demonstrated on two surgical video datasets, namely Bypass40 and CATARACTS, and two tasks, surgical phase and step recognition. TRandAugment adds a performance boost of 1–6% over previous state-of-the-art methods, that uses manually designed augmentations. CONCLUSION: This work presents a simplified and automated augmentation method for long surgical videos. The proposed method has been validated on different datasets and tasks indicating the importance of devising temporal augmentation methods for long surgical videos. |
format | Online Article Text |
id | pubmed-10491694 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-104916942023-09-10 TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos Ramesh, Sanat Dall’Alba, Diego Gonzalez, Cristians Yu, Tong Mascagni, Pietro Mutter, Didier Marescaux, Jacques Fiorini, Paolo Padoy, Nicolas Int J Comput Assist Radiol Surg Original Article PURPOSE: Automatic recognition of surgical activities from intraoperative surgical videos is crucial for developing intelligent support systems for computer-assisted interventions. Current state-of-the-art recognition methods are based on deep learning where data augmentation has shown the potential to improve the generalization of these methods. This has spurred work on automated and simplified augmentation strategies for image classification and object detection on datasets of still images. Extending such augmentation methods to videos is not straightforward, as the temporal dimension needs to be considered. Furthermore, surgical videos pose additional challenges as they are composed of multiple, interconnected, and long-duration activities. METHODS: This work proposes a new simplified augmentation method, called TRandAugment, specifically designed for long surgical videos, that treats each video as an assemble of temporal segments and applies consistent but random transformations to each segment. The proposed augmentation method is used to train an end-to-end spatiotemporal model consisting of a CNN (ResNet50) followed by a TCN. RESULTS: The effectiveness of the proposed method is demonstrated on two surgical video datasets, namely Bypass40 and CATARACTS, and two tasks, surgical phase and step recognition. TRandAugment adds a performance boost of 1–6% over previous state-of-the-art methods, that uses manually designed augmentations. CONCLUSION: This work presents a simplified and automated augmentation method for long surgical videos. The proposed method has been validated on different datasets and tasks indicating the importance of devising temporal augmentation methods for long surgical videos. Springer International Publishing 2023-03-22 2023 /pmc/articles/PMC10491694/ /pubmed/36944845 http://dx.doi.org/10.1007/s11548-023-02864-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Original Article Ramesh, Sanat Dall’Alba, Diego Gonzalez, Cristians Yu, Tong Mascagni, Pietro Mutter, Didier Marescaux, Jacques Fiorini, Paolo Padoy, Nicolas TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos |
title | TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos |
title_full | TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos |
title_fullStr | TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos |
title_full_unstemmed | TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos |
title_short | TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos |
title_sort | trandaugment: temporal random augmentation strategy for surgical activity recognition from videos |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491694/ https://www.ncbi.nlm.nih.gov/pubmed/36944845 http://dx.doi.org/10.1007/s11548-023-02864-8 |
work_keys_str_mv | AT rameshsanat trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos AT dallalbadiego trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos AT gonzalezcristians trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos AT yutong trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos AT mascagnipietro trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos AT mutterdidier trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos AT marescauxjacques trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos AT fiorinipaolo trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos AT padoynicolas trandaugmenttemporalrandomaugmentationstrategyforsurgicalactivityrecognitionfromvideos |