Temporally consistent sequence-to-sequence translation of cataract surgeries

Bibliographic Details
Main Authors: Frisch, Yannik; Fuchs, Moritz; Mukhopadhyay, Anirban
Format: Online Article Text
Language: English
Published: Springer International Publishing 2023
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10329626/
https://www.ncbi.nlm.nih.gov/pubmed/37219806
http://dx.doi.org/10.1007/s11548-023-02925-y
_version_ 1785070059505319936
author Frisch, Yannik
Fuchs, Moritz
Mukhopadhyay, Anirban
author_facet Frisch, Yannik
Fuchs, Moritz
Mukhopadhyay, Anirban
author_sort Frisch, Yannik
collection PubMed
description PURPOSE: Image-to-image translation methods can address the lack of diversity in publicly available cataract surgery data. However, applying image-to-image translation to videos—which are frequently used in medical downstream applications—induces artifacts. Additional spatio-temporal constraints are needed to produce realistic translations and improve the temporal consistency of translated image sequences. METHODS: We introduce a motion-translation module that translates optical flows between domains to impose such constraints. We combine it with a shared latent space translation model to improve image quality. Evaluations are conducted regarding translated sequences’ image quality and temporal consistency, where we propose novel quantitative metrics for the latter. Finally, the downstream task of surgical phase classification is evaluated when retraining it with additional synthetic translated data. RESULTS: Our proposed method produces more consistent translations than state-of-the-art baselines. Moreover, it stays competitive in terms of the per-image translation quality. We further show the benefit of consistently translated cataract surgery sequences for improving the downstream task of surgical phase prediction. CONCLUSION: The proposed module increases the temporal consistency of translated sequences. Furthermore, imposed temporal constraints increase the usability of translated data in downstream tasks. This allows overcoming some of the hurdles of surgical data acquisition and annotation and enables improving models’ performance by translating between existing datasets of sequential frames. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s11548-023-02925-y.
format Online
Article
Text
id pubmed-10329626
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-10329626 2023-07-10 Temporally consistent sequence-to-sequence translation of cataract surgeries Frisch, Yannik Fuchs, Moritz Mukhopadhyay, Anirban Int J Comput Assist Radiol Surg Review Article
Springer International Publishing 2023-05-23 2023 /pmc/articles/PMC10329626/ /pubmed/37219806 http://dx.doi.org/10.1007/s11548-023-02925-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/ Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Review Article
Frisch, Yannik
Fuchs, Moritz
Mukhopadhyay, Anirban
Temporally consistent sequence-to-sequence translation of cataract surgeries
title Temporally consistent sequence-to-sequence translation of cataract surgeries
title_full Temporally consistent sequence-to-sequence translation of cataract surgeries
title_fullStr Temporally consistent sequence-to-sequence translation of cataract surgeries
title_full_unstemmed Temporally consistent sequence-to-sequence translation of cataract surgeries
title_short Temporally consistent sequence-to-sequence translation of cataract surgeries
title_sort temporally consistent sequence-to-sequence translation of cataract surgeries
topic Review Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10329626/
https://www.ncbi.nlm.nih.gov/pubmed/37219806
http://dx.doi.org/10.1007/s11548-023-02925-y
work_keys_str_mv AT frischyannik temporallyconsistentsequencetosequencetranslationofcataractsurgeries
AT fuchsmoritz temporallyconsistentsequencetosequencetranslationofcataractsurgeries
AT mukhopadhyayanirban temporallyconsistentsequencetosequencetranslationofcataractsurgeries