Cargando…

Swin transformer-based GAN for multi-modal medical image translation

Medical image-to-image translation is considered a new direction with many potential applications in the medical field. The medical image-to-image translation is dominated by two models, including supervised Pix2Pix and unsupervised cyclic-consistency generative adversarial network (GAN). However, e...

Descripción completa

Detalles Bibliográficos
Autores principales: Yan, Shouang, Wang, Chengyan, Chen, Weibo, Lyu, Jun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9395186/
https://www.ncbi.nlm.nih.gov/pubmed/36003791
http://dx.doi.org/10.3389/fonc.2022.942511
_version_ 1784771633256333312
author Yan, Shouang
Wang, Chengyan
Chen, Weibo
Lyu, Jun
author_facet Yan, Shouang
Wang, Chengyan
Chen, Weibo
Lyu, Jun
author_sort Yan, Shouang
collection PubMed
description Medical image-to-image translation is considered a new direction with many potential applications in the medical field. The medical image-to-image translation is dominated by two models, including supervised Pix2Pix and unsupervised cyclic-consistency generative adversarial network (GAN). However, existing methods still have two shortcomings: 1) the Pix2Pix requires paired and pixel-aligned images, which are difficult to acquire. Nevertheless, the optimum output of the cycle-consistency model may not be unique. 2) They are still deficient in capturing the global features and modeling long-distance interactions, which are critical for regions with complex anatomical structures. We propose a Swin Transformer-based GAN for Multi-Modal Medical Image Translation, named MMTrans. Specifically, MMTrans consists of a generator, a registration network, and a discriminator. The Swin Transformer-based generator enables to generate images with the same content as source modality images and similar style information of target modality images. The encoder part of the registration network, based on Swin Transformer, is utilized to predict deformable vector fields. The convolution-based discriminator determines whether the target modality images are similar to the generator or from the real images. Extensive experiments conducted using the public dataset and clinical datasets showed that our network outperformed other advanced medical image translation methods in both aligned and unpaired datasets and has great potential to be applied in clinical applications.
format Online
Article
Text
id pubmed-9395186
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-93951862022-08-23 Swin transformer-based GAN for multi-modal medical image translation Yan, Shouang Wang, Chengyan Chen, Weibo Lyu, Jun Front Oncol Oncology Medical image-to-image translation is considered a new direction with many potential applications in the medical field. The medical image-to-image translation is dominated by two models, including supervised Pix2Pix and unsupervised cyclic-consistency generative adversarial network (GAN). However, existing methods still have two shortcomings: 1) the Pix2Pix requires paired and pixel-aligned images, which are difficult to acquire. Nevertheless, the optimum output of the cycle-consistency model may not be unique. 2) They are still deficient in capturing the global features and modeling long-distance interactions, which are critical for regions with complex anatomical structures. We propose a Swin Transformer-based GAN for Multi-Modal Medical Image Translation, named MMTrans. Specifically, MMTrans consists of a generator, a registration network, and a discriminator. The Swin Transformer-based generator enables to generate images with the same content as source modality images and similar style information of target modality images. The encoder part of the registration network, based on Swin Transformer, is utilized to predict deformable vector fields. The convolution-based discriminator determines whether the target modality images are similar to the generator or from the real images. Extensive experiments conducted using the public dataset and clinical datasets showed that our network outperformed other advanced medical image translation methods in both aligned and unpaired datasets and has great potential to be applied in clinical applications. Frontiers Media S.A. 2022-08-08 /pmc/articles/PMC9395186/ /pubmed/36003791 http://dx.doi.org/10.3389/fonc.2022.942511 Text en Copyright © 2022 Yan, Wang, Chen and Lyu https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Oncology
Yan, Shouang
Wang, Chengyan
Chen, Weibo
Lyu, Jun
Swin transformer-based GAN for multi-modal medical image translation
title Swin transformer-based GAN for multi-modal medical image translation
title_full Swin transformer-based GAN for multi-modal medical image translation
title_fullStr Swin transformer-based GAN for multi-modal medical image translation
title_full_unstemmed Swin transformer-based GAN for multi-modal medical image translation
title_short Swin transformer-based GAN for multi-modal medical image translation
title_sort swin transformer-based gan for multi-modal medical image translation
topic Oncology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9395186/
https://www.ncbi.nlm.nih.gov/pubmed/36003791
http://dx.doi.org/10.3389/fonc.2022.942511
work_keys_str_mv AT yanshouang swintransformerbasedganformultimodalmedicalimagetranslation
AT wangchengyan swintransformerbasedganformultimodalmedicalimagetranslation
AT chenweibo swintransformerbasedganformultimodalmedicalimagetranslation
AT lyujun swintransformerbasedganformultimodalmedicalimagetranslation