Cargando…
Swin transformer-based GAN for multi-modal medical image translation
Medical image-to-image translation is considered a new direction with many potential applications in the medical field. The medical image-to-image translation is dominated by two models, including supervised Pix2Pix and unsupervised cyclic-consistency generative adversarial network (GAN). However, e...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9395186/ https://www.ncbi.nlm.nih.gov/pubmed/36003791 http://dx.doi.org/10.3389/fonc.2022.942511 |
_version_ | 1784771633256333312 |
---|---|
author | Yan, Shouang Wang, Chengyan Chen, Weibo Lyu, Jun |
author_facet | Yan, Shouang Wang, Chengyan Chen, Weibo Lyu, Jun |
author_sort | Yan, Shouang |
collection | PubMed |
description | Medical image-to-image translation is considered a new direction with many potential applications in the medical field. The medical image-to-image translation is dominated by two models, including supervised Pix2Pix and unsupervised cyclic-consistency generative adversarial network (GAN). However, existing methods still have two shortcomings: 1) the Pix2Pix requires paired and pixel-aligned images, which are difficult to acquire. Nevertheless, the optimum output of the cycle-consistency model may not be unique. 2) They are still deficient in capturing the global features and modeling long-distance interactions, which are critical for regions with complex anatomical structures. We propose a Swin Transformer-based GAN for Multi-Modal Medical Image Translation, named MMTrans. Specifically, MMTrans consists of a generator, a registration network, and a discriminator. The Swin Transformer-based generator enables to generate images with the same content as source modality images and similar style information of target modality images. The encoder part of the registration network, based on Swin Transformer, is utilized to predict deformable vector fields. The convolution-based discriminator determines whether the target modality images are similar to the generator or from the real images. Extensive experiments conducted using the public dataset and clinical datasets showed that our network outperformed other advanced medical image translation methods in both aligned and unpaired datasets and has great potential to be applied in clinical applications. |
format | Online Article Text |
id | pubmed-9395186 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-93951862022-08-23 Swin transformer-based GAN for multi-modal medical image translation Yan, Shouang Wang, Chengyan Chen, Weibo Lyu, Jun Front Oncol Oncology Medical image-to-image translation is considered a new direction with many potential applications in the medical field. The medical image-to-image translation is dominated by two models, including supervised Pix2Pix and unsupervised cyclic-consistency generative adversarial network (GAN). However, existing methods still have two shortcomings: 1) the Pix2Pix requires paired and pixel-aligned images, which are difficult to acquire. Nevertheless, the optimum output of the cycle-consistency model may not be unique. 2) They are still deficient in capturing the global features and modeling long-distance interactions, which are critical for regions with complex anatomical structures. We propose a Swin Transformer-based GAN for Multi-Modal Medical Image Translation, named MMTrans. Specifically, MMTrans consists of a generator, a registration network, and a discriminator. The Swin Transformer-based generator enables to generate images with the same content as source modality images and similar style information of target modality images. The encoder part of the registration network, based on Swin Transformer, is utilized to predict deformable vector fields. The convolution-based discriminator determines whether the target modality images are similar to the generator or from the real images. Extensive experiments conducted using the public dataset and clinical datasets showed that our network outperformed other advanced medical image translation methods in both aligned and unpaired datasets and has great potential to be applied in clinical applications. Frontiers Media S.A. 2022-08-08 /pmc/articles/PMC9395186/ /pubmed/36003791 http://dx.doi.org/10.3389/fonc.2022.942511 Text en Copyright © 2022 Yan, Wang, Chen and Lyu https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Oncology Yan, Shouang Wang, Chengyan Chen, Weibo Lyu, Jun Swin transformer-based GAN for multi-modal medical image translation |
title | Swin transformer-based GAN for multi-modal medical image translation |
title_full | Swin transformer-based GAN for multi-modal medical image translation |
title_fullStr | Swin transformer-based GAN for multi-modal medical image translation |
title_full_unstemmed | Swin transformer-based GAN for multi-modal medical image translation |
title_short | Swin transformer-based GAN for multi-modal medical image translation |
title_sort | swin transformer-based gan for multi-modal medical image translation |
topic | Oncology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9395186/ https://www.ncbi.nlm.nih.gov/pubmed/36003791 http://dx.doi.org/10.3389/fonc.2022.942511 |
work_keys_str_mv | AT yanshouang swintransformerbasedganformultimodalmedicalimagetranslation AT wangchengyan swintransformerbasedganformultimodalmedicalimagetranslation AT chenweibo swintransformerbasedganformultimodalmedicalimagetranslation AT lyujun swintransformerbasedganformultimodalmedicalimagetranslation |