Cargando…

CRPGAN: Learning image-to-image translation of two unpaired images by cross-attention mechanism and parallelization strategy

Unsupervised image-to-image translation (UI2I) tasks aim to find a mapping between the source and the target domains from unpaired training data. Previous methods can not effectively capture the differences between the source and the target domain on different scales and often leads to poor quality...

Descripción completa

Detalles Bibliográficos
Autores principales:	Feng, Long, Geng, Guohua, Li, Qihang, Jiang, Yi, Li, Zhan, Li, Kang
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2023
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9821492/ https://www.ncbi.nlm.nih.gov/pubmed/36607995 http://dx.doi.org/10.1371/journal.pone.0280073

_version_	1784865710271365120
author	Feng, Long Geng, Guohua Li, Qihang Jiang, Yi Li, Zhan Li, Kang
author_facet	Feng, Long Geng, Guohua Li, Qihang Jiang, Yi Li, Zhan Li, Kang
author_sort	Feng, Long
collection	PubMed
description	Unsupervised image-to-image translation (UI2I) tasks aim to find a mapping between the source and the target domains from unpaired training data. Previous methods can not effectively capture the differences between the source and the target domain on different scales and often leads to poor quality of the generated images, noise, distortion, and other conditions that do not match human vision perception, and has high time complexity. To address this problem, we propose a multi-scale training structure and a progressive growth generator method to solve UI2I task. Our method refines the generated images from global structures to local details by adding new convolution blocks continuously and shares the network parameters in different scales and also in the same scale of network. Finally, we propose a new Cross-CBAM mechanism (CRCBAM), which uses a multi-layer spatial attention and channel attention cross structure to generate more refined style images. Experiments on our collected Opera Face, and other open datasets Summer↔Winter, Horse↔Zebra, Photo↔Van Gogh, show that the proposed algorithm is superior to other state-of-art algorithms.
format	Online Article Text
id	pubmed-9821492
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-98214922023-01-07 CRPGAN: Learning image-to-image translation of two unpaired images by cross-attention mechanism and parallelization strategy Feng, Long Geng, Guohua Li, Qihang Jiang, Yi Li, Zhan Li, Kang PLoS One Research Article Unsupervised image-to-image translation (UI2I) tasks aim to find a mapping between the source and the target domains from unpaired training data. Previous methods can not effectively capture the differences between the source and the target domain on different scales and often leads to poor quality of the generated images, noise, distortion, and other conditions that do not match human vision perception, and has high time complexity. To address this problem, we propose a multi-scale training structure and a progressive growth generator method to solve UI2I task. Our method refines the generated images from global structures to local details by adding new convolution blocks continuously and shares the network parameters in different scales and also in the same scale of network. Finally, we propose a new Cross-CBAM mechanism (CRCBAM), which uses a multi-layer spatial attention and channel attention cross structure to generate more refined style images. Experiments on our collected Opera Face, and other open datasets Summer↔Winter, Horse↔Zebra, Photo↔Van Gogh, show that the proposed algorithm is superior to other state-of-art algorithms. Public Library of Science 2023-01-06 /pmc/articles/PMC9821492/ /pubmed/36607995 http://dx.doi.org/10.1371/journal.pone.0280073 Text en © 2023 Feng et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Feng, Long Geng, Guohua Li, Qihang Jiang, Yi Li, Zhan Li, Kang CRPGAN: Learning image-to-image translation of two unpaired images by cross-attention mechanism and parallelization strategy
title	CRPGAN: Learning image-to-image translation of two unpaired images by cross-attention mechanism and parallelization strategy
title_full	CRPGAN: Learning image-to-image translation of two unpaired images by cross-attention mechanism and parallelization strategy
title_fullStr	CRPGAN: Learning image-to-image translation of two unpaired images by cross-attention mechanism and parallelization strategy
title_full_unstemmed	CRPGAN: Learning image-to-image translation of two unpaired images by cross-attention mechanism and parallelization strategy
title_short	CRPGAN: Learning image-to-image translation of two unpaired images by cross-attention mechanism and parallelization strategy
title_sort	crpgan: learning image-to-image translation of two unpaired images by cross-attention mechanism and parallelization strategy
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9821492/ https://www.ncbi.nlm.nih.gov/pubmed/36607995 http://dx.doi.org/10.1371/journal.pone.0280073
work_keys_str_mv	AT fenglong crpganlearningimagetoimagetranslationoftwounpairedimagesbycrossattentionmechanismandparallelizationstrategy AT gengguohua crpganlearningimagetoimagetranslationoftwounpairedimagesbycrossattentionmechanismandparallelizationstrategy AT liqihang crpganlearningimagetoimagetranslationoftwounpairedimagesbycrossattentionmechanismandparallelizationstrategy AT jiangyi crpganlearningimagetoimagetranslationoftwounpairedimagesbycrossattentionmechanismandparallelizationstrategy AT lizhan crpganlearningimagetoimagetranslationoftwounpairedimagesbycrossattentionmechanismandparallelizationstrategy AT likang crpganlearningimagetoimagetranslationoftwounpairedimagesbycrossattentionmechanismandparallelizationstrategy

CRPGAN: Learning image-to-image translation of two unpaired images by cross-attention mechanism and parallelization strategy

Ejemplares similares