Cargando…

Hint-Based Image Colorization Based on Hierarchical Vision Transformer

Hint-based image colorization is an image-to-image translation task that aims at creating a full-color image from an input luminance image when a small set of color values for some pixels are given as hints. Though traditional deep-learning-based methods have been proposed in the literature, they ar...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lee, Subin, Jung, Yong Ju
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9570951/ https://www.ncbi.nlm.nih.gov/pubmed/36236517 http://dx.doi.org/10.3390/s22197419

_version_	1784810240416415744
author	Lee, Subin Jung, Yong Ju
author_facet	Lee, Subin Jung, Yong Ju
author_sort	Lee, Subin
collection	PubMed
description	Hint-based image colorization is an image-to-image translation task that aims at creating a full-color image from an input luminance image when a small set of color values for some pixels are given as hints. Though traditional deep-learning-based methods have been proposed in the literature, they are based on convolution neural networks (CNNs) that have strong spatial locality due to the convolution operations. This often causes non-trivial visual artifacts in the colorization results, such as false color and color bleeding artifacts. To overcome this limitation, this study proposes a vision transformer-based colorization network. The proposed hint-based colorization network has a hierarchical vision transformer architecture in the form of an encoder-decoder structure based on transformer blocks. As the proposed method uses the transformer blocks that can learn rich long-range dependency, it can achieve visually plausible colorization results, even with a small number of color hints. Through the verification experiments, the results reveal that the proposed transformer model outperforms the conventional CNN-based models. In addition, we qualitatively analyze the effect of the long-range dependency of the transformer model on hint-based image colorization.
format	Online Article Text
id	pubmed-9570951
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-95709512022-10-17 Hint-Based Image Colorization Based on Hierarchical Vision Transformer Lee, Subin Jung, Yong Ju Sensors (Basel) Article Hint-based image colorization is an image-to-image translation task that aims at creating a full-color image from an input luminance image when a small set of color values for some pixels are given as hints. Though traditional deep-learning-based methods have been proposed in the literature, they are based on convolution neural networks (CNNs) that have strong spatial locality due to the convolution operations. This often causes non-trivial visual artifacts in the colorization results, such as false color and color bleeding artifacts. To overcome this limitation, this study proposes a vision transformer-based colorization network. The proposed hint-based colorization network has a hierarchical vision transformer architecture in the form of an encoder-decoder structure based on transformer blocks. As the proposed method uses the transformer blocks that can learn rich long-range dependency, it can achieve visually plausible colorization results, even with a small number of color hints. Through the verification experiments, the results reveal that the proposed transformer model outperforms the conventional CNN-based models. In addition, we qualitatively analyze the effect of the long-range dependency of the transformer model on hint-based image colorization. MDPI 2022-09-29 /pmc/articles/PMC9570951/ /pubmed/36236517 http://dx.doi.org/10.3390/s22197419 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Lee, Subin Jung, Yong Ju Hint-Based Image Colorization Based on Hierarchical Vision Transformer
title	Hint-Based Image Colorization Based on Hierarchical Vision Transformer
title_full	Hint-Based Image Colorization Based on Hierarchical Vision Transformer
title_fullStr	Hint-Based Image Colorization Based on Hierarchical Vision Transformer
title_full_unstemmed	Hint-Based Image Colorization Based on Hierarchical Vision Transformer
title_short	Hint-Based Image Colorization Based on Hierarchical Vision Transformer
title_sort	hint-based image colorization based on hierarchical vision transformer
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9570951/ https://www.ncbi.nlm.nih.gov/pubmed/36236517 http://dx.doi.org/10.3390/s22197419
work_keys_str_mv	AT leesubin hintbasedimagecolorizationbasedonhierarchicalvisiontransformer AT jungyongju hintbasedimagecolorizationbasedonhierarchicalvisiontransformer

Hint-Based Image Colorization Based on Hierarchical Vision Transformer

Ejemplares similares