Cargando…

Hint-Based Image Colorization Based on Hierarchical Vision Transformer

Hint-based image colorization is an image-to-image translation task that aims at creating a full-color image from an input luminance image when a small set of color values for some pixels are given as hints. Though traditional deep-learning-based methods have been proposed in the literature, they ar...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Subin, Jung, Yong Ju
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9570951/
https://www.ncbi.nlm.nih.gov/pubmed/36236517
http://dx.doi.org/10.3390/s22197419
_version_ 1784810240416415744
author Lee, Subin
Jung, Yong Ju
author_facet Lee, Subin
Jung, Yong Ju
author_sort Lee, Subin
collection PubMed
description Hint-based image colorization is an image-to-image translation task that aims at creating a full-color image from an input luminance image when a small set of color values for some pixels are given as hints. Though traditional deep-learning-based methods have been proposed in the literature, they are based on convolution neural networks (CNNs) that have strong spatial locality due to the convolution operations. This often causes non-trivial visual artifacts in the colorization results, such as false color and color bleeding artifacts. To overcome this limitation, this study proposes a vision transformer-based colorization network. The proposed hint-based colorization network has a hierarchical vision transformer architecture in the form of an encoder-decoder structure based on transformer blocks. As the proposed method uses the transformer blocks that can learn rich long-range dependency, it can achieve visually plausible colorization results, even with a small number of color hints. Through the verification experiments, the results reveal that the proposed transformer model outperforms the conventional CNN-based models. In addition, we qualitatively analyze the effect of the long-range dependency of the transformer model on hint-based image colorization.
format Online
Article
Text
id pubmed-9570951
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-95709512022-10-17 Hint-Based Image Colorization Based on Hierarchical Vision Transformer Lee, Subin Jung, Yong Ju Sensors (Basel) Article Hint-based image colorization is an image-to-image translation task that aims at creating a full-color image from an input luminance image when a small set of color values for some pixels are given as hints. Though traditional deep-learning-based methods have been proposed in the literature, they are based on convolution neural networks (CNNs) that have strong spatial locality due to the convolution operations. This often causes non-trivial visual artifacts in the colorization results, such as false color and color bleeding artifacts. To overcome this limitation, this study proposes a vision transformer-based colorization network. The proposed hint-based colorization network has a hierarchical vision transformer architecture in the form of an encoder-decoder structure based on transformer blocks. As the proposed method uses the transformer blocks that can learn rich long-range dependency, it can achieve visually plausible colorization results, even with a small number of color hints. Through the verification experiments, the results reveal that the proposed transformer model outperforms the conventional CNN-based models. In addition, we qualitatively analyze the effect of the long-range dependency of the transformer model on hint-based image colorization. MDPI 2022-09-29 /pmc/articles/PMC9570951/ /pubmed/36236517 http://dx.doi.org/10.3390/s22197419 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Lee, Subin
Jung, Yong Ju
Hint-Based Image Colorization Based on Hierarchical Vision Transformer
title Hint-Based Image Colorization Based on Hierarchical Vision Transformer
title_full Hint-Based Image Colorization Based on Hierarchical Vision Transformer
title_fullStr Hint-Based Image Colorization Based on Hierarchical Vision Transformer
title_full_unstemmed Hint-Based Image Colorization Based on Hierarchical Vision Transformer
title_short Hint-Based Image Colorization Based on Hierarchical Vision Transformer
title_sort hint-based image colorization based on hierarchical vision transformer
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9570951/
https://www.ncbi.nlm.nih.gov/pubmed/36236517
http://dx.doi.org/10.3390/s22197419
work_keys_str_mv AT leesubin hintbasedimagecolorizationbasedonhierarchicalvisiontransformer
AT jungyongju hintbasedimagecolorizationbasedonhierarchicalvisiontransformer