Cargando…

Style attention based global-local aware GAN for personalized facial caricature generation

INTRODUCTION: Caricature is an exaggerated pictorial representation of a person, which is widely used in entertainment and political media. Recently, GAN-based methods achieved automatic caricature generation through transferring caricature style and performing shape exaggeration simultaneously. How...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Xiuzhi, Chen, Wenting, Xie, Weicheng, Shen, Linlin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10027787/
https://www.ncbi.nlm.nih.gov/pubmed/36960177
http://dx.doi.org/10.3389/fnins.2023.1136416
_version_ 1784909785719635968
author Zhao, Xiuzhi
Chen, Wenting
Xie, Weicheng
Shen, Linlin
author_facet Zhao, Xiuzhi
Chen, Wenting
Xie, Weicheng
Shen, Linlin
author_sort Zhao, Xiuzhi
collection PubMed
description INTRODUCTION: Caricature is an exaggerated pictorial representation of a person, which is widely used in entertainment and political media. Recently, GAN-based methods achieved automatic caricature generation through transferring caricature style and performing shape exaggeration simultaneously. However, the caricature synthesized by these methods cannot perfectly reflect the characteristics of the subject, whose shape exaggeration are not reasonable and requires facial landmarks of caricature. In addition, the existing methods always produce the bad cases in caricature style due to the simpleness of their style transfer method. METHODS: In this paper, we propose a Style Attention based Global-local Aware GAN to apply the characteristics of a subject to generate personalized caricature. To integrate the facial characteristics of a subject, we introduce a landmark-based warp controller for personalized shape exaggeration, which employs the facial landmarks as control points to warp image according to its facial features, without requirement of the facial landmarks of caricature. To fuse the facial feature with caricature style appropriately, we introduce a style-attention module, which adopts an attention mechanism, instead of the simple Adaptive Instance Normalization (AdaIN) for style transfer. To reduce the bad cases and increase the quality of generated caricatures, we propose a multi-scale discriminator to both globally and locally discriminate the synthesized and real caricature, which improves the whole structure and realistic details of the synthesized caricature. RESULTS: Experimental results on two publicly available datasets, the WebCaricature and the CaVINet datasets, validate the effectiveness of our proposed method and suggest that our proposed method achieves better performance than the existing methods. DISCUSSION: The caricatures generated by the proposed method can not only preserve the identity of input photo but also the characteristic shape exaggeration for each person, which are highly close to the real caricatures drawn by real artists. It indicates that our method can be adopted in the real application.
format Online
Article
Text
id pubmed-10027787
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-100277872023-03-22 Style attention based global-local aware GAN for personalized facial caricature generation Zhao, Xiuzhi Chen, Wenting Xie, Weicheng Shen, Linlin Front Neurosci Neuroscience INTRODUCTION: Caricature is an exaggerated pictorial representation of a person, which is widely used in entertainment and political media. Recently, GAN-based methods achieved automatic caricature generation through transferring caricature style and performing shape exaggeration simultaneously. However, the caricature synthesized by these methods cannot perfectly reflect the characteristics of the subject, whose shape exaggeration are not reasonable and requires facial landmarks of caricature. In addition, the existing methods always produce the bad cases in caricature style due to the simpleness of their style transfer method. METHODS: In this paper, we propose a Style Attention based Global-local Aware GAN to apply the characteristics of a subject to generate personalized caricature. To integrate the facial characteristics of a subject, we introduce a landmark-based warp controller for personalized shape exaggeration, which employs the facial landmarks as control points to warp image according to its facial features, without requirement of the facial landmarks of caricature. To fuse the facial feature with caricature style appropriately, we introduce a style-attention module, which adopts an attention mechanism, instead of the simple Adaptive Instance Normalization (AdaIN) for style transfer. To reduce the bad cases and increase the quality of generated caricatures, we propose a multi-scale discriminator to both globally and locally discriminate the synthesized and real caricature, which improves the whole structure and realistic details of the synthesized caricature. RESULTS: Experimental results on two publicly available datasets, the WebCaricature and the CaVINet datasets, validate the effectiveness of our proposed method and suggest that our proposed method achieves better performance than the existing methods. DISCUSSION: The caricatures generated by the proposed method can not only preserve the identity of input photo but also the characteristic shape exaggeration for each person, which are highly close to the real caricatures drawn by real artists. It indicates that our method can be adopted in the real application. Frontiers Media S.A. 2023-03-07 /pmc/articles/PMC10027787/ /pubmed/36960177 http://dx.doi.org/10.3389/fnins.2023.1136416 Text en Copyright © 2023 Zhao, Chen, Xie and Shen. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Zhao, Xiuzhi
Chen, Wenting
Xie, Weicheng
Shen, Linlin
Style attention based global-local aware GAN for personalized facial caricature generation
title Style attention based global-local aware GAN for personalized facial caricature generation
title_full Style attention based global-local aware GAN for personalized facial caricature generation
title_fullStr Style attention based global-local aware GAN for personalized facial caricature generation
title_full_unstemmed Style attention based global-local aware GAN for personalized facial caricature generation
title_short Style attention based global-local aware GAN for personalized facial caricature generation
title_sort style attention based global-local aware gan for personalized facial caricature generation
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10027787/
https://www.ncbi.nlm.nih.gov/pubmed/36960177
http://dx.doi.org/10.3389/fnins.2023.1136416
work_keys_str_mv AT zhaoxiuzhi styleattentionbasedgloballocalawareganforpersonalizedfacialcaricaturegeneration
AT chenwenting styleattentionbasedgloballocalawareganforpersonalizedfacialcaricaturegeneration
AT xieweicheng styleattentionbasedgloballocalawareganforpersonalizedfacialcaricaturegeneration
AT shenlinlin styleattentionbasedgloballocalawareganforpersonalizedfacialcaricaturegeneration