Cargando…

Disentangling the latent space of GANs for semantic face editing

Disentanglement research is a critical and important issue in the field of image editing. In order to perform disentangled editing on images generated by generative models, this paper presents an unsupervised, model-agnostic, two-stage trained editing framework. This work addresses the problem of di...

Descripción completa

Detalles Bibliográficos
Autores principales:	Niu, Yongjie, Zhou, Mingquan, Li, Zhan
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2023
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10602338/ https://www.ncbi.nlm.nih.gov/pubmed/37883462 http://dx.doi.org/10.1371/journal.pone.0293496

_version_	1785126380512477184
author	Niu, Yongjie Zhou, Mingquan Li, Zhan
author_facet	Niu, Yongjie Zhou, Mingquan Li, Zhan
author_sort	Niu, Yongjie
collection	PubMed
description	Disentanglement research is a critical and important issue in the field of image editing. In order to perform disentangled editing on images generated by generative models, this paper presents an unsupervised, model-agnostic, two-stage trained editing framework. This work addresses the problem of discovering interpretable, disentangled directions of edited image attributes in the latent space of generative models. This effort’s primary objective was to address the limitations discovered in previous research, mainly (a) the discovered editing directions are interpretable but significantly entangled, i.e., changes to one attribute affect the others and (b) Prior research has utilized direction discovery and direction disentanglement separately, and they can’t work synergistically. More specifically, this paper proposes a two-stage training method that discovers the editing direction with semantics, perturbs the dimension of the direction vector, adjusts it with a penalty mechanism, and makes the editing direction more disentangled. This allows easy distinguishable image editing, such as age and facial expressions in facial images. Experimentally compared to other methods, the proposed method outperforms them both qualitatively and quantitatively in terms of interpretability, disentanglement, and distinguishability of the generated images. The implementation of our method is available at https://github.com/ydniuyongjie/twoStageForFaceEdit.
format	Online Article Text
id	pubmed-10602338
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-106023382023-10-27 Disentangling the latent space of GANs for semantic face editing Niu, Yongjie Zhou, Mingquan Li, Zhan PLoS One Research Article Disentanglement research is a critical and important issue in the field of image editing. In order to perform disentangled editing on images generated by generative models, this paper presents an unsupervised, model-agnostic, two-stage trained editing framework. This work addresses the problem of discovering interpretable, disentangled directions of edited image attributes in the latent space of generative models. This effort’s primary objective was to address the limitations discovered in previous research, mainly (a) the discovered editing directions are interpretable but significantly entangled, i.e., changes to one attribute affect the others and (b) Prior research has utilized direction discovery and direction disentanglement separately, and they can’t work synergistically. More specifically, this paper proposes a two-stage training method that discovers the editing direction with semantics, perturbs the dimension of the direction vector, adjusts it with a penalty mechanism, and makes the editing direction more disentangled. This allows easy distinguishable image editing, such as age and facial expressions in facial images. Experimentally compared to other methods, the proposed method outperforms them both qualitatively and quantitatively in terms of interpretability, disentanglement, and distinguishability of the generated images. The implementation of our method is available at https://github.com/ydniuyongjie/twoStageForFaceEdit. Public Library of Science 2023-10-26 /pmc/articles/PMC10602338/ /pubmed/37883462 http://dx.doi.org/10.1371/journal.pone.0293496 Text en © 2023 Niu et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Niu, Yongjie Zhou, Mingquan Li, Zhan Disentangling the latent space of GANs for semantic face editing
title	Disentangling the latent space of GANs for semantic face editing
title_full	Disentangling the latent space of GANs for semantic face editing
title_fullStr	Disentangling the latent space of GANs for semantic face editing
title_full_unstemmed	Disentangling the latent space of GANs for semantic face editing
title_short	Disentangling the latent space of GANs for semantic face editing
title_sort	disentangling the latent space of gans for semantic face editing
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10602338/ https://www.ncbi.nlm.nih.gov/pubmed/37883462 http://dx.doi.org/10.1371/journal.pone.0293496
work_keys_str_mv	AT niuyongjie disentanglingthelatentspaceofgansforsemanticfaceediting AT zhoumingquan disentanglingthelatentspaceofgansforsemanticfaceediting AT lizhan disentanglingthelatentspaceofgansforsemanticfaceediting

Disentangling the latent space of GANs for semantic face editing

Ejemplares similares