Cargando…

A Semantic Segment Encoder (SSE): Improving human face inversion quality through minimized learning space

Recently, Generative Adversarial Networks (GAN) has been greatly developed and widely used in image synthesis. A Style-Based Generator Architecture for Generative Adversarial Networks (StyleGAN) which is the foremost, continues to develop human face inversion domain. StyleGAN uses insufficient vecto...

Descripción completa

Detalles Bibliográficos
Autores principales:	Kang, Byungseok, Jo, Youngjae
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2023
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10697537/ http://dx.doi.org/10.1371/journal.pone.0295316

_version_	1785154770389958656
author	Kang, Byungseok Jo, Youngjae
author_facet	Kang, Byungseok Jo, Youngjae
author_sort	Kang, Byungseok
collection	PubMed
description	Recently, Generative Adversarial Networks (GAN) has been greatly developed and widely used in image synthesis. A Style-Based Generator Architecture for Generative Adversarial Networks (StyleGAN) which is the foremost, continues to develop human face inversion domain. StyleGAN uses insufficient vector space to express more than one million pixels. It is difficult to apply in real business due to distortion-edit tradeoff problem in latent space. To overcome this, we propose a novel semantic segment encoder (SSE) with improved face inversion quality by narrowing the size of restoration latent space. Encoder’s learning area is minimized to logical semantic-segment units that can be recognized by humans. The proposed encoder does not affect other segments because only one segment is edited at a time. To verify the face inversion quality, we compared with the latest encoders both Pixel2style2Pixel and RestyleEncoder. Experimental result shows that the proposed encoder improved distortion quality around 20% while maintain editing performance.
format	Online Article Text
id	pubmed-10697537
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-106975372023-12-06 A Semantic Segment Encoder (SSE): Improving human face inversion quality through minimized learning space Kang, Byungseok Jo, Youngjae PLoS One Research Article Recently, Generative Adversarial Networks (GAN) has been greatly developed and widely used in image synthesis. A Style-Based Generator Architecture for Generative Adversarial Networks (StyleGAN) which is the foremost, continues to develop human face inversion domain. StyleGAN uses insufficient vector space to express more than one million pixels. It is difficult to apply in real business due to distortion-edit tradeoff problem in latent space. To overcome this, we propose a novel semantic segment encoder (SSE) with improved face inversion quality by narrowing the size of restoration latent space. Encoder’s learning area is minimized to logical semantic-segment units that can be recognized by humans. The proposed encoder does not affect other segments because only one segment is edited at a time. To verify the face inversion quality, we compared with the latest encoders both Pixel2style2Pixel and RestyleEncoder. Experimental result shows that the proposed encoder improved distortion quality around 20% while maintain editing performance. Public Library of Science 2023-12-05 /pmc/articles/PMC10697537/ http://dx.doi.org/10.1371/journal.pone.0295316 Text en © 2023 Kang, Jo https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Kang, Byungseok Jo, Youngjae A Semantic Segment Encoder (SSE): Improving human face inversion quality through minimized learning space
title	A Semantic Segment Encoder (SSE): Improving human face inversion quality through minimized learning space
title_full	A Semantic Segment Encoder (SSE): Improving human face inversion quality through minimized learning space
title_fullStr	A Semantic Segment Encoder (SSE): Improving human face inversion quality through minimized learning space
title_full_unstemmed	A Semantic Segment Encoder (SSE): Improving human face inversion quality through minimized learning space
title_short	A Semantic Segment Encoder (SSE): Improving human face inversion quality through minimized learning space
title_sort	semantic segment encoder (sse): improving human face inversion quality through minimized learning space
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10697537/ http://dx.doi.org/10.1371/journal.pone.0295316
work_keys_str_mv	AT kangbyungseok asemanticsegmentencodersseimprovinghumanfaceinversionqualitythroughminimizedlearningspace AT joyoungjae asemanticsegmentencodersseimprovinghumanfaceinversionqualitythroughminimizedlearningspace AT kangbyungseok semanticsegmentencodersseimprovinghumanfaceinversionqualitythroughminimizedlearningspace AT joyoungjae semanticsegmentencodersseimprovinghumanfaceinversionqualitythroughminimizedlearningspace

A Semantic Segment Encoder (SSE): Improving human face inversion quality through minimized learning space

Ejemplares similares