Cargando…

A Semantic Segment Encoder (SSE): Improving human face inversion quality through minimized learning space

Recently, Generative Adversarial Networks (GAN) has been greatly developed and widely used in image synthesis. A Style-Based Generator Architecture for Generative Adversarial Networks (StyleGAN) which is the foremost, continues to develop human face inversion domain. StyleGAN uses insufficient vecto...

Descripción completa

Detalles Bibliográficos
Autores principales: Kang, Byungseok, Jo, Youngjae
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10697537/
http://dx.doi.org/10.1371/journal.pone.0295316
_version_ 1785154770389958656
author Kang, Byungseok
Jo, Youngjae
author_facet Kang, Byungseok
Jo, Youngjae
author_sort Kang, Byungseok
collection PubMed
description Recently, Generative Adversarial Networks (GAN) has been greatly developed and widely used in image synthesis. A Style-Based Generator Architecture for Generative Adversarial Networks (StyleGAN) which is the foremost, continues to develop human face inversion domain. StyleGAN uses insufficient vector space to express more than one million pixels. It is difficult to apply in real business due to distortion-edit tradeoff problem in latent space. To overcome this, we propose a novel semantic segment encoder (SSE) with improved face inversion quality by narrowing the size of restoration latent space. Encoder’s learning area is minimized to logical semantic-segment units that can be recognized by humans. The proposed encoder does not affect other segments because only one segment is edited at a time. To verify the face inversion quality, we compared with the latest encoders both Pixel2style2Pixel and RestyleEncoder. Experimental result shows that the proposed encoder improved distortion quality around 20% while maintain editing performance.
format Online
Article
Text
id pubmed-10697537
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-106975372023-12-06 A Semantic Segment Encoder (SSE): Improving human face inversion quality through minimized learning space Kang, Byungseok Jo, Youngjae PLoS One Research Article Recently, Generative Adversarial Networks (GAN) has been greatly developed and widely used in image synthesis. A Style-Based Generator Architecture for Generative Adversarial Networks (StyleGAN) which is the foremost, continues to develop human face inversion domain. StyleGAN uses insufficient vector space to express more than one million pixels. It is difficult to apply in real business due to distortion-edit tradeoff problem in latent space. To overcome this, we propose a novel semantic segment encoder (SSE) with improved face inversion quality by narrowing the size of restoration latent space. Encoder’s learning area is minimized to logical semantic-segment units that can be recognized by humans. The proposed encoder does not affect other segments because only one segment is edited at a time. To verify the face inversion quality, we compared with the latest encoders both Pixel2style2Pixel and RestyleEncoder. Experimental result shows that the proposed encoder improved distortion quality around 20% while maintain editing performance. Public Library of Science 2023-12-05 /pmc/articles/PMC10697537/ http://dx.doi.org/10.1371/journal.pone.0295316 Text en © 2023 Kang, Jo https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Kang, Byungseok
Jo, Youngjae
A Semantic Segment Encoder (SSE): Improving human face inversion quality through minimized learning space
title A Semantic Segment Encoder (SSE): Improving human face inversion quality through minimized learning space
title_full A Semantic Segment Encoder (SSE): Improving human face inversion quality through minimized learning space
title_fullStr A Semantic Segment Encoder (SSE): Improving human face inversion quality through minimized learning space
title_full_unstemmed A Semantic Segment Encoder (SSE): Improving human face inversion quality through minimized learning space
title_short A Semantic Segment Encoder (SSE): Improving human face inversion quality through minimized learning space
title_sort semantic segment encoder (sse): improving human face inversion quality through minimized learning space
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10697537/
http://dx.doi.org/10.1371/journal.pone.0295316
work_keys_str_mv AT kangbyungseok asemanticsegmentencodersseimprovinghumanfaceinversionqualitythroughminimizedlearningspace
AT joyoungjae asemanticsegmentencodersseimprovinghumanfaceinversionqualitythroughminimizedlearningspace
AT kangbyungseok semanticsegmentencodersseimprovinghumanfaceinversionqualitythroughminimizedlearningspace
AT joyoungjae semanticsegmentencodersseimprovinghumanfaceinversionqualitythroughminimizedlearningspace