Cargando…

Exploiting an Intermediate Latent Space between Photo and Sketch for Face Photo-Sketch Recognition †

The photo-sketch matching problem is challenging because the modality gap between a photo and a sketch is very large. This work features a novel approach to the use of an intermediate latent space between the two modalities that circumvents the problem of modality gap for face photo-sketch recogniti...

Descripción completa

Detalles Bibliográficos
Autores principales:	Bae, Seho, Din, Nizam Ud, Park, Hyunkyu, Yi, Juneho
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9570829/ https://www.ncbi.nlm.nih.gov/pubmed/36236398 http://dx.doi.org/10.3390/s22197299

_version_	1784810208118177792
author	Bae, Seho Din, Nizam Ud Park, Hyunkyu Yi, Juneho
author_facet	Bae, Seho Din, Nizam Ud Park, Hyunkyu Yi, Juneho
author_sort	Bae, Seho
collection	PubMed
description	The photo-sketch matching problem is challenging because the modality gap between a photo and a sketch is very large. This work features a novel approach to the use of an intermediate latent space between the two modalities that circumvents the problem of modality gap for face photo-sketch recognition. To set up a stable homogenous latent space between a photo and a sketch that is effective for matching, we utilize a bidirectional (photo → sketch and sketch → photo) collaborative synthesis network and equip the latent space with rich representation power. To provide rich representation power, we employ StyleGAN architectures, such as StyleGAN and StyleGAN2. The proposed latent space equipped with rich representation power enables us to conduct accurate matching because we can effectively align the distributions of the two modalities in the latent space. In addition, to resolve the problem of insufficient paired photo/sketch samples for training, we introduce a three-step training scheme. Extensive evaluation on a public composite face sketch database confirms superior performance of the proposed approach compared to existing state-of-the-art methods. The proposed methodology can be employed in matching other modality pairs.
format	Online Article Text
id	pubmed-9570829
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-95708292022-10-17 Exploiting an Intermediate Latent Space between Photo and Sketch for Face Photo-Sketch Recognition † Bae, Seho Din, Nizam Ud Park, Hyunkyu Yi, Juneho Sensors (Basel) Article The photo-sketch matching problem is challenging because the modality gap between a photo and a sketch is very large. This work features a novel approach to the use of an intermediate latent space between the two modalities that circumvents the problem of modality gap for face photo-sketch recognition. To set up a stable homogenous latent space between a photo and a sketch that is effective for matching, we utilize a bidirectional (photo → sketch and sketch → photo) collaborative synthesis network and equip the latent space with rich representation power. To provide rich representation power, we employ StyleGAN architectures, such as StyleGAN and StyleGAN2. The proposed latent space equipped with rich representation power enables us to conduct accurate matching because we can effectively align the distributions of the two modalities in the latent space. In addition, to resolve the problem of insufficient paired photo/sketch samples for training, we introduce a three-step training scheme. Extensive evaluation on a public composite face sketch database confirms superior performance of the proposed approach compared to existing state-of-the-art methods. The proposed methodology can be employed in matching other modality pairs. MDPI 2022-09-26 /pmc/articles/PMC9570829/ /pubmed/36236398 http://dx.doi.org/10.3390/s22197299 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Bae, Seho Din, Nizam Ud Park, Hyunkyu Yi, Juneho Exploiting an Intermediate Latent Space between Photo and Sketch for Face Photo-Sketch Recognition †
title	Exploiting an Intermediate Latent Space between Photo and Sketch for Face Photo-Sketch Recognition †
title_full	Exploiting an Intermediate Latent Space between Photo and Sketch for Face Photo-Sketch Recognition †
title_fullStr	Exploiting an Intermediate Latent Space between Photo and Sketch for Face Photo-Sketch Recognition †
title_full_unstemmed	Exploiting an Intermediate Latent Space between Photo and Sketch for Face Photo-Sketch Recognition †
title_short	Exploiting an Intermediate Latent Space between Photo and Sketch for Face Photo-Sketch Recognition †
title_sort	exploiting an intermediate latent space between photo and sketch for face photo-sketch recognition †
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9570829/ https://www.ncbi.nlm.nih.gov/pubmed/36236398 http://dx.doi.org/10.3390/s22197299
work_keys_str_mv	AT baeseho exploitinganintermediatelatentspacebetweenphotoandsketchforfacephotosketchrecognition AT dinnizamud exploitinganintermediatelatentspacebetweenphotoandsketchforfacephotosketchrecognition AT parkhyunkyu exploitinganintermediatelatentspacebetweenphotoandsketchforfacephotosketchrecognition AT yijuneho exploitinganintermediatelatentspacebetweenphotoandsketchforfacephotosketchrecognition

Exploiting an Intermediate Latent Space between Photo and Sketch for Face Photo-Sketch Recognition †

Ejemplares similares