Cargando…

Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation

In the field of intelligent vehicle technology, there is a high dependence on images captured under challenging conditions to develop robust perception algorithms. However, acquiring these images can be both time-consuming and dangerous. To address this issue, unpaired image-to-image translation mod...

Descripción completa

Detalles Bibliográficos
Autores principales:	Yang, Hanting, Carballo, Alexander, Zhang, Yuxiao, Takeda, Kazuya
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10611349/ https://www.ncbi.nlm.nih.gov/pubmed/37896492 http://dx.doi.org/10.3390/s23208398

_version_	1785128470450274304
author	Yang, Hanting Carballo, Alexander Zhang, Yuxiao Takeda, Kazuya
author_facet	Yang, Hanting Carballo, Alexander Zhang, Yuxiao Takeda, Kazuya
author_sort	Yang, Hanting
collection	PubMed
description	In the field of intelligent vehicle technology, there is a high dependence on images captured under challenging conditions to develop robust perception algorithms. However, acquiring these images can be both time-consuming and dangerous. To address this issue, unpaired image-to-image translation models offer a solution by synthesizing samples of the desired domain, thus eliminating the reliance on ground truth supervision. However, the current methods predominantly focus on single projections rather than multiple solutions, not to mention controlling the direction of generation, which creates a scope for enhancement. In this study, we propose a generative adversarial network (GAN)–based model, which incorporates both a style encoder and a content encoder, specifically designed to extract relevant information from an image. Further, we employ a decoder to reconstruct an image using these encoded features, while ensuring that the generated output remains within a permissible range by applying a self-regression module to constrain the style latent space. By modifying the hyperparameters, we can generate controllable outputs with specific style codes. We evaluate the performance of our model by generating snow scenes on the Cityscapes and the EuroCity Persons datasets. The results reveal the effectiveness of our proposed methodology, thereby reinforcing the benefits of our approach in the ongoing evolution of intelligent vehicle technology.
format	Online Article Text
id	pubmed-10611349
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-106113492023-10-28 Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation Yang, Hanting Carballo, Alexander Zhang, Yuxiao Takeda, Kazuya Sensors (Basel) Article In the field of intelligent vehicle technology, there is a high dependence on images captured under challenging conditions to develop robust perception algorithms. However, acquiring these images can be both time-consuming and dangerous. To address this issue, unpaired image-to-image translation models offer a solution by synthesizing samples of the desired domain, thus eliminating the reliance on ground truth supervision. However, the current methods predominantly focus on single projections rather than multiple solutions, not to mention controlling the direction of generation, which creates a scope for enhancement. In this study, we propose a generative adversarial network (GAN)–based model, which incorporates both a style encoder and a content encoder, specifically designed to extract relevant information from an image. Further, we employ a decoder to reconstruct an image using these encoded features, while ensuring that the generated output remains within a permissible range by applying a self-regression module to constrain the style latent space. By modifying the hyperparameters, we can generate controllable outputs with specific style codes. We evaluate the performance of our model by generating snow scenes on the Cityscapes and the EuroCity Persons datasets. The results reveal the effectiveness of our proposed methodology, thereby reinforcing the benefits of our approach in the ongoing evolution of intelligent vehicle technology. MDPI 2023-10-12 /pmc/articles/PMC10611349/ /pubmed/37896492 http://dx.doi.org/10.3390/s23208398 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Yang, Hanting Carballo, Alexander Zhang, Yuxiao Takeda, Kazuya Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation
title	Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation
title_full	Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation
title_fullStr	Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation
title_full_unstemmed	Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation
title_short	Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation
title_sort	controllable unsupervised snow synthesis by latent style space manipulation
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10611349/ https://www.ncbi.nlm.nih.gov/pubmed/37896492 http://dx.doi.org/10.3390/s23208398
work_keys_str_mv	AT yanghanting controllableunsupervisedsnowsynthesisbylatentstylespacemanipulation AT carballoalexander controllableunsupervisedsnowsynthesisbylatentstylespacemanipulation AT zhangyuxiao controllableunsupervisedsnowsynthesisbylatentstylespacemanipulation AT takedakazuya controllableunsupervisedsnowsynthesisbylatentstylespacemanipulation

Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation

Ejemplares similares