Cargando…

Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation

In the field of intelligent vehicle technology, there is a high dependence on images captured under challenging conditions to develop robust perception algorithms. However, acquiring these images can be both time-consuming and dangerous. To address this issue, unpaired image-to-image translation mod...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Hanting, Carballo, Alexander, Zhang, Yuxiao, Takeda, Kazuya
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10611349/
https://www.ncbi.nlm.nih.gov/pubmed/37896492
http://dx.doi.org/10.3390/s23208398
_version_ 1785128470450274304
author Yang, Hanting
Carballo, Alexander
Zhang, Yuxiao
Takeda, Kazuya
author_facet Yang, Hanting
Carballo, Alexander
Zhang, Yuxiao
Takeda, Kazuya
author_sort Yang, Hanting
collection PubMed
description In the field of intelligent vehicle technology, there is a high dependence on images captured under challenging conditions to develop robust perception algorithms. However, acquiring these images can be both time-consuming and dangerous. To address this issue, unpaired image-to-image translation models offer a solution by synthesizing samples of the desired domain, thus eliminating the reliance on ground truth supervision. However, the current methods predominantly focus on single projections rather than multiple solutions, not to mention controlling the direction of generation, which creates a scope for enhancement. In this study, we propose a generative adversarial network (GAN)–based model, which incorporates both a style encoder and a content encoder, specifically designed to extract relevant information from an image. Further, we employ a decoder to reconstruct an image using these encoded features, while ensuring that the generated output remains within a permissible range by applying a self-regression module to constrain the style latent space. By modifying the hyperparameters, we can generate controllable outputs with specific style codes. We evaluate the performance of our model by generating snow scenes on the Cityscapes and the EuroCity Persons datasets. The results reveal the effectiveness of our proposed methodology, thereby reinforcing the benefits of our approach in the ongoing evolution of intelligent vehicle technology.
format Online
Article
Text
id pubmed-10611349
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-106113492023-10-28 Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation Yang, Hanting Carballo, Alexander Zhang, Yuxiao Takeda, Kazuya Sensors (Basel) Article In the field of intelligent vehicle technology, there is a high dependence on images captured under challenging conditions to develop robust perception algorithms. However, acquiring these images can be both time-consuming and dangerous. To address this issue, unpaired image-to-image translation models offer a solution by synthesizing samples of the desired domain, thus eliminating the reliance on ground truth supervision. However, the current methods predominantly focus on single projections rather than multiple solutions, not to mention controlling the direction of generation, which creates a scope for enhancement. In this study, we propose a generative adversarial network (GAN)–based model, which incorporates both a style encoder and a content encoder, specifically designed to extract relevant information from an image. Further, we employ a decoder to reconstruct an image using these encoded features, while ensuring that the generated output remains within a permissible range by applying a self-regression module to constrain the style latent space. By modifying the hyperparameters, we can generate controllable outputs with specific style codes. We evaluate the performance of our model by generating snow scenes on the Cityscapes and the EuroCity Persons datasets. The results reveal the effectiveness of our proposed methodology, thereby reinforcing the benefits of our approach in the ongoing evolution of intelligent vehicle technology. MDPI 2023-10-12 /pmc/articles/PMC10611349/ /pubmed/37896492 http://dx.doi.org/10.3390/s23208398 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Yang, Hanting
Carballo, Alexander
Zhang, Yuxiao
Takeda, Kazuya
Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation
title Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation
title_full Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation
title_fullStr Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation
title_full_unstemmed Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation
title_short Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation
title_sort controllable unsupervised snow synthesis by latent style space manipulation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10611349/
https://www.ncbi.nlm.nih.gov/pubmed/37896492
http://dx.doi.org/10.3390/s23208398
work_keys_str_mv AT yanghanting controllableunsupervisedsnowsynthesisbylatentstylespacemanipulation
AT carballoalexander controllableunsupervisedsnowsynthesisbylatentstylespacemanipulation
AT zhangyuxiao controllableunsupervisedsnowsynthesisbylatentstylespacemanipulation
AT takedakazuya controllableunsupervisedsnowsynthesisbylatentstylespacemanipulation