Cargando…
Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation
In the field of intelligent vehicle technology, there is a high dependence on images captured under challenging conditions to develop robust perception algorithms. However, acquiring these images can be both time-consuming and dangerous. To address this issue, unpaired image-to-image translation mod...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10611349/ https://www.ncbi.nlm.nih.gov/pubmed/37896492 http://dx.doi.org/10.3390/s23208398 |
_version_ | 1785128470450274304 |
---|---|
author | Yang, Hanting Carballo, Alexander Zhang, Yuxiao Takeda, Kazuya |
author_facet | Yang, Hanting Carballo, Alexander Zhang, Yuxiao Takeda, Kazuya |
author_sort | Yang, Hanting |
collection | PubMed |
description | In the field of intelligent vehicle technology, there is a high dependence on images captured under challenging conditions to develop robust perception algorithms. However, acquiring these images can be both time-consuming and dangerous. To address this issue, unpaired image-to-image translation models offer a solution by synthesizing samples of the desired domain, thus eliminating the reliance on ground truth supervision. However, the current methods predominantly focus on single projections rather than multiple solutions, not to mention controlling the direction of generation, which creates a scope for enhancement. In this study, we propose a generative adversarial network (GAN)–based model, which incorporates both a style encoder and a content encoder, specifically designed to extract relevant information from an image. Further, we employ a decoder to reconstruct an image using these encoded features, while ensuring that the generated output remains within a permissible range by applying a self-regression module to constrain the style latent space. By modifying the hyperparameters, we can generate controllable outputs with specific style codes. We evaluate the performance of our model by generating snow scenes on the Cityscapes and the EuroCity Persons datasets. The results reveal the effectiveness of our proposed methodology, thereby reinforcing the benefits of our approach in the ongoing evolution of intelligent vehicle technology. |
format | Online Article Text |
id | pubmed-10611349 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-106113492023-10-28 Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation Yang, Hanting Carballo, Alexander Zhang, Yuxiao Takeda, Kazuya Sensors (Basel) Article In the field of intelligent vehicle technology, there is a high dependence on images captured under challenging conditions to develop robust perception algorithms. However, acquiring these images can be both time-consuming and dangerous. To address this issue, unpaired image-to-image translation models offer a solution by synthesizing samples of the desired domain, thus eliminating the reliance on ground truth supervision. However, the current methods predominantly focus on single projections rather than multiple solutions, not to mention controlling the direction of generation, which creates a scope for enhancement. In this study, we propose a generative adversarial network (GAN)–based model, which incorporates both a style encoder and a content encoder, specifically designed to extract relevant information from an image. Further, we employ a decoder to reconstruct an image using these encoded features, while ensuring that the generated output remains within a permissible range by applying a self-regression module to constrain the style latent space. By modifying the hyperparameters, we can generate controllable outputs with specific style codes. We evaluate the performance of our model by generating snow scenes on the Cityscapes and the EuroCity Persons datasets. The results reveal the effectiveness of our proposed methodology, thereby reinforcing the benefits of our approach in the ongoing evolution of intelligent vehicle technology. MDPI 2023-10-12 /pmc/articles/PMC10611349/ /pubmed/37896492 http://dx.doi.org/10.3390/s23208398 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Yang, Hanting Carballo, Alexander Zhang, Yuxiao Takeda, Kazuya Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation |
title | Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation |
title_full | Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation |
title_fullStr | Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation |
title_full_unstemmed | Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation |
title_short | Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation |
title_sort | controllable unsupervised snow synthesis by latent style space manipulation |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10611349/ https://www.ncbi.nlm.nih.gov/pubmed/37896492 http://dx.doi.org/10.3390/s23208398 |
work_keys_str_mv | AT yanghanting controllableunsupervisedsnowsynthesisbylatentstylespacemanipulation AT carballoalexander controllableunsupervisedsnowsynthesisbylatentstylespacemanipulation AT zhangyuxiao controllableunsupervisedsnowsynthesisbylatentstylespacemanipulation AT takedakazuya controllableunsupervisedsnowsynthesisbylatentstylespacemanipulation |