Cargando…

Sequential Variational Autoencoder with Adversarial Classifier for Video Disentanglement

In this paper, we propose a sequential variational autoencoder for video disentanglement, which is a representation learning method that can be used to separately extract static and dynamic features from videos. Building sequential variational autoencoders with a two-stream architecture induces indu...

Descripción completa

Detalles Bibliográficos
Autores principales:	Haga, Takeshi, Kera, Hiroshi, Kawamoto, Kazuhiko
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10006877/ https://www.ncbi.nlm.nih.gov/pubmed/36904719 http://dx.doi.org/10.3390/s23052515

_version_	1784905379540369408
author	Haga, Takeshi Kera, Hiroshi Kawamoto, Kazuhiko
author_facet	Haga, Takeshi Kera, Hiroshi Kawamoto, Kazuhiko
author_sort	Haga, Takeshi
collection	PubMed
description	In this paper, we propose a sequential variational autoencoder for video disentanglement, which is a representation learning method that can be used to separately extract static and dynamic features from videos. Building sequential variational autoencoders with a two-stream architecture induces inductive bias for video disentanglement. However, our preliminary experiment demonstrated that the two-stream architecture is insufficient for video disentanglement because static features frequently contain dynamic features. Additionally, we found that dynamic features are not discriminative in the latent space. To address these problems, we introduced an adversarial classifier using supervised learning into the two-stream architecture. The strong inductive bias through supervision separates dynamic features from static features and yields discriminative representations of the dynamic features. Through a comparison with other sequential variational autoencoders, we qualitatively and quantitatively demonstrate the effectiveness of the proposed method on the Sprites and MUG datasets.
format	Online Article Text
id	pubmed-10006877
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-100068772023-03-12 Sequential Variational Autoencoder with Adversarial Classifier for Video Disentanglement Haga, Takeshi Kera, Hiroshi Kawamoto, Kazuhiko Sensors (Basel) Article In this paper, we propose a sequential variational autoencoder for video disentanglement, which is a representation learning method that can be used to separately extract static and dynamic features from videos. Building sequential variational autoencoders with a two-stream architecture induces inductive bias for video disentanglement. However, our preliminary experiment demonstrated that the two-stream architecture is insufficient for video disentanglement because static features frequently contain dynamic features. Additionally, we found that dynamic features are not discriminative in the latent space. To address these problems, we introduced an adversarial classifier using supervised learning into the two-stream architecture. The strong inductive bias through supervision separates dynamic features from static features and yields discriminative representations of the dynamic features. Through a comparison with other sequential variational autoencoders, we qualitatively and quantitatively demonstrate the effectiveness of the proposed method on the Sprites and MUG datasets. MDPI 2023-02-24 /pmc/articles/PMC10006877/ /pubmed/36904719 http://dx.doi.org/10.3390/s23052515 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Haga, Takeshi Kera, Hiroshi Kawamoto, Kazuhiko Sequential Variational Autoencoder with Adversarial Classifier for Video Disentanglement
title	Sequential Variational Autoencoder with Adversarial Classifier for Video Disentanglement
title_full	Sequential Variational Autoencoder with Adversarial Classifier for Video Disentanglement
title_fullStr	Sequential Variational Autoencoder with Adversarial Classifier for Video Disentanglement
title_full_unstemmed	Sequential Variational Autoencoder with Adversarial Classifier for Video Disentanglement
title_short	Sequential Variational Autoencoder with Adversarial Classifier for Video Disentanglement
title_sort	sequential variational autoencoder with adversarial classifier for video disentanglement
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10006877/ https://www.ncbi.nlm.nih.gov/pubmed/36904719 http://dx.doi.org/10.3390/s23052515
work_keys_str_mv	AT hagatakeshi sequentialvariationalautoencoderwithadversarialclassifierforvideodisentanglement AT kerahiroshi sequentialvariationalautoencoderwithadversarialclassifierforvideodisentanglement AT kawamotokazuhiko sequentialvariationalautoencoderwithadversarialclassifierforvideodisentanglement

Sequential Variational Autoencoder with Adversarial Classifier for Video Disentanglement

Ejemplares similares