Cargando…

Time-Lag Aware Latent Variable Model for Prediction of Important Scenes Using Baseball Videos and Tweets

In this study, a novel prediction method for predicting important scenes in baseball videos using a time-lag aware latent variable model (Tl-LVM) is proposed. Tl-LVM adopts a multimodal variational autoencoder using tweets and videos as the latent variable model. It calculates the latent features fr...

Descripción completa

Detalles Bibliográficos
Autores principales:	Hirasawa, Kaito, Maeda, Keisuke, Ogawa, Takahiro, Haseyama, Miki
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Communication
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9002476/ https://www.ncbi.nlm.nih.gov/pubmed/35408079 http://dx.doi.org/10.3390/s22072465

_version_	1784685899205836800
author	Hirasawa, Kaito Maeda, Keisuke Ogawa, Takahiro Haseyama, Miki
author_facet	Hirasawa, Kaito Maeda, Keisuke Ogawa, Takahiro Haseyama, Miki
author_sort	Hirasawa, Kaito
collection	PubMed
description	In this study, a novel prediction method for predicting important scenes in baseball videos using a time-lag aware latent variable model (Tl-LVM) is proposed. Tl-LVM adopts a multimodal variational autoencoder using tweets and videos as the latent variable model. It calculates the latent features from these tweets and videos and predicts important scenes using these latent features. Since time lags exist between posted tweets and events, Tl-LVM introduces the loss considering time lags by correlating the feature into the loss function of the multimodal variational autoencoder. Furthermore, Tl-LVM can train the encoder, decoder, and important scene predictor, simultaneously, using this loss function. This is the novelty of Tl-LVM, and this work is the first end-to-end prediction model of important scenes that considers time lags to the best of our knowledge. It is the contribution of Tl-LVM to realize high-quality prediction using latent features that consider time lags between tweets and multiple corresponding previous events. Experimental results using actual tweets and baseball videos show the effectiveness of Tl-LVM.
format	Online Article Text
id	pubmed-9002476
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-90024762022-04-13 Time-Lag Aware Latent Variable Model for Prediction of Important Scenes Using Baseball Videos and Tweets Hirasawa, Kaito Maeda, Keisuke Ogawa, Takahiro Haseyama, Miki Sensors (Basel) Communication In this study, a novel prediction method for predicting important scenes in baseball videos using a time-lag aware latent variable model (Tl-LVM) is proposed. Tl-LVM adopts a multimodal variational autoencoder using tweets and videos as the latent variable model. It calculates the latent features from these tweets and videos and predicts important scenes using these latent features. Since time lags exist between posted tweets and events, Tl-LVM introduces the loss considering time lags by correlating the feature into the loss function of the multimodal variational autoencoder. Furthermore, Tl-LVM can train the encoder, decoder, and important scene predictor, simultaneously, using this loss function. This is the novelty of Tl-LVM, and this work is the first end-to-end prediction model of important scenes that considers time lags to the best of our knowledge. It is the contribution of Tl-LVM to realize high-quality prediction using latent features that consider time lags between tweets and multiple corresponding previous events. Experimental results using actual tweets and baseball videos show the effectiveness of Tl-LVM. MDPI 2022-03-23 /pmc/articles/PMC9002476/ /pubmed/35408079 http://dx.doi.org/10.3390/s22072465 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Communication Hirasawa, Kaito Maeda, Keisuke Ogawa, Takahiro Haseyama, Miki Time-Lag Aware Latent Variable Model for Prediction of Important Scenes Using Baseball Videos and Tweets
title	Time-Lag Aware Latent Variable Model for Prediction of Important Scenes Using Baseball Videos and Tweets
title_full	Time-Lag Aware Latent Variable Model for Prediction of Important Scenes Using Baseball Videos and Tweets
title_fullStr	Time-Lag Aware Latent Variable Model for Prediction of Important Scenes Using Baseball Videos and Tweets
title_full_unstemmed	Time-Lag Aware Latent Variable Model for Prediction of Important Scenes Using Baseball Videos and Tweets
title_short	Time-Lag Aware Latent Variable Model for Prediction of Important Scenes Using Baseball Videos and Tweets
title_sort	time-lag aware latent variable model for prediction of important scenes using baseball videos and tweets
topic	Communication
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9002476/ https://www.ncbi.nlm.nih.gov/pubmed/35408079 http://dx.doi.org/10.3390/s22072465
work_keys_str_mv	AT hirasawakaito timelagawarelatentvariablemodelforpredictionofimportantscenesusingbaseballvideosandtweets AT maedakeisuke timelagawarelatentvariablemodelforpredictionofimportantscenesusingbaseballvideosandtweets AT ogawatakahiro timelagawarelatentvariablemodelforpredictionofimportantscenesusingbaseballvideosandtweets AT haseyamamiki timelagawarelatentvariablemodelforpredictionofimportantscenesusingbaseballvideosandtweets

Time-Lag Aware Latent Variable Model for Prediction of Important Scenes Using Baseball Videos and Tweets

Ejemplares similares