Cargando…

Time-Lag Aware Latent Variable Model for Prediction of Important Scenes Using Baseball Videos and Tweets

In this study, a novel prediction method for predicting important scenes in baseball videos using a time-lag aware latent variable model (Tl-LVM) is proposed. Tl-LVM adopts a multimodal variational autoencoder using tweets and videos as the latent variable model. It calculates the latent features fr...

Descripción completa

Detalles Bibliográficos
Autores principales: Hirasawa, Kaito, Maeda, Keisuke, Ogawa, Takahiro, Haseyama, Miki
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9002476/
https://www.ncbi.nlm.nih.gov/pubmed/35408079
http://dx.doi.org/10.3390/s22072465
_version_ 1784685899205836800
author Hirasawa, Kaito
Maeda, Keisuke
Ogawa, Takahiro
Haseyama, Miki
author_facet Hirasawa, Kaito
Maeda, Keisuke
Ogawa, Takahiro
Haseyama, Miki
author_sort Hirasawa, Kaito
collection PubMed
description In this study, a novel prediction method for predicting important scenes in baseball videos using a time-lag aware latent variable model (Tl-LVM) is proposed. Tl-LVM adopts a multimodal variational autoencoder using tweets and videos as the latent variable model. It calculates the latent features from these tweets and videos and predicts important scenes using these latent features. Since time lags exist between posted tweets and events, Tl-LVM introduces the loss considering time lags by correlating the feature into the loss function of the multimodal variational autoencoder. Furthermore, Tl-LVM can train the encoder, decoder, and important scene predictor, simultaneously, using this loss function. This is the novelty of Tl-LVM, and this work is the first end-to-end prediction model of important scenes that considers time lags to the best of our knowledge. It is the contribution of Tl-LVM to realize high-quality prediction using latent features that consider time lags between tweets and multiple corresponding previous events. Experimental results using actual tweets and baseball videos show the effectiveness of Tl-LVM.
format Online
Article
Text
id pubmed-9002476
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-90024762022-04-13 Time-Lag Aware Latent Variable Model for Prediction of Important Scenes Using Baseball Videos and Tweets Hirasawa, Kaito Maeda, Keisuke Ogawa, Takahiro Haseyama, Miki Sensors (Basel) Communication In this study, a novel prediction method for predicting important scenes in baseball videos using a time-lag aware latent variable model (Tl-LVM) is proposed. Tl-LVM adopts a multimodal variational autoencoder using tweets and videos as the latent variable model. It calculates the latent features from these tweets and videos and predicts important scenes using these latent features. Since time lags exist between posted tweets and events, Tl-LVM introduces the loss considering time lags by correlating the feature into the loss function of the multimodal variational autoencoder. Furthermore, Tl-LVM can train the encoder, decoder, and important scene predictor, simultaneously, using this loss function. This is the novelty of Tl-LVM, and this work is the first end-to-end prediction model of important scenes that considers time lags to the best of our knowledge. It is the contribution of Tl-LVM to realize high-quality prediction using latent features that consider time lags between tweets and multiple corresponding previous events. Experimental results using actual tweets and baseball videos show the effectiveness of Tl-LVM. MDPI 2022-03-23 /pmc/articles/PMC9002476/ /pubmed/35408079 http://dx.doi.org/10.3390/s22072465 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Communication
Hirasawa, Kaito
Maeda, Keisuke
Ogawa, Takahiro
Haseyama, Miki
Time-Lag Aware Latent Variable Model for Prediction of Important Scenes Using Baseball Videos and Tweets
title Time-Lag Aware Latent Variable Model for Prediction of Important Scenes Using Baseball Videos and Tweets
title_full Time-Lag Aware Latent Variable Model for Prediction of Important Scenes Using Baseball Videos and Tweets
title_fullStr Time-Lag Aware Latent Variable Model for Prediction of Important Scenes Using Baseball Videos and Tweets
title_full_unstemmed Time-Lag Aware Latent Variable Model for Prediction of Important Scenes Using Baseball Videos and Tweets
title_short Time-Lag Aware Latent Variable Model for Prediction of Important Scenes Using Baseball Videos and Tweets
title_sort time-lag aware latent variable model for prediction of important scenes using baseball videos and tweets
topic Communication
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9002476/
https://www.ncbi.nlm.nih.gov/pubmed/35408079
http://dx.doi.org/10.3390/s22072465
work_keys_str_mv AT hirasawakaito timelagawarelatentvariablemodelforpredictionofimportantscenesusingbaseballvideosandtweets
AT maedakeisuke timelagawarelatentvariablemodelforpredictionofimportantscenesusingbaseballvideosandtweets
AT ogawatakahiro timelagawarelatentvariablemodelforpredictionofimportantscenesusingbaseballvideosandtweets
AT haseyamamiki timelagawarelatentvariablemodelforpredictionofimportantscenesusingbaseballvideosandtweets