Cargando…

Detection of Important Scenes in Baseball Videos via a Time-Lag-Aware Multimodal Variational Autoencoder †

A new method for the detection of important scenes in baseball videos via a time-lag-aware multimodal variational autoencoder (Tl-MVAE) is presented in this paper. Tl-MVAE estimates latent features calculated from tweet, video, and audio features extracted from tweets and videos. Then, important sce...

Descripción completa

Detalles Bibliográficos
Autores principales: Hirasawa, Kaito, Maeda, Keisuke, Ogawa, Takahiro, Haseyama, Miki
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7999231/
https://www.ncbi.nlm.nih.gov/pubmed/33799412
http://dx.doi.org/10.3390/s21062045
_version_ 1783670734336294912
author Hirasawa, Kaito
Maeda, Keisuke
Ogawa, Takahiro
Haseyama, Miki
author_facet Hirasawa, Kaito
Maeda, Keisuke
Ogawa, Takahiro
Haseyama, Miki
author_sort Hirasawa, Kaito
collection PubMed
description A new method for the detection of important scenes in baseball videos via a time-lag-aware multimodal variational autoencoder (Tl-MVAE) is presented in this paper. Tl-MVAE estimates latent features calculated from tweet, video, and audio features extracted from tweets and videos. Then, important scenes are detected by estimating the probability of the scene being important from estimated latent features. It should be noted that there exist time-lags between tweets posted by users and videos. To consider the time-lags between tweet features and other features calculated from corresponding multiple previous events, the feature transformation based on feature correlation considering such time-lags is newly introduced to the encoder in MVAE in the proposed method. This is the biggest contribution of the Tl-MVAE. Experimental results obtained from actual baseball videos and their corresponding tweets show the effectiveness of the proposed method.
format Online
Article
Text
id pubmed-7999231
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-79992312021-03-28 Detection of Important Scenes in Baseball Videos via a Time-Lag-Aware Multimodal Variational Autoencoder † Hirasawa, Kaito Maeda, Keisuke Ogawa, Takahiro Haseyama, Miki Sensors (Basel) Article A new method for the detection of important scenes in baseball videos via a time-lag-aware multimodal variational autoencoder (Tl-MVAE) is presented in this paper. Tl-MVAE estimates latent features calculated from tweet, video, and audio features extracted from tweets and videos. Then, important scenes are detected by estimating the probability of the scene being important from estimated latent features. It should be noted that there exist time-lags between tweets posted by users and videos. To consider the time-lags between tweet features and other features calculated from corresponding multiple previous events, the feature transformation based on feature correlation considering such time-lags is newly introduced to the encoder in MVAE in the proposed method. This is the biggest contribution of the Tl-MVAE. Experimental results obtained from actual baseball videos and their corresponding tweets show the effectiveness of the proposed method. MDPI 2021-03-14 /pmc/articles/PMC7999231/ /pubmed/33799412 http://dx.doi.org/10.3390/s21062045 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Hirasawa, Kaito
Maeda, Keisuke
Ogawa, Takahiro
Haseyama, Miki
Detection of Important Scenes in Baseball Videos via a Time-Lag-Aware Multimodal Variational Autoencoder †
title Detection of Important Scenes in Baseball Videos via a Time-Lag-Aware Multimodal Variational Autoencoder †
title_full Detection of Important Scenes in Baseball Videos via a Time-Lag-Aware Multimodal Variational Autoencoder †
title_fullStr Detection of Important Scenes in Baseball Videos via a Time-Lag-Aware Multimodal Variational Autoencoder †
title_full_unstemmed Detection of Important Scenes in Baseball Videos via a Time-Lag-Aware Multimodal Variational Autoencoder †
title_short Detection of Important Scenes in Baseball Videos via a Time-Lag-Aware Multimodal Variational Autoencoder †
title_sort detection of important scenes in baseball videos via a time-lag-aware multimodal variational autoencoder †
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7999231/
https://www.ncbi.nlm.nih.gov/pubmed/33799412
http://dx.doi.org/10.3390/s21062045
work_keys_str_mv AT hirasawakaito detectionofimportantscenesinbaseballvideosviaatimelagawaremultimodalvariationalautoencoder
AT maedakeisuke detectionofimportantscenesinbaseballvideosviaatimelagawaremultimodalvariationalautoencoder
AT ogawatakahiro detectionofimportantscenesinbaseballvideosviaatimelagawaremultimodalvariationalautoencoder
AT haseyamamiki detectionofimportantscenesinbaseballvideosviaatimelagawaremultimodalvariationalautoencoder