Cargando…

Unsupervised Video Summarization Based on Deep Reinforcement Learning with Interpolation

Individuals spend time on online video-sharing platforms searching for videos. Video summarization helps search through many videos efficiently and quickly. In this paper, we propose an unsupervised video summarization method based on deep reinforcement learning with an interpolation method. To trai...

Descripción completa

Detalles Bibliográficos
Autores principales: Yoon, Ui Nyoung, Hong, Myung Duk, Jo, Geun-Sik
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10099378/
https://www.ncbi.nlm.nih.gov/pubmed/37050439
http://dx.doi.org/10.3390/s23073384
_version_ 1785025038826602496
author Yoon, Ui Nyoung
Hong, Myung Duk
Jo, Geun-Sik
author_facet Yoon, Ui Nyoung
Hong, Myung Duk
Jo, Geun-Sik
author_sort Yoon, Ui Nyoung
collection PubMed
description Individuals spend time on online video-sharing platforms searching for videos. Video summarization helps search through many videos efficiently and quickly. In this paper, we propose an unsupervised video summarization method based on deep reinforcement learning with an interpolation method. To train the video summarization network efficiently, we used the graph-level features and designed a reinforcement learning-based video summarization framework with a temporal consistency reward function and other reward functions. Our temporal consistency reward function helped to select keyframes uniformly. We present a lightweight video summarization network with transformer and CNN networks to capture the global and local contexts to efficiently predict the keyframe-level importance score of the video in a short length. The output importance score of the network was interpolated to fit the video length. Using the predicted importance score, we calculated the reward based on the reward functions, which helped select interesting keyframes efficiently and uniformly. We evaluated the proposed method on two datasets, SumMe and TVSum. The experimental results illustrate that the proposed method showed a state-of-the-art performance compared to the latest unsupervised video summarization methods, which we demonstrate and analyze experimentally.
format Online
Article
Text
id pubmed-10099378
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-100993782023-04-14 Unsupervised Video Summarization Based on Deep Reinforcement Learning with Interpolation Yoon, Ui Nyoung Hong, Myung Duk Jo, Geun-Sik Sensors (Basel) Article Individuals spend time on online video-sharing platforms searching for videos. Video summarization helps search through many videos efficiently and quickly. In this paper, we propose an unsupervised video summarization method based on deep reinforcement learning with an interpolation method. To train the video summarization network efficiently, we used the graph-level features and designed a reinforcement learning-based video summarization framework with a temporal consistency reward function and other reward functions. Our temporal consistency reward function helped to select keyframes uniformly. We present a lightweight video summarization network with transformer and CNN networks to capture the global and local contexts to efficiently predict the keyframe-level importance score of the video in a short length. The output importance score of the network was interpolated to fit the video length. Using the predicted importance score, we calculated the reward based on the reward functions, which helped select interesting keyframes efficiently and uniformly. We evaluated the proposed method on two datasets, SumMe and TVSum. The experimental results illustrate that the proposed method showed a state-of-the-art performance compared to the latest unsupervised video summarization methods, which we demonstrate and analyze experimentally. MDPI 2023-03-23 /pmc/articles/PMC10099378/ /pubmed/37050439 http://dx.doi.org/10.3390/s23073384 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Yoon, Ui Nyoung
Hong, Myung Duk
Jo, Geun-Sik
Unsupervised Video Summarization Based on Deep Reinforcement Learning with Interpolation
title Unsupervised Video Summarization Based on Deep Reinforcement Learning with Interpolation
title_full Unsupervised Video Summarization Based on Deep Reinforcement Learning with Interpolation
title_fullStr Unsupervised Video Summarization Based on Deep Reinforcement Learning with Interpolation
title_full_unstemmed Unsupervised Video Summarization Based on Deep Reinforcement Learning with Interpolation
title_short Unsupervised Video Summarization Based on Deep Reinforcement Learning with Interpolation
title_sort unsupervised video summarization based on deep reinforcement learning with interpolation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10099378/
https://www.ncbi.nlm.nih.gov/pubmed/37050439
http://dx.doi.org/10.3390/s23073384
work_keys_str_mv AT yoonuinyoung unsupervisedvideosummarizationbasedondeepreinforcementlearningwithinterpolation
AT hongmyungduk unsupervisedvideosummarizationbasedondeepreinforcementlearningwithinterpolation
AT jogeunsik unsupervisedvideosummarizationbasedondeepreinforcementlearningwithinterpolation