Cargando…

A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency

Deep summarization models have succeeded in the video summarization field based on the development of gated recursive unit (GRU) and long and short-term memory (LSTM) technology. However, for some long videos, GRU and LSTM cannot effectively capture long-term dependencies. This paper proposes a deep...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Xu, Li, Yujie, Wang, Haoyu, Huang, Longzhao, Ding, Shuxue
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9571073/
https://www.ncbi.nlm.nih.gov/pubmed/36236789
http://dx.doi.org/10.3390/s22197689
_version_ 1784810271201558528
author Wang, Xu
Li, Yujie
Wang, Haoyu
Huang, Longzhao
Ding, Shuxue
author_facet Wang, Xu
Li, Yujie
Wang, Haoyu
Huang, Longzhao
Ding, Shuxue
author_sort Wang, Xu
collection PubMed
description Deep summarization models have succeeded in the video summarization field based on the development of gated recursive unit (GRU) and long and short-term memory (LSTM) technology. However, for some long videos, GRU and LSTM cannot effectively capture long-term dependencies. This paper proposes a deep summarization network with auxiliary summarization losses to address this problem. We introduce an unsupervised auxiliary summarization loss module with LSTM and a swish activation function to capture the long-term dependencies for video summarization, which can be easily integrated with various networks. The proposed model is an unsupervised framework for deep reinforcement learning that does not depend on any labels or user interactions. Additionally, we implement a reward function ([Formula: see text]) that jointly considers the consistency, diversity, and representativeness of generated summaries. Furthermore, the proposed model is lightweight and can be successfully deployed on mobile devices and enhance the experience of mobile users and reduce pressure on server operations. We conducted experiments on two benchmark datasets and the results demonstrate that our proposed unsupervised approach can obtain better summaries than existing video summarization methods. Furthermore, the proposed algorithm can generate higher F scores with a nearly 6.3% increase on the SumMe dataset and a 2.2% increase on the TVSum dataset compared to the DR-DSN model.
format Online
Article
Text
id pubmed-9571073
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-95710732022-10-17 A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency Wang, Xu Li, Yujie Wang, Haoyu Huang, Longzhao Ding, Shuxue Sensors (Basel) Article Deep summarization models have succeeded in the video summarization field based on the development of gated recursive unit (GRU) and long and short-term memory (LSTM) technology. However, for some long videos, GRU and LSTM cannot effectively capture long-term dependencies. This paper proposes a deep summarization network with auxiliary summarization losses to address this problem. We introduce an unsupervised auxiliary summarization loss module with LSTM and a swish activation function to capture the long-term dependencies for video summarization, which can be easily integrated with various networks. The proposed model is an unsupervised framework for deep reinforcement learning that does not depend on any labels or user interactions. Additionally, we implement a reward function ([Formula: see text]) that jointly considers the consistency, diversity, and representativeness of generated summaries. Furthermore, the proposed model is lightweight and can be successfully deployed on mobile devices and enhance the experience of mobile users and reduce pressure on server operations. We conducted experiments on two benchmark datasets and the results demonstrate that our proposed unsupervised approach can obtain better summaries than existing video summarization methods. Furthermore, the proposed algorithm can generate higher F scores with a nearly 6.3% increase on the SumMe dataset and a 2.2% increase on the TVSum dataset compared to the DR-DSN model. MDPI 2022-10-10 /pmc/articles/PMC9571073/ /pubmed/36236789 http://dx.doi.org/10.3390/s22197689 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wang, Xu
Li, Yujie
Wang, Haoyu
Huang, Longzhao
Ding, Shuxue
A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency
title A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency
title_full A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency
title_fullStr A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency
title_full_unstemmed A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency
title_short A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency
title_sort video summarization model based on deep reinforcement learning with long-term dependency
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9571073/
https://www.ncbi.nlm.nih.gov/pubmed/36236789
http://dx.doi.org/10.3390/s22197689
work_keys_str_mv AT wangxu avideosummarizationmodelbasedondeepreinforcementlearningwithlongtermdependency
AT liyujie avideosummarizationmodelbasedondeepreinforcementlearningwithlongtermdependency
AT wanghaoyu avideosummarizationmodelbasedondeepreinforcementlearningwithlongtermdependency
AT huanglongzhao avideosummarizationmodelbasedondeepreinforcementlearningwithlongtermdependency
AT dingshuxue avideosummarizationmodelbasedondeepreinforcementlearningwithlongtermdependency
AT wangxu videosummarizationmodelbasedondeepreinforcementlearningwithlongtermdependency
AT liyujie videosummarizationmodelbasedondeepreinforcementlearningwithlongtermdependency
AT wanghaoyu videosummarizationmodelbasedondeepreinforcementlearningwithlongtermdependency
AT huanglongzhao videosummarizationmodelbasedondeepreinforcementlearningwithlongtermdependency
AT dingshuxue videosummarizationmodelbasedondeepreinforcementlearningwithlongtermdependency