A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency
Deep summarization models have succeeded in the video summarization field, building on the development of gated recurrent unit (GRU) and long short-term memory (LSTM) technology. However, for some long videos, GRU and LSTM cannot effectively capture long-term dependencies. This paper proposes a deep...
Main Authors: | Wang, Xu; Li, Yujie; Wang, Haoyu; Huang, Longzhao; Ding, Shuxue |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2022 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9571073/ https://www.ncbi.nlm.nih.gov/pubmed/36236789 http://dx.doi.org/10.3390/s22197689 |
_version_ | 1784810271201558528 |
---|---|
author | Wang, Xu; Li, Yujie; Wang, Haoyu; Huang, Longzhao; Ding, Shuxue |
author_facet | Wang, Xu; Li, Yujie; Wang, Haoyu; Huang, Longzhao; Ding, Shuxue |
author_sort | Wang, Xu |
collection | PubMed |
description | Deep summarization models have succeeded in the video summarization field, building on the development of gated recurrent unit (GRU) and long short-term memory (LSTM) technology. However, for some long videos, GRU and LSTM cannot effectively capture long-term dependencies. This paper proposes a deep summarization network with auxiliary summarization losses to address this problem. We introduce an unsupervised auxiliary summarization loss module, built on LSTM and a swish activation function, that captures long-term dependencies for video summarization and can be easily integrated with various networks. The proposed model is an unsupervised deep reinforcement learning framework that does not depend on any labels or user interactions. Additionally, we implement a reward function ([Formula: see text]) that jointly considers the consistency, diversity, and representativeness of generated summaries. Furthermore, the proposed model is lightweight and can be deployed on mobile devices, enhancing the experience of mobile users and reducing pressure on server operations. We conducted experiments on two benchmark datasets, and the results demonstrate that our proposed unsupervised approach obtains better summaries than existing video summarization methods. In particular, the proposed algorithm achieves higher F-scores than the DR-DSN model, with an increase of nearly 6.3% on the SumMe dataset and 2.2% on the TVSum dataset. |
format | Online Article Text |
id | pubmed-9571073 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-9571073 2022-10-17 A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency Wang, Xu Li, Yujie Wang, Haoyu Huang, Longzhao Ding, Shuxue Sensors (Basel) Article MDPI 2022-10-10 /pmc/articles/PMC9571073/ /pubmed/36236789 http://dx.doi.org/10.3390/s22197689 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/ Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Wang, Xu Li, Yujie Wang, Haoyu Huang, Longzhao Ding, Shuxue A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency |
title | A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency |
title_sort | video summarization model based on deep reinforcement learning with long-term dependency |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9571073/ https://www.ncbi.nlm.nih.gov/pubmed/36236789 http://dx.doi.org/10.3390/s22197689 |
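
The description in this record names a swish activation and a reward that combines consistency, diversity, and representativeness, but the formula itself is not reproduced here. The sketch below is illustrative only: it assumes the diversity and representativeness terms take the form commonly used in DR-DSN-style models (mean pairwise cosine dissimilarity among selected frames, and the exponential of the negative mean distance from each frame to its nearest selected frame). The consistency term is omitted because this record does not define it. All function names are hypothetical, and feature vectors are assumed to be non-zero.

```python
import math
from typing import List, Sequence

Vector = Sequence[float]


def swish(x: float, beta: float = 1.0) -> float:
    """Swish activation x * sigmoid(beta * x), written to avoid overflow."""
    z = beta * x
    if z >= 0:
        return x / (1.0 + math.exp(-z))
    e = math.exp(z)
    return x * e / (1.0 + e)


def cosine_dissimilarity(a: Vector, b: Vector) -> float:
    """1 - cosine similarity; assumes neither vector is all zeros."""
    dot = sum(p * q for p, q in zip(a, b))
    norm_a = math.sqrt(sum(p * p for p in a))
    norm_b = math.sqrt(sum(q * q for q in b))
    return 1.0 - dot / (norm_a * norm_b)


def diversity_reward(features: List[Vector], picks: List[int]) -> float:
    """Mean pairwise dissimilarity among the selected frames (assumed form)."""
    n = len(picks)
    if n < 2:
        return 0.0
    total = sum(
        cosine_dissimilarity(features[i], features[j])
        for i in picks
        for j in picks
        if i != j
    )
    return total / (n * (n - 1))


def representativeness_reward(features: List[Vector], picks: List[int]) -> float:
    """exp(-mean distance from each frame to its nearest selected frame)."""
    if not picks:
        return 0.0
    mean_dist = sum(
        min(math.dist(x, features[j]) for j in picks) for x in features
    ) / len(features)
    return math.exp(-mean_dist)


# Toy example: three 2-D "frame features", frames 0 and 1 selected.
frames = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0]]
selected = [0, 1]
reward = diversity_reward(frames, selected) + representativeness_reward(frames, selected)
```

In the toy example the two selected frames are orthogonal and every frame coincides with a selected one, so both terms reach their maximum of 1. How the published model weights these terms against the consistency term is not stated in this record.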