Cargando…

A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency

Deep summarization models have succeeded in the video summarization field based on the development of gated recursive unit (GRU) and long and short-term memory (LSTM) technology. However, for some long videos, GRU and LSTM cannot effectively capture long-term dependencies. This paper proposes a deep...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wang, Xu, Li, Yujie, Wang, Haoyu, Huang, Longzhao, Ding, Shuxue
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9571073/ https://www.ncbi.nlm.nih.gov/pubmed/36236789 http://dx.doi.org/10.3390/s22197689

_version_	1784810271201558528
author	Wang, Xu Li, Yujie Wang, Haoyu Huang, Longzhao Ding, Shuxue
author_facet	Wang, Xu Li, Yujie Wang, Haoyu Huang, Longzhao Ding, Shuxue
author_sort	Wang, Xu
collection	PubMed
description	Deep summarization models have succeeded in the video summarization field based on the development of gated recursive unit (GRU) and long and short-term memory (LSTM) technology. However, for some long videos, GRU and LSTM cannot effectively capture long-term dependencies. This paper proposes a deep summarization network with auxiliary summarization losses to address this problem. We introduce an unsupervised auxiliary summarization loss module with LSTM and a swish activation function to capture the long-term dependencies for video summarization, which can be easily integrated with various networks. The proposed model is an unsupervised framework for deep reinforcement learning that does not depend on any labels or user interactions. Additionally, we implement a reward function ([Formula: see text]) that jointly considers the consistency, diversity, and representativeness of generated summaries. Furthermore, the proposed model is lightweight and can be successfully deployed on mobile devices and enhance the experience of mobile users and reduce pressure on server operations. We conducted experiments on two benchmark datasets and the results demonstrate that our proposed unsupervised approach can obtain better summaries than existing video summarization methods. Furthermore, the proposed algorithm can generate higher F scores with a nearly 6.3% increase on the SumMe dataset and a 2.2% increase on the TVSum dataset compared to the DR-DSN model.
format	Online Article Text
id	pubmed-9571073
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-95710732022-10-17 A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency Wang, Xu Li, Yujie Wang, Haoyu Huang, Longzhao Ding, Shuxue Sensors (Basel) Article Deep summarization models have succeeded in the video summarization field based on the development of gated recursive unit (GRU) and long and short-term memory (LSTM) technology. However, for some long videos, GRU and LSTM cannot effectively capture long-term dependencies. This paper proposes a deep summarization network with auxiliary summarization losses to address this problem. We introduce an unsupervised auxiliary summarization loss module with LSTM and a swish activation function to capture the long-term dependencies for video summarization, which can be easily integrated with various networks. The proposed model is an unsupervised framework for deep reinforcement learning that does not depend on any labels or user interactions. Additionally, we implement a reward function ([Formula: see text]) that jointly considers the consistency, diversity, and representativeness of generated summaries. Furthermore, the proposed model is lightweight and can be successfully deployed on mobile devices and enhance the experience of mobile users and reduce pressure on server operations. We conducted experiments on two benchmark datasets and the results demonstrate that our proposed unsupervised approach can obtain better summaries than existing video summarization methods. Furthermore, the proposed algorithm can generate higher F scores with a nearly 6.3% increase on the SumMe dataset and a 2.2% increase on the TVSum dataset compared to the DR-DSN model. MDPI 2022-10-10 /pmc/articles/PMC9571073/ /pubmed/36236789 http://dx.doi.org/10.3390/s22197689 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Wang, Xu Li, Yujie Wang, Haoyu Huang, Longzhao Ding, Shuxue A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency
title	A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency
title_full	A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency
title_fullStr	A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency
title_full_unstemmed	A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency
title_short	A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency
title_sort	video summarization model based on deep reinforcement learning with long-term dependency
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9571073/ https://www.ncbi.nlm.nih.gov/pubmed/36236789 http://dx.doi.org/10.3390/s22197689
work_keys_str_mv	AT wangxu avideosummarizationmodelbasedondeepreinforcementlearningwithlongtermdependency AT liyujie avideosummarizationmodelbasedondeepreinforcementlearningwithlongtermdependency AT wanghaoyu avideosummarizationmodelbasedondeepreinforcementlearningwithlongtermdependency AT huanglongzhao avideosummarizationmodelbasedondeepreinforcementlearningwithlongtermdependency AT dingshuxue avideosummarizationmodelbasedondeepreinforcementlearningwithlongtermdependency AT wangxu videosummarizationmodelbasedondeepreinforcementlearningwithlongtermdependency AT liyujie videosummarizationmodelbasedondeepreinforcementlearningwithlongtermdependency AT wanghaoyu videosummarizationmodelbasedondeepreinforcementlearningwithlongtermdependency AT huanglongzhao videosummarizationmodelbasedondeepreinforcementlearningwithlongtermdependency AT dingshuxue videosummarizationmodelbasedondeepreinforcementlearningwithlongtermdependency

A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency

Ejemplares similares