Cargando…

Research on Volleyball Video Intelligent Description Technology Combining the Long-Term and Short-Term Memory Network and Attention Mechanism

With the development of computer technology, video description, which combines the key technologies in the field of natural language processing and computer vision, has attracted more and more researchers' attention. Among them, how to objectively and efficiently describe high-speed and detaile...

Descripción completa

Detalles Bibliográficos
Autores principales: Gao, Yuhua, Mo, Yong, Zhang, Heng, Huang, Ruiyin, Chen, Zilong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8531798/
https://www.ncbi.nlm.nih.gov/pubmed/34691171
http://dx.doi.org/10.1155/2021/7088837
_version_ 1784586941561307136
author Gao, Yuhua
Mo, Yong
Zhang, Heng
Huang, Ruiyin
Chen, Zilong
author_facet Gao, Yuhua
Mo, Yong
Zhang, Heng
Huang, Ruiyin
Chen, Zilong
author_sort Gao, Yuhua
collection PubMed
description With the development of computer technology, video description, which combines the key technologies in the field of natural language processing and computer vision, has attracted more and more researchers' attention. Among them, how to objectively and efficiently describe high-speed and detailed sports videos is the key to the development of the video description field. In view of the problems of sentence errors and loss of visual information in the generation of the video description text due to the lack of language learning information in the existing video description methods, a multihead model combining the long-term and short-term memory network and attention mechanism is proposed for the intelligent description of the volleyball video. Through the introduction of the attention mechanism, the model pays much attention to the significant areas in the video when generating sentences. Through the comparative experiment with different models, the results show that the model with the attention mechanism can effectively solve the loss of visual information. Compared with the LSTM and base model, the multihead model proposed in this paper, which combines the long-term and short-term memory network and attention mechanism, has higher scores in all evaluation indexes and significantly improved the quality of the intelligent text description of the volleyball video.
format Online
Article
Text
id pubmed-8531798
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-85317982021-10-23 Research on Volleyball Video Intelligent Description Technology Combining the Long-Term and Short-Term Memory Network and Attention Mechanism Gao, Yuhua Mo, Yong Zhang, Heng Huang, Ruiyin Chen, Zilong Comput Intell Neurosci Research Article With the development of computer technology, video description, which combines the key technologies in the field of natural language processing and computer vision, has attracted more and more researchers' attention. Among them, how to objectively and efficiently describe high-speed and detailed sports videos is the key to the development of the video description field. In view of the problems of sentence errors and loss of visual information in the generation of the video description text due to the lack of language learning information in the existing video description methods, a multihead model combining the long-term and short-term memory network and attention mechanism is proposed for the intelligent description of the volleyball video. Through the introduction of the attention mechanism, the model pays much attention to the significant areas in the video when generating sentences. Through the comparative experiment with different models, the results show that the model with the attention mechanism can effectively solve the loss of visual information. Compared with the LSTM and base model, the multihead model proposed in this paper, which combines the long-term and short-term memory network and attention mechanism, has higher scores in all evaluation indexes and significantly improved the quality of the intelligent text description of the volleyball video. Hindawi 2021-10-14 /pmc/articles/PMC8531798/ /pubmed/34691171 http://dx.doi.org/10.1155/2021/7088837 Text en Copyright © 2021 Yuhua Gao et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Gao, Yuhua
Mo, Yong
Zhang, Heng
Huang, Ruiyin
Chen, Zilong
Research on Volleyball Video Intelligent Description Technology Combining the Long-Term and Short-Term Memory Network and Attention Mechanism
title Research on Volleyball Video Intelligent Description Technology Combining the Long-Term and Short-Term Memory Network and Attention Mechanism
title_full Research on Volleyball Video Intelligent Description Technology Combining the Long-Term and Short-Term Memory Network and Attention Mechanism
title_fullStr Research on Volleyball Video Intelligent Description Technology Combining the Long-Term and Short-Term Memory Network and Attention Mechanism
title_full_unstemmed Research on Volleyball Video Intelligent Description Technology Combining the Long-Term and Short-Term Memory Network and Attention Mechanism
title_short Research on Volleyball Video Intelligent Description Technology Combining the Long-Term and Short-Term Memory Network and Attention Mechanism
title_sort research on volleyball video intelligent description technology combining the long-term and short-term memory network and attention mechanism
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8531798/
https://www.ncbi.nlm.nih.gov/pubmed/34691171
http://dx.doi.org/10.1155/2021/7088837
work_keys_str_mv AT gaoyuhua researchonvolleyballvideointelligentdescriptiontechnologycombiningthelongtermandshorttermmemorynetworkandattentionmechanism
AT moyong researchonvolleyballvideointelligentdescriptiontechnologycombiningthelongtermandshorttermmemorynetworkandattentionmechanism
AT zhangheng researchonvolleyballvideointelligentdescriptiontechnologycombiningthelongtermandshorttermmemorynetworkandattentionmechanism
AT huangruiyin researchonvolleyballvideointelligentdescriptiontechnologycombiningthelongtermandshorttermmemorynetworkandattentionmechanism
AT chenzilong researchonvolleyballvideointelligentdescriptiontechnologycombiningthelongtermandshorttermmemorynetworkandattentionmechanism