Cargando…
Research on Video Captioning Based on Multifeature Fusion
Aiming at the problems that the existing video captioning models pay attention to incomplete information and the generation of expression text is not accurate enough, a video captioning model that integrates image, audio, and motion optical flow is proposed. A variety of large-scale dataset pretrain...
Autores principales: | Zhao, Hong, Guo, Lan, Chen, ZhiWen, Zheng, HouZe |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9071958/ https://www.ncbi.nlm.nih.gov/pubmed/35528356 http://dx.doi.org/10.1155/2022/1204909 |
Ejemplares similares
-
Semantic guidance network for video captioning
por: Guo, Lan, et al.
Publicado: (2023) -
Video captioning based on vision transformer and reinforcement learning
por: Zhao, Hong, et al.
Publicado: (2022) -
A Real-Time Fire Detection Method from Video with Multifeature Fusion
por: Gong, Faming, et al.
Publicado: (2019) -
Prediction and Estimation of River Velocity Based on GAN and Multifeature Fusion
por: Wang, Yan, et al.
Publicado: (2022) -
Automatic Microaneurysms Detection Based on Multifeature Fusion Dictionary Learning
por: Zhou, Wei, et al.
Publicado: (2017)