Cargando…

Research on Video Captioning Based on Multifeature Fusion

Aiming at the problems that the existing video captioning models pay attention to incomplete information and the generation of expression text is not accurate enough, a video captioning model that integrates image, audio, and motion optical flow is proposed. A variety of large-scale dataset pretrain...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Hong, Guo, Lan, Chen, ZhiWen, Zheng, HouZe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9071958/
https://www.ncbi.nlm.nih.gov/pubmed/35528356
http://dx.doi.org/10.1155/2022/1204909