Cargando…

Modality attention fusion model with hybrid multi-head self-attention for video understanding

Video question answering (Video-QA) is a subject undergoing intense study in Artificial Intelligence, which is one of the tasks which can evaluate such AI abilities. In this paper, we propose a Modality Attention Fusion framework with Hybrid Multi-head Self-attention (MAF-HMS). MAF-HMS focuses on th...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhuang, Xuqiang, Liu, Fang’ai, Hou, Jian, Hao, Jianhua, Cai, Xiaohong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9536548/
https://www.ncbi.nlm.nih.gov/pubmed/36201513
http://dx.doi.org/10.1371/journal.pone.0275156