An effective spatial relational reasoning networks for visual question answering
Visual Question Answering (VQA) is the task of answering natural-language questions based on the content of images, and it has attracted wide attention from researchers. Existing research on visual question answering models mainly focuses on attention mechanisms and multi-modal...
Main authors: | Shen, Xiang; Han, Dezhi; Chen, Chongqing; Luo, Gaofeng; Wu, Zhongdai |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Public Library of Science 2022 |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9704574/ https://www.ncbi.nlm.nih.gov/pubmed/36441742 http://dx.doi.org/10.1371/journal.pone.0277693 |
Similar items
- An Effective Dense Co-Attention Networks for Visual Question Answering
  by: He, Shirong, et al.
  Published: (2020)
- Multi-Modal Explicit Sparse Attention Networks for Visual Question Answering
  by: Guo, Zihan, et al.
  Published: (2020)
- Learning to Reason on Tree Structures for Knowledge-Based Visual Question Answering
  by: Li, Qifeng, et al.
  Published: (2022)
- Multi-modal adaptive gated mechanism for visual question answering
  by: Xu, Yangshuyi, et al.
  Published: (2023)
- Deep Modular Bilinear Attention Network for Visual Question Answering
  by: Yan, Feng, et al.
  Published: (2022)