Adversarial Learning with Bidirectional Attention for Visual Question Answering
In this paper, we provide external image features and use the internal attention mechanism to solve the VQA problem given a dataset of textual questions and related images. Most previous models for VQA use a pair of images and questions as input. In addition, the model adopts a question-oriented att...
Main authors: | Li, Qifeng, Tang, Xinyi, Jian, Yi |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2021 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8586915/ https://www.ncbi.nlm.nih.gov/pubmed/34770471 http://dx.doi.org/10.3390/s21217164 |
Similar items
- Learning to Reason on Tree Structures for Knowledge-Based Visual Question Answering
by: Li, Qifeng, et al.
Published: (2022)
- Deep Modular Bilinear Attention Network for Visual Question Answering
by: Yan, Feng, et al.
Published: (2022)
- An Effective Dense Co-Attention Networks for Visual Question Answering
by: He, Shirong, et al.
Published: (2020)
- Multi-Modal Explicit Sparse Attention Networks for Visual Question Answering
by: Guo, Zihan, et al.
Published: (2020)
- Parallel multi-head attention and term-weighted question embedding for medical visual question answering
by: Manmadhan, Sruthy, et al.
Published: (2023)