Parallel multi-head attention and term-weighted question embedding for medical visual question answering
The goal of medical visual question answering (Med-VQA) is to correctly answer a clinical question posed about a medical image. Medical images are fundamentally different from images in the general domain. As a result, applying general-domain Visual Question Answering (VQA) models to the medical domain is...
Main authors: | Manmadhan, Sruthy; Kovoor, Binsu C |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Springer US, 2023 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10006552/ https://www.ncbi.nlm.nih.gov/pubmed/37362667 http://dx.doi.org/10.1007/s11042-023-14981-2 |
Similar items
- Multi-Modal Explicit Sparse Attention Networks for Visual Question Answering
  by: Guo, Zihan, et al.
  Published: (2020)
- Adversarial Learning with Bidirectional Attention for Visual Question Answering
  by: Li, Qifeng, et al.
  Published: (2021)
- The multi-modal fusion in visual question answering: a review of attention mechanisms
  by: Lu, Siyu, et al.
  Published: (2023)
- An Effective Dense Co-Attention Networks for Visual Question Answering
  by: He, Shirong, et al.
  Published: (2020)
- Deep Modular Bilinear Attention Network for Visual Question Answering
  by: Yan, Feng, et al.
  Published: (2022)