Cargando…

Cross Encoder-Decoder Transformer with Global-Local Visual Extractor for Medical Image Captioning

Transformer-based approaches have shown good results in image captioning tasks. However, current approaches have a limitation in generating text from global features of an entire image. Therefore, we propose novel methods for generating better image captioning as follows: (1) The Global-Local Visual...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Hojun, Cho, Hyunjun, Park, Jieun, Chae, Jinyeong, Kim, Jihie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8874388/
https://www.ncbi.nlm.nih.gov/pubmed/35214330
http://dx.doi.org/10.3390/s22041429