Cargando…

Cross Encoder-Decoder Transformer with Global-Local Visual Extractor for Medical Image Captioning

Transformer-based approaches have shown good results in image captioning tasks. However, current approaches have a limitation in generating text from global features of an entire image. Therefore, we propose novel methods for generating better image captioning as follows: (1) The Global-Local Visual...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lee, Hojun, Cho, Hyunjun, Park, Jieun, Chae, Jinyeong, Kim, Jihie
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Perspective
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8874388/ https://www.ncbi.nlm.nih.gov/pubmed/35214330 http://dx.doi.org/10.3390/s22041429

Internet

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8874388/
https://www.ncbi.nlm.nih.gov/pubmed/35214330
http://dx.doi.org/10.3390/s22041429

Cross Encoder-Decoder Transformer with Global-Local Visual Extractor for Medical Image Captioning

Internet

Ejemplares similares