Joint Multimodal Embedding and Backtracking Search in Vision-and-Language Navigation

Due to the development of computer vision and natural language processing technologies in recent years, there has been a growing interest in multimodal intelligent tasks that require the ability to concurrently understand various forms of input data such as images and text. Vision-and-language navig...

Bibliographic Details
Main authors: Hwang, Jisu; Kim, Incheol
Format: Online Article Text
Language: English
Published: MDPI 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7867342/
https://www.ncbi.nlm.nih.gov/pubmed/33540789
http://dx.doi.org/10.3390/s21031012