Cargando…

A Modular Vision Language Navigation and Manipulation Framework for Long Horizon Compositional Tasks in Indoor Environment

In this paper we propose a new framework—MoViLan (Modular Vision and Language) for execution of visually grounded natural language instructions for day to day indoor household tasks. While several data-driven, end-to-end learning frameworks have been proposed for targeted navigation tasks based on t...

Descripción completa

Detalles Bibliográficos
Autores principales: Saha, Homagni, Fotouhi, Fateme, Liu, Qisai, Sarkar, Soumik
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9340572/
https://www.ncbi.nlm.nih.gov/pubmed/35923304
http://dx.doi.org/10.3389/frobt.2022.930486