
Outdoor Vision-and-Language Navigation Needs Object-Level Alignment

In the field of embodied AI, vision-and-language navigation (VLN) is a crucial and challenging multi-modal task. Specifically, outdoor VLN involves an agent navigating within a graph-based environment, while simultaneously interpreting information from real-world urban environments and natural langu...


Bibliographic Details
Main Authors:	Sun, Yanjun; Qiu, Yue; Aoki, Yoshimitsu; Kataoka, Hirokatsu
Format:	Online Article Text
Language:	English
Published:	MDPI 2023
Subjects:	Article
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10346337/ https://www.ncbi.nlm.nih.gov/pubmed/37447877 http://dx.doi.org/10.3390/s23136028

Similar Items

Vital information matching in vision-and-language navigation
by: Jia, Zixi, et al.
Published: (2022)

Joint Multimodal Embedding and Backtracking Search in Vision-and-Language Navigation
by: Hwang, Jisu, et al.
Published: (2021)

Temporal and Fine-Grained Pedestrian Action Recognition on Driving Recorder Database
by: Kataoka, Hirokatsu, et al.
Published: (2018)

A Robust Indoor/Outdoor Navigation Filter Fusing Data from Vision and Magneto-Inertial Measurement Unit
by: Caruso, David, et al.
Published: (2017)

Outdoor Lighting: Physics, Vision and Perception
by: Schreuder, Duco
Published: (2008)

Vision/INS Integrated Navigation System for Poor Vision Navigation Environments
by: Kim, Youngsun, et al.
Published: (2016)

A Vision Aided Initial Alignment Method of Strapdown Inertial Navigation Systems in Polar Regions
by: Zhang, Fubin, et al.
Published: (2022)

Machine vision and navigation
by: Sergiyenko, Oleg, et al.
Published: (2019)

Object representations in the human brain reflect the co-occurrence statistics of vision and language
by: Bonner, Michael F., et al.
Published: (2021)

Navigating the vision science Internet
by: Mannion, Damien J.
Published: (2012)

Language of vision
by: Kepes, Gyorgy, 1906-2001
Published: (1995)

The Language of Vision*
by: Cavanagh, Patrick
Published: (2021)

“Vision for Action” in Young Children Aligning Multi-Featured Objects: Development and Comparison with Nonhuman Primates
by: Fragaszy, Dorothy Munkenbeck, et al.
Published: (2015)

A Modular Vision Language Navigation and Manipulation Framework for Long Horizon Compositional Tasks in Indoor Environment
by: Saha, Homagni, et al.
Published: (2022)

Quantifying navigational information: The catchment volumes of panoramic snapshots in outdoor scenes
by: Murray, Trevor, et al.
Published: (2017)

Uncertainty-Aware Visual Perception System for Outdoor Navigation of the Visually Challenged
by: Dimas, George, et al.
Published: (2020)

Machine Vision Navigation in Spine Surgery
by: Kalfas, Iain H.
Published: (2021)

On the Vision of Objects on and in the Eye
by: Mackenzie, William
Published: (1845)

A Mobile Outdoor Augmented Reality Method Combining Deep Learning Object Detection and Spatial Relationships for Geovisualization
by: Rao, Jinmeng, et al.
Published: (2017)

Time outdoors positively associates with academic performance: a school-based study with objective monitoring of outdoor time
by: Wang, Jingjing, et al.
Published: (2023)

An Aerial–Ground Robotic System for Navigation and Obstacle Mapping in Large Outdoor Areas
by: Garzón, Mario, et al.
Published: (2013)

Loosely Coupled GNSS and UWB with INS Integration for Indoor/Outdoor Pedestrian Navigation †
by: Di Pietra, Vincenzo, et al.
Published: (2020)

The impact of different distractions on outdoor visual search and object memory
by: Nachtnebel, Sarah Jasmin, et al.
Published: (2023)

Indoor and outdoor human behavior and myopia: an objective and dynamic study
by: Harb, Elise N., et al.
Published: (2023)

Objectively measured near work, outdoor exposure and myopia in children
by: Wen, Longbo, et al.
Published: (2020)

Point Cloud Compression: Impact on Object Detection in Outdoor Contexts
by: Garrote, Luís, et al.
Published: (2022)

Integrating vision and echolocation for navigation and perception in bats
by: Danilovich, S., et al.
Published: (2019)

Navigating C++ and object-oriented design
by: Anderson, Paul, 1949-
Published: (1998)

Navigating C++ and object-oriented design
by: Anderson, Gail C, et al.
Published: (1998)

An Application of Artificial Intelligence to Diagnostic Imaging of Spine Disease: Estimating Spinal Alignment From Moiré Images
by: Watanabe, Kota, et al.
Published: (2019)

The C Object System: Using C as a High-Level Object-Oriented Language
by: Deniau, Laurent
Published: (2010)

Path Markup Language for Indoor Navigation
by: Cai, Yang, et al.
Published: (2020)

Are we seeing clearly? The need for aligned vision and supporting strategies to deliver net-zero electricity systems
by: Ford, Rebecca, et al.
Published: (2020)

Vision-Based Real-Time Traversable Region Detection for Mobile Robot in the Outdoors
by: Deng, Fucheng, et al.
Published: (2017)

Indoor Scene Change Captioning Based on Multimodality Data
by: Qiu, Yue, et al.
Published: (2020)

Multi-View Visual Question Answering with Active Viewpoint Selection
by: Qiu, Yue, et al.
Published: (2020)

The Evolution of Visual Roles – Ancient Vision Versus Object Vision
by: Nilsson, Dan-Eric
Published: (2022)

Automated measurement: The need for a more objective view of the speech and language of autistic children
by: Leland, Eraine, et al.
Published: (2023)

UNav: An Infrastructure-Independent Vision-Based Navigation System for People with Blindness and Low Vision
by: Yang, Anbang, et al.
Published: (2022)

Quad Rotorcraft Control: Vision-Based Hovering and Navigation
by: García Carrillo, Luis Rodolfo, et al.
Published: (2013)
