Cargando…

Reference Pose Generation for Long-term Visual Localization via Learned Features and View Synthesis

Visual Localization is one of the key enabling technologies for autonomous driving and augmented reality. High quality datasets with accurate 6 Degree-of-Freedom (DoF) reference poses are the foundation for benchmarking and improving existing methods. Traditionally, reference poses have been obtaine...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Zichao, Sattler, Torsten, Scaramuzza, Davide
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8550273/
https://www.ncbi.nlm.nih.gov/pubmed/34720404
http://dx.doi.org/10.1007/s11263-020-01399-8
_version_ 1784590925261963264
author Zhang, Zichao
Sattler, Torsten
Scaramuzza, Davide
author_facet Zhang, Zichao
Sattler, Torsten
Scaramuzza, Davide
author_sort Zhang, Zichao
collection PubMed
description Visual Localization is one of the key enabling technologies for autonomous driving and augmented reality. High quality datasets with accurate 6 Degree-of-Freedom (DoF) reference poses are the foundation for benchmarking and improving existing methods. Traditionally, reference poses have been obtained via Structure-from-Motion (SfM). However, SfM itself relies on local features which are prone to fail when images were taken under different conditions, e.g., day/night changes. At the same time, manually annotating feature correspondences is not scalable and potentially inaccurate. In this work, we propose a semi-automated approach to generate reference poses based on feature matching between renderings of a 3D model and real images via learned features. Given an initial pose estimate, our approach iteratively refines the pose based on feature matches against a rendering of the model from the current pose estimate. We significantly improve the nighttime reference poses of the popular Aachen Day–Night dataset, showing that state-of-the-art visual localization methods perform better (up to 47%) than predicted by the original reference poses. We extend the dataset with new nighttime test images, provide uncertainty estimates for our new reference poses, and introduce a new evaluation criterion. We will make our reference poses and our framework publicly available upon publication.
format Online
Article
Text
id pubmed-8550273
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-85502732021-10-29 Reference Pose Generation for Long-term Visual Localization via Learned Features and View Synthesis Zhang, Zichao Sattler, Torsten Scaramuzza, Davide Int J Comput Vis Article Visual Localization is one of the key enabling technologies for autonomous driving and augmented reality. High quality datasets with accurate 6 Degree-of-Freedom (DoF) reference poses are the foundation for benchmarking and improving existing methods. Traditionally, reference poses have been obtained via Structure-from-Motion (SfM). However, SfM itself relies on local features which are prone to fail when images were taken under different conditions, e.g., day/night changes. At the same time, manually annotating feature correspondences is not scalable and potentially inaccurate. In this work, we propose a semi-automated approach to generate reference poses based on feature matching between renderings of a 3D model and real images via learned features. Given an initial pose estimate, our approach iteratively refines the pose based on feature matches against a rendering of the model from the current pose estimate. We significantly improve the nighttime reference poses of the popular Aachen Day–Night dataset, showing that state-of-the-art visual localization methods perform better (up to 47%) than predicted by the original reference poses. We extend the dataset with new nighttime test images, provide uncertainty estimates for our new reference poses, and introduce a new evaluation criterion. We will make our reference poses and our framework publicly available upon publication. Springer US 2020-12-23 2021 /pmc/articles/PMC8550273/ /pubmed/34720404 http://dx.doi.org/10.1007/s11263-020-01399-8 Text en © The Author(s) 2020 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Zhang, Zichao
Sattler, Torsten
Scaramuzza, Davide
Reference Pose Generation for Long-term Visual Localization via Learned Features and View Synthesis
title Reference Pose Generation for Long-term Visual Localization via Learned Features and View Synthesis
title_full Reference Pose Generation for Long-term Visual Localization via Learned Features and View Synthesis
title_fullStr Reference Pose Generation for Long-term Visual Localization via Learned Features and View Synthesis
title_full_unstemmed Reference Pose Generation for Long-term Visual Localization via Learned Features and View Synthesis
title_short Reference Pose Generation for Long-term Visual Localization via Learned Features and View Synthesis
title_sort reference pose generation for long-term visual localization via learned features and view synthesis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8550273/
https://www.ncbi.nlm.nih.gov/pubmed/34720404
http://dx.doi.org/10.1007/s11263-020-01399-8
work_keys_str_mv AT zhangzichao referenceposegenerationforlongtermvisuallocalizationvialearnedfeaturesandviewsynthesis
AT sattlertorsten referenceposegenerationforlongtermvisuallocalizationvialearnedfeaturesandviewsynthesis
AT scaramuzzadavide referenceposegenerationforlongtermvisuallocalizationvialearnedfeaturesandviewsynthesis