Cargando…

Improved Visual Localization via Graph Filtering

Vision-based localization is the problem of inferring the pose of the camera given a single image. One commonly used approach relies on image retrieval where the query input is compared against a database of localized support examples and its pose is inferred with the help of the retrieved items. Th...

Descripción completa

Detalles Bibliográficos
Autores principales: Lassance, Carlos, Latif, Yasir, Garg, Ravi, Gripon, Vincent, Reid, Ian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8321269/
https://www.ncbi.nlm.nih.gov/pubmed/34460619
http://dx.doi.org/10.3390/jimaging7020020
_version_ 1783730811007139840
author Lassance, Carlos
Latif, Yasir
Garg, Ravi
Gripon, Vincent
Reid, Ian
author_facet Lassance, Carlos
Latif, Yasir
Garg, Ravi
Gripon, Vincent
Reid, Ian
author_sort Lassance, Carlos
collection PubMed
description Vision-based localization is the problem of inferring the pose of the camera given a single image. One commonly used approach relies on image retrieval where the query input is compared against a database of localized support examples and its pose is inferred with the help of the retrieved items. This assumes that images taken from the same places consist of the same landmarks and thus would have similar feature representations. These representations can learn to be robust to different variations in capture conditions like time of the day or weather. In this work, we introduce a framework which aims at enhancing the performance of such retrieval-based localization methods. It consists in taking into account additional information available, such as GPS coordinates or temporal proximity in the acquisition of the images. More precisely, our method consists in constructing a graph based on this additional information that is later used to improve reliability of the retrieval process by filtering the feature representations of support and/or query images. We show that the proposed method is able to significantly improve the localization accuracy on two large scale datasets, as well as the mean average precision in classical image retrieval scenarios.
format Online
Article
Text
id pubmed-8321269
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-83212692021-08-26 Improved Visual Localization via Graph Filtering Lassance, Carlos Latif, Yasir Garg, Ravi Gripon, Vincent Reid, Ian J Imaging Article Vision-based localization is the problem of inferring the pose of the camera given a single image. One commonly used approach relies on image retrieval where the query input is compared against a database of localized support examples and its pose is inferred with the help of the retrieved items. This assumes that images taken from the same places consist of the same landmarks and thus would have similar feature representations. These representations can learn to be robust to different variations in capture conditions like time of the day or weather. In this work, we introduce a framework which aims at enhancing the performance of such retrieval-based localization methods. It consists in taking into account additional information available, such as GPS coordinates or temporal proximity in the acquisition of the images. More precisely, our method consists in constructing a graph based on this additional information that is later used to improve reliability of the retrieval process by filtering the feature representations of support and/or query images. We show that the proposed method is able to significantly improve the localization accuracy on two large scale datasets, as well as the mean average precision in classical image retrieval scenarios. MDPI 2021-01-30 /pmc/articles/PMC8321269/ /pubmed/34460619 http://dx.doi.org/10.3390/jimaging7020020 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ).
spellingShingle Article
Lassance, Carlos
Latif, Yasir
Garg, Ravi
Gripon, Vincent
Reid, Ian
Improved Visual Localization via Graph Filtering
title Improved Visual Localization via Graph Filtering
title_full Improved Visual Localization via Graph Filtering
title_fullStr Improved Visual Localization via Graph Filtering
title_full_unstemmed Improved Visual Localization via Graph Filtering
title_short Improved Visual Localization via Graph Filtering
title_sort improved visual localization via graph filtering
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8321269/
https://www.ncbi.nlm.nih.gov/pubmed/34460619
http://dx.doi.org/10.3390/jimaging7020020
work_keys_str_mv AT lassancecarlos improvedvisuallocalizationviagraphfiltering
AT latifyasir improvedvisuallocalizationviagraphfiltering
AT gargravi improvedvisuallocalizationviagraphfiltering
AT griponvincent improvedvisuallocalizationviagraphfiltering
AT reidian improvedvisuallocalizationviagraphfiltering