Cargando…
Improved Visual Localization via Graph Filtering
Vision-based localization is the problem of inferring the pose of the camera given a single image. One commonly used approach relies on image retrieval where the query input is compared against a database of localized support examples and its pose is inferred with the help of the retrieved items. Th...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8321269/ https://www.ncbi.nlm.nih.gov/pubmed/34460619 http://dx.doi.org/10.3390/jimaging7020020 |
_version_ | 1783730811007139840 |
---|---|
author | Lassance, Carlos Latif, Yasir Garg, Ravi Gripon, Vincent Reid, Ian |
author_facet | Lassance, Carlos Latif, Yasir Garg, Ravi Gripon, Vincent Reid, Ian |
author_sort | Lassance, Carlos |
collection | PubMed |
description | Vision-based localization is the problem of inferring the pose of the camera given a single image. One commonly used approach relies on image retrieval where the query input is compared against a database of localized support examples and its pose is inferred with the help of the retrieved items. This assumes that images taken from the same places consist of the same landmarks and thus would have similar feature representations. These representations can learn to be robust to different variations in capture conditions like time of the day or weather. In this work, we introduce a framework which aims at enhancing the performance of such retrieval-based localization methods. It consists in taking into account additional information available, such as GPS coordinates or temporal proximity in the acquisition of the images. More precisely, our method consists in constructing a graph based on this additional information that is later used to improve reliability of the retrieval process by filtering the feature representations of support and/or query images. We show that the proposed method is able to significantly improve the localization accuracy on two large scale datasets, as well as the mean average precision in classical image retrieval scenarios. |
format | Online Article Text |
id | pubmed-8321269 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-83212692021-08-26 Improved Visual Localization via Graph Filtering Lassance, Carlos Latif, Yasir Garg, Ravi Gripon, Vincent Reid, Ian J Imaging Article Vision-based localization is the problem of inferring the pose of the camera given a single image. One commonly used approach relies on image retrieval where the query input is compared against a database of localized support examples and its pose is inferred with the help of the retrieved items. This assumes that images taken from the same places consist of the same landmarks and thus would have similar feature representations. These representations can learn to be robust to different variations in capture conditions like time of the day or weather. In this work, we introduce a framework which aims at enhancing the performance of such retrieval-based localization methods. It consists in taking into account additional information available, such as GPS coordinates or temporal proximity in the acquisition of the images. More precisely, our method consists in constructing a graph based on this additional information that is later used to improve reliability of the retrieval process by filtering the feature representations of support and/or query images. We show that the proposed method is able to significantly improve the localization accuracy on two large scale datasets, as well as the mean average precision in classical image retrieval scenarios. MDPI 2021-01-30 /pmc/articles/PMC8321269/ /pubmed/34460619 http://dx.doi.org/10.3390/jimaging7020020 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ). |
spellingShingle | Article Lassance, Carlos Latif, Yasir Garg, Ravi Gripon, Vincent Reid, Ian Improved Visual Localization via Graph Filtering |
title | Improved Visual Localization via Graph Filtering |
title_full | Improved Visual Localization via Graph Filtering |
title_fullStr | Improved Visual Localization via Graph Filtering |
title_full_unstemmed | Improved Visual Localization via Graph Filtering |
title_short | Improved Visual Localization via Graph Filtering |
title_sort | improved visual localization via graph filtering |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8321269/ https://www.ncbi.nlm.nih.gov/pubmed/34460619 http://dx.doi.org/10.3390/jimaging7020020 |
work_keys_str_mv | AT lassancecarlos improvedvisuallocalizationviagraphfiltering AT latifyasir improvedvisuallocalizationviagraphfiltering AT gargravi improvedvisuallocalizationviagraphfiltering AT griponvincent improvedvisuallocalizationviagraphfiltering AT reidian improvedvisuallocalizationviagraphfiltering |