Cargando…

Regression-Based Camera Pose Estimation through Multi-Level Local Features and Global Features

Accurate and robust camera pose estimation is essential for high-level applications such as augmented reality and autonomous driving. Despite the development of global feature-based camera pose regression methods and local feature-based matching guided pose estimation methods, challenging conditions...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Meng, Zhang, Zhihuang, Gong, Yuanhao, Poslad, Stefan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10146233/
https://www.ncbi.nlm.nih.gov/pubmed/37112404
http://dx.doi.org/10.3390/s23084063
_version_ 1785034531936403456
author Xu, Meng
Zhang, Zhihuang
Gong, Yuanhao
Poslad, Stefan
author_facet Xu, Meng
Zhang, Zhihuang
Gong, Yuanhao
Poslad, Stefan
author_sort Xu, Meng
collection PubMed
description Accurate and robust camera pose estimation is essential for high-level applications such as augmented reality and autonomous driving. Despite the development of global feature-based camera pose regression methods and local feature-based matching guided pose estimation methods, challenging conditions, such as illumination changes and viewpoint changes, as well as inaccurate keypoint localization, continue to affect the performance of camera pose estimation. In this paper, we propose a novel relative camera pose regression framework that uses global features with rotation consistency and local features with rotation invariance. First, we apply a multi-level deformable network to detect and describe local features, which can learn appearances and gradient information sensitive to rotation variants. Second, we process the detection and description processes using the results from pixel correspondences of the input image pairs. Finally, we propose a novel loss that combines relative regression loss and absolute regression loss, incorporating global features with geometric constraints to optimize the pose estimation model. Our extensive experiments report satisfactory accuracy on the 7Scenes dataset with an average mean translation error of 0.18 m and a rotation error of 7.44° using image pairs as input. Ablation studies were also conducted to verify the effectiveness of the proposed method in the tasks of pose estimation and image matching using the 7Scenes and HPatches datasets.
format Online
Article
Text
id pubmed-10146233
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-101462332023-04-29 Regression-Based Camera Pose Estimation through Multi-Level Local Features and Global Features Xu, Meng Zhang, Zhihuang Gong, Yuanhao Poslad, Stefan Sensors (Basel) Article Accurate and robust camera pose estimation is essential for high-level applications such as augmented reality and autonomous driving. Despite the development of global feature-based camera pose regression methods and local feature-based matching guided pose estimation methods, challenging conditions, such as illumination changes and viewpoint changes, as well as inaccurate keypoint localization, continue to affect the performance of camera pose estimation. In this paper, we propose a novel relative camera pose regression framework that uses global features with rotation consistency and local features with rotation invariance. First, we apply a multi-level deformable network to detect and describe local features, which can learn appearances and gradient information sensitive to rotation variants. Second, we process the detection and description processes using the results from pixel correspondences of the input image pairs. Finally, we propose a novel loss that combines relative regression loss and absolute regression loss, incorporating global features with geometric constraints to optimize the pose estimation model. Our extensive experiments report satisfactory accuracy on the 7Scenes dataset with an average mean translation error of 0.18 m and a rotation error of 7.44° using image pairs as input. Ablation studies were also conducted to verify the effectiveness of the proposed method in the tasks of pose estimation and image matching using the 7Scenes and HPatches datasets. MDPI 2023-04-18 /pmc/articles/PMC10146233/ /pubmed/37112404 http://dx.doi.org/10.3390/s23084063 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Xu, Meng
Zhang, Zhihuang
Gong, Yuanhao
Poslad, Stefan
Regression-Based Camera Pose Estimation through Multi-Level Local Features and Global Features
title Regression-Based Camera Pose Estimation through Multi-Level Local Features and Global Features
title_full Regression-Based Camera Pose Estimation through Multi-Level Local Features and Global Features
title_fullStr Regression-Based Camera Pose Estimation through Multi-Level Local Features and Global Features
title_full_unstemmed Regression-Based Camera Pose Estimation through Multi-Level Local Features and Global Features
title_short Regression-Based Camera Pose Estimation through Multi-Level Local Features and Global Features
title_sort regression-based camera pose estimation through multi-level local features and global features
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10146233/
https://www.ncbi.nlm.nih.gov/pubmed/37112404
http://dx.doi.org/10.3390/s23084063
work_keys_str_mv AT xumeng regressionbasedcameraposeestimationthroughmultilevellocalfeaturesandglobalfeatures
AT zhangzhihuang regressionbasedcameraposeestimationthroughmultilevellocalfeaturesandglobalfeatures
AT gongyuanhao regressionbasedcameraposeestimationthroughmultilevellocalfeaturesandglobalfeatures
AT posladstefan regressionbasedcameraposeestimationthroughmultilevellocalfeaturesandglobalfeatures