Cargando…

Absolute Camera Pose Regression Using an RGB-D Dual-Stream Network and Handcrafted Base Poses

Absolute pose regression (APR) for camera localization is a single-shot approach that encodes the information of a 3D scene in an end-to-end neural network. The camera pose result of APR methods can be observed as the linear combination of the base poses. Previous APR methods’ base poses are learned...

Descripción completa

Detalles Bibliográficos
Autores principales: Kao, Peng-Yuan, Zhang, Rong-Rong, Chen, Timothy, Hung, Yi-Ping
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9503637/
https://www.ncbi.nlm.nih.gov/pubmed/36146335
http://dx.doi.org/10.3390/s22186971
_version_ 1784796015264530432
author Kao, Peng-Yuan
Zhang, Rong-Rong
Chen, Timothy
Hung, Yi-Ping
author_facet Kao, Peng-Yuan
Zhang, Rong-Rong
Chen, Timothy
Hung, Yi-Ping
author_sort Kao, Peng-Yuan
collection PubMed
description Absolute pose regression (APR) for camera localization is a single-shot approach that encodes the information of a 3D scene in an end-to-end neural network. The camera pose result of APR methods can be observed as the linear combination of the base poses. Previous APR methods’ base poses are learned from training data. However, the training data can limit the performance of the methods, which cannot be generalized to cover the entire scene. To solve this issue, we use handcrafted base poses instead of learning-based base poses, which prevents overfitting the camera poses of the training data. Moreover, we use a dual-stream network architecture to process color and depth images separately to get more accurate localization. On the 7 Scenes dataset, the proposed method is among the best in median rotation error, and in median translation error, it outperforms previous APR methods. On a more difficult dataset—Oxford RobotCar dataset, the proposed method achieves notable improvements in median translation and rotation errors compared to the state-of-the-art APR methods.
format Online
Article
Text
id pubmed-9503637
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-95036372022-09-24 Absolute Camera Pose Regression Using an RGB-D Dual-Stream Network and Handcrafted Base Poses Kao, Peng-Yuan Zhang, Rong-Rong Chen, Timothy Hung, Yi-Ping Sensors (Basel) Article Absolute pose regression (APR) for camera localization is a single-shot approach that encodes the information of a 3D scene in an end-to-end neural network. The camera pose result of APR methods can be observed as the linear combination of the base poses. Previous APR methods’ base poses are learned from training data. However, the training data can limit the performance of the methods, which cannot be generalized to cover the entire scene. To solve this issue, we use handcrafted base poses instead of learning-based base poses, which prevents overfitting the camera poses of the training data. Moreover, we use a dual-stream network architecture to process color and depth images separately to get more accurate localization. On the 7 Scenes dataset, the proposed method is among the best in median rotation error, and in median translation error, it outperforms previous APR methods. On a more difficult dataset—Oxford RobotCar dataset, the proposed method achieves notable improvements in median translation and rotation errors compared to the state-of-the-art APR methods. MDPI 2022-09-15 /pmc/articles/PMC9503637/ /pubmed/36146335 http://dx.doi.org/10.3390/s22186971 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Kao, Peng-Yuan
Zhang, Rong-Rong
Chen, Timothy
Hung, Yi-Ping
Absolute Camera Pose Regression Using an RGB-D Dual-Stream Network and Handcrafted Base Poses
title Absolute Camera Pose Regression Using an RGB-D Dual-Stream Network and Handcrafted Base Poses
title_full Absolute Camera Pose Regression Using an RGB-D Dual-Stream Network and Handcrafted Base Poses
title_fullStr Absolute Camera Pose Regression Using an RGB-D Dual-Stream Network and Handcrafted Base Poses
title_full_unstemmed Absolute Camera Pose Regression Using an RGB-D Dual-Stream Network and Handcrafted Base Poses
title_short Absolute Camera Pose Regression Using an RGB-D Dual-Stream Network and Handcrafted Base Poses
title_sort absolute camera pose regression using an rgb-d dual-stream network and handcrafted base poses
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9503637/
https://www.ncbi.nlm.nih.gov/pubmed/36146335
http://dx.doi.org/10.3390/s22186971
work_keys_str_mv AT kaopengyuan absolutecameraposeregressionusinganrgbddualstreamnetworkandhandcraftedbaseposes
AT zhangrongrong absolutecameraposeregressionusinganrgbddualstreamnetworkandhandcraftedbaseposes
AT chentimothy absolutecameraposeregressionusinganrgbddualstreamnetworkandhandcraftedbaseposes
AT hungyiping absolutecameraposeregressionusinganrgbddualstreamnetworkandhandcraftedbaseposes