DiT-SLAM: Real-Time Dense Visual-Inertial SLAM with Implicit Depth Representation and Tightly-Coupled Graph Optimization
Main Authors: | Zhao, Mingle; Zhou, Dingfu; Song, Xibin; Chen, Xiuwan; Zhang, Liangjun |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2022 |
Subjects: | Article |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9102487/ https://www.ncbi.nlm.nih.gov/pubmed/35591079 http://dx.doi.org/10.3390/s22093389 |
_version_ | 1784707340384075776 |
---|---|
author | Zhao, Mingle Zhou, Dingfu Song, Xibin Chen, Xiuwan Zhang, Liangjun |
author_facet | Zhao, Mingle Zhou, Dingfu Song, Xibin Chen, Xiuwan Zhang, Liangjun |
author_sort | Zhao, Mingle |
collection | PubMed |
description | Recently, generating dense maps in real time has become a hot research topic in the mobile robotics community, since dense maps provide more informative and continuous features than sparse maps. Implicit depth representations (e.g., depth codes) derived from deep neural networks have been employed in visual-only or visual-inertial simultaneous localization and mapping (SLAM) systems, which achieve promising performance on both camera motion and local dense geometry estimation from monocular images. However, existing visual-inertial SLAM systems that use depth codes are either built on a filter-based SLAM framework, which can only update poses and maps within a relatively small local time window, or on a loosely-coupled framework, in which the prior geometric constraints from the depth estimation network are not exploited to boost state estimation. To address these drawbacks, we propose DiT-SLAM, a novel real-time Dense visual-inertial SLAM system with implicit depth representation and Tightly-coupled graph optimization. Most importantly, poses, sparse maps, and low-dimensional depth codes are optimized in a tightly-coupled graph that considers visual, inertial, and depth residuals simultaneously. In addition, we propose a lightweight monocular depth estimation and completion network, which combines attention mechanisms with a conditional variational auto-encoder (CVAE) to predict uncertainty-aware dense depth maps from lower-dimensional codes. Furthermore, a robust point sampling strategy that accounts for the spatial distribution of 2D feature points is proposed to provide geometric constraints in the tightly-coupled optimization, especially for textureless or featureless regions in indoor environments. We evaluate our system on open benchmarks; the proposed methods achieve better performance on both dense depth estimation and trajectory estimation than the baseline and other systems. |
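The abstract gives no implementation details, but the core idea of the tightly-coupled graph, jointly refining the pose, the inertial state, and a low-dimensional depth code against visual, inertial, and depth residuals, can be illustrated with a minimal sketch. The code below is hypothetical and not from the paper: a fixed linear map stands in for the CVAE decoder, the pose is reduced to a translation, and the pre-integrated IMU term is a toy velocity measurement; all names and dimensions are assumptions made for illustration.

```python
# Hypothetical sketch of a tightly-coupled refinement over a pose, a velocity state,
# and an implicit depth code, stacking visual, inertial, and depth residuals into
# one least-squares problem. Illustrative only; not the authors' implementation.
import numpy as np
from scipy.optimize import least_squares

rng = np.random.default_rng(0)

CODE_DIM = 8        # size of the low-dimensional depth code
N_POINTS = 20       # sampled 2D feature points with associated sparse depths
FX = FY = 400.0     # toy pinhole intrinsics
CX = CY = 320.0

# Fixed linear map standing in for the CVAE decoder: code -> depth at the sampled points.
DECODER = rng.normal(size=(N_POINTS, CODE_DIM))
pix = rng.uniform(100.0, 540.0, size=(N_POINTS, 2))      # observed 2D feature locations
depth_sparse = rng.uniform(1.0, 5.0, size=N_POINTS)      # sparse (triangulated) depths
vel_meas = np.array([0.1, 0.0, 0.05])                    # toy pre-integrated IMU velocity


def back_project(uv, depth):
    """Lift 2D pixels with depth to 3D points in the camera frame."""
    x = (uv[:, 0] - CX) / FX * depth
    y = (uv[:, 1] - CY) / FY * depth
    return np.stack([x, y, depth], axis=1)


def residuals(state):
    """Stack visual, inertial, and depth residuals into one tightly-coupled problem."""
    t = state[:3]        # keyframe translation (rotation omitted for brevity)
    v = state[3:6]       # velocity state
    code = state[6:]     # implicit depth code

    # Visual residual: reprojection error of the sparse landmarks after applying t.
    pts = back_project(pix, depth_sparse) + t
    proj = np.stack([FX * pts[:, 0] / pts[:, 2] + CX,
                     FY * pts[:, 1] / pts[:, 2] + CY], axis=1)
    r_visual = (proj - pix).ravel()

    # Inertial residual: estimated velocity vs. the pre-integrated measurement.
    r_inertial = v - vel_meas

    # Depth residual: dense depth decoded from the code vs. sparse depths at the
    # sampled feature points (the prior geometric constraint from the network).
    r_depth = DECODER @ code - depth_sparse

    return np.concatenate([r_visual, r_inertial, r_depth])


x0 = np.zeros(6 + CODE_DIM)            # initial pose, velocity, and depth code
sol = least_squares(residuals, x0)     # joint (tightly-coupled) refinement
print("translation:", np.round(sol.x[:3], 4))
print("depth code:", np.round(sol.x[6:], 3))
```

In the actual system described above, the visual and inertial terms would be full reprojection and IMU pre-integration factors over SE(3) poses, and the depth term would compare the decoded, uncertainty-aware dense depth against depths at robustly sampled feature points; the sketch only shows how the three residual types share one joint optimization.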
format | Online Article Text |
id | pubmed-9102487 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-9102487 2022-05-14 DiT-SLAM: Real-Time Dense Visual-Inertial SLAM with Implicit Depth Representation and Tightly-Coupled Graph Optimization Zhao, Mingle Zhou, Dingfu Song, Xibin Chen, Xiuwan Zhang, Liangjun Sensors (Basel) Article Recently, generating dense maps in real time has become a hot research topic in the mobile robotics community, since dense maps provide more informative and continuous features than sparse maps. Implicit depth representations (e.g., depth codes) derived from deep neural networks have been employed in visual-only or visual-inertial simultaneous localization and mapping (SLAM) systems, which achieve promising performance on both camera motion and local dense geometry estimation from monocular images. However, existing visual-inertial SLAM systems that use depth codes are either built on a filter-based SLAM framework, which can only update poses and maps within a relatively small local time window, or on a loosely-coupled framework, in which the prior geometric constraints from the depth estimation network are not exploited to boost state estimation. To address these drawbacks, we propose DiT-SLAM, a novel real-time Dense visual-inertial SLAM system with implicit depth representation and Tightly-coupled graph optimization. Most importantly, poses, sparse maps, and low-dimensional depth codes are optimized in a tightly-coupled graph that considers visual, inertial, and depth residuals simultaneously. In addition, we propose a lightweight monocular depth estimation and completion network, which combines attention mechanisms with a conditional variational auto-encoder (CVAE) to predict uncertainty-aware dense depth maps from lower-dimensional codes. Furthermore, a robust point sampling strategy that accounts for the spatial distribution of 2D feature points is proposed to provide geometric constraints in the tightly-coupled optimization, especially for textureless or featureless regions in indoor environments. We evaluate our system on open benchmarks; the proposed methods achieve better performance on both dense depth estimation and trajectory estimation than the baseline and other systems. MDPI 2022-04-28 /pmc/articles/PMC9102487/ /pubmed/35591079 http://dx.doi.org/10.3390/s22093389 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/ Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Zhao, Mingle Zhou, Dingfu Song, Xibin Chen, Xiuwan Zhang, Liangjun DiT-SLAM: Real-Time Dense Visual-Inertial SLAM with Implicit Depth Representation and Tightly-Coupled Graph Optimization |
title | DiT-SLAM: Real-Time Dense Visual-Inertial SLAM with Implicit Depth Representation and Tightly-Coupled Graph Optimization |
title_full | DiT-SLAM: Real-Time Dense Visual-Inertial SLAM with Implicit Depth Representation and Tightly-Coupled Graph Optimization |
title_fullStr | DiT-SLAM: Real-Time Dense Visual-Inertial SLAM with Implicit Depth Representation and Tightly-Coupled Graph Optimization |
title_full_unstemmed | DiT-SLAM: Real-Time Dense Visual-Inertial SLAM with Implicit Depth Representation and Tightly-Coupled Graph Optimization |
title_short | DiT-SLAM: Real-Time Dense Visual-Inertial SLAM with Implicit Depth Representation and Tightly-Coupled Graph Optimization |
title_sort | dit-slam: real-time dense visual-inertial slam with implicit depth representation and tightly-coupled graph optimization |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9102487/ https://www.ncbi.nlm.nih.gov/pubmed/35591079 http://dx.doi.org/10.3390/s22093389 |
work_keys_str_mv | AT zhaomingle ditslamrealtimedensevisualinertialslamwithimplicitdepthrepresentationandtightlycoupledgraphoptimization AT zhoudingfu ditslamrealtimedensevisualinertialslamwithimplicitdepthrepresentationandtightlycoupledgraphoptimization AT songxibin ditslamrealtimedensevisualinertialslamwithimplicitdepthrepresentationandtightlycoupledgraphoptimization AT chenxiuwan ditslamrealtimedensevisualinertialslamwithimplicitdepthrepresentationandtightlycoupledgraphoptimization AT zhangliangjun ditslamrealtimedensevisualinertialslamwithimplicitdepthrepresentationandtightlycoupledgraphoptimization |