Cargando…

An Unsupervised Monocular Visual Odometry Based on Multi-Scale Modeling

Unsupervised deep learning methods have shown great success in jointly estimating camera pose and depth from monocular videos. However, previous methods mostly ignore the importance of multi-scale information, which is crucial for pose estimation and depth estimation, especially when the motion patt...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhi, Henghui, Yin, Chenyang, Li, Huibin, Pang, Shanmin
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9323830/ https://www.ncbi.nlm.nih.gov/pubmed/35890873 http://dx.doi.org/10.3390/s22145193

_version_	1784756650697031680
author	Zhi, Henghui Yin, Chenyang Li, Huibin Pang, Shanmin
author_facet	Zhi, Henghui Yin, Chenyang Li, Huibin Pang, Shanmin
author_sort	Zhi, Henghui
collection	PubMed
description	Unsupervised deep learning methods have shown great success in jointly estimating camera pose and depth from monocular videos. However, previous methods mostly ignore the importance of multi-scale information, which is crucial for pose estimation and depth estimation, especially when the motion pattern is changed. This article proposes an unsupervised framework for monocular visual odometry (VO) that can model multi-scale information. The proposed method utilizes densely linked atrous convolutions to increase the receptive field size without losing image information, and adopts a non-local self-attention mechanism to effectively model the long-range dependency. Both of them can model objects of different scales in the image, thereby improving the accuracy of VO, especially in rotating scenes. Extensive experiments on the KITTI dataset have shown that our approach is competitive with other state-of-the-art unsupervised learning-based monocular methods and is comparable to supervised or model-based methods. In particular, we have achieved state-of-the-art results on rotation estimation.
format	Online Article Text
id	pubmed-9323830
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-93238302022-07-27 An Unsupervised Monocular Visual Odometry Based on Multi-Scale Modeling Zhi, Henghui Yin, Chenyang Li, Huibin Pang, Shanmin Sensors (Basel) Article Unsupervised deep learning methods have shown great success in jointly estimating camera pose and depth from monocular videos. However, previous methods mostly ignore the importance of multi-scale information, which is crucial for pose estimation and depth estimation, especially when the motion pattern is changed. This article proposes an unsupervised framework for monocular visual odometry (VO) that can model multi-scale information. The proposed method utilizes densely linked atrous convolutions to increase the receptive field size without losing image information, and adopts a non-local self-attention mechanism to effectively model the long-range dependency. Both of them can model objects of different scales in the image, thereby improving the accuracy of VO, especially in rotating scenes. Extensive experiments on the KITTI dataset have shown that our approach is competitive with other state-of-the-art unsupervised learning-based monocular methods and is comparable to supervised or model-based methods. In particular, we have achieved state-of-the-art results on rotation estimation. MDPI 2022-07-11 /pmc/articles/PMC9323830/ /pubmed/35890873 http://dx.doi.org/10.3390/s22145193 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Zhi, Henghui Yin, Chenyang Li, Huibin Pang, Shanmin An Unsupervised Monocular Visual Odometry Based on Multi-Scale Modeling
title	An Unsupervised Monocular Visual Odometry Based on Multi-Scale Modeling
title_full	An Unsupervised Monocular Visual Odometry Based on Multi-Scale Modeling
title_fullStr	An Unsupervised Monocular Visual Odometry Based on Multi-Scale Modeling
title_full_unstemmed	An Unsupervised Monocular Visual Odometry Based on Multi-Scale Modeling
title_short	An Unsupervised Monocular Visual Odometry Based on Multi-Scale Modeling
title_sort	unsupervised monocular visual odometry based on multi-scale modeling
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9323830/ https://www.ncbi.nlm.nih.gov/pubmed/35890873 http://dx.doi.org/10.3390/s22145193
work_keys_str_mv	AT zhihenghui anunsupervisedmonocularvisualodometrybasedonmultiscalemodeling AT yinchenyang anunsupervisedmonocularvisualodometrybasedonmultiscalemodeling AT lihuibin anunsupervisedmonocularvisualodometrybasedonmultiscalemodeling AT pangshanmin anunsupervisedmonocularvisualodometrybasedonmultiscalemodeling AT zhihenghui unsupervisedmonocularvisualodometrybasedonmultiscalemodeling AT yinchenyang unsupervisedmonocularvisualodometrybasedonmultiscalemodeling AT lihuibin unsupervisedmonocularvisualodometrybasedonmultiscalemodeling AT pangshanmin unsupervisedmonocularvisualodometrybasedonmultiscalemodeling

An Unsupervised Monocular Visual Odometry Based on Multi-Scale Modeling

Ejemplares similares