Cargando…

SLAM and 3D Semantic Reconstruction Based on the Fusion of Lidar and Monocular Vision

Monocular camera and Lidar are the two most commonly used sensors in unmanned vehicles. Combining the advantages of the two is the current research focus of SLAM and semantic analysis. In this paper, we propose an improved SLAM and semantic reconstruction method based on the fusion of Lidar and mono...

Descripción completa

Detalles Bibliográficos
Autores principales: Lou, Lu, Li, Yitian, Zhang, Qi, Wei, Hanbing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9920633/
https://www.ncbi.nlm.nih.gov/pubmed/36772544
http://dx.doi.org/10.3390/s23031502
_version_ 1784887118274756608
author Lou, Lu
Li, Yitian
Zhang, Qi
Wei, Hanbing
author_facet Lou, Lu
Li, Yitian
Zhang, Qi
Wei, Hanbing
author_sort Lou, Lu
collection PubMed
description Monocular camera and Lidar are the two most commonly used sensors in unmanned vehicles. Combining the advantages of the two is the current research focus of SLAM and semantic analysis. In this paper, we propose an improved SLAM and semantic reconstruction method based on the fusion of Lidar and monocular vision. We fuse the semantic image with the low-resolution 3D Lidar point clouds and generate dense semantic depth maps. Through visual odometry, ORB feature points with depth information are selected to improve positioning accuracy. Our method uses parallel threads to aggregate 3D semantic point clouds while positioning the unmanned vehicle. Experiments are conducted on the public CityScapes and KITTI Visual Odometry datasets, and the results show that compared with the ORB-SLAM2 and DynaSLAM, our positioning error is approximately reduced by 87%; compared with the DEMO and DVL-SLAM, our positioning accuracy improves in most sequences. Our 3D reconstruction quality is better than DynSLAM and contains semantic information. The proposed method has engineering application value in the unmanned vehicles field.
format Online
Article
Text
id pubmed-9920633
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-99206332023-02-12 SLAM and 3D Semantic Reconstruction Based on the Fusion of Lidar and Monocular Vision Lou, Lu Li, Yitian Zhang, Qi Wei, Hanbing Sensors (Basel) Article Monocular camera and Lidar are the two most commonly used sensors in unmanned vehicles. Combining the advantages of the two is the current research focus of SLAM and semantic analysis. In this paper, we propose an improved SLAM and semantic reconstruction method based on the fusion of Lidar and monocular vision. We fuse the semantic image with the low-resolution 3D Lidar point clouds and generate dense semantic depth maps. Through visual odometry, ORB feature points with depth information are selected to improve positioning accuracy. Our method uses parallel threads to aggregate 3D semantic point clouds while positioning the unmanned vehicle. Experiments are conducted on the public CityScapes and KITTI Visual Odometry datasets, and the results show that compared with the ORB-SLAM2 and DynaSLAM, our positioning error is approximately reduced by 87%; compared with the DEMO and DVL-SLAM, our positioning accuracy improves in most sequences. Our 3D reconstruction quality is better than DynSLAM and contains semantic information. The proposed method has engineering application value in the unmanned vehicles field. MDPI 2023-01-29 /pmc/articles/PMC9920633/ /pubmed/36772544 http://dx.doi.org/10.3390/s23031502 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Lou, Lu
Li, Yitian
Zhang, Qi
Wei, Hanbing
SLAM and 3D Semantic Reconstruction Based on the Fusion of Lidar and Monocular Vision
title SLAM and 3D Semantic Reconstruction Based on the Fusion of Lidar and Monocular Vision
title_full SLAM and 3D Semantic Reconstruction Based on the Fusion of Lidar and Monocular Vision
title_fullStr SLAM and 3D Semantic Reconstruction Based on the Fusion of Lidar and Monocular Vision
title_full_unstemmed SLAM and 3D Semantic Reconstruction Based on the Fusion of Lidar and Monocular Vision
title_short SLAM and 3D Semantic Reconstruction Based on the Fusion of Lidar and Monocular Vision
title_sort slam and 3d semantic reconstruction based on the fusion of lidar and monocular vision
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9920633/
https://www.ncbi.nlm.nih.gov/pubmed/36772544
http://dx.doi.org/10.3390/s23031502
work_keys_str_mv AT loulu slamand3dsemanticreconstructionbasedonthefusionoflidarandmonocularvision
AT liyitian slamand3dsemanticreconstructionbasedonthefusionoflidarandmonocularvision
AT zhangqi slamand3dsemanticreconstructionbasedonthefusionoflidarandmonocularvision
AT weihanbing slamand3dsemanticreconstructionbasedonthefusionoflidarandmonocularvision