Cargando…
SLAM and 3D Semantic Reconstruction Based on the Fusion of Lidar and Monocular Vision
Monocular camera and Lidar are the two most commonly used sensors in unmanned vehicles. Combining the advantages of the two is the current research focus of SLAM and semantic analysis. In this paper, we propose an improved SLAM and semantic reconstruction method based on the fusion of Lidar and mono...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9920633/ https://www.ncbi.nlm.nih.gov/pubmed/36772544 http://dx.doi.org/10.3390/s23031502 |
_version_ | 1784887118274756608 |
---|---|
author | Lou, Lu Li, Yitian Zhang, Qi Wei, Hanbing |
author_facet | Lou, Lu Li, Yitian Zhang, Qi Wei, Hanbing |
author_sort | Lou, Lu |
collection | PubMed |
description | Monocular camera and Lidar are the two most commonly used sensors in unmanned vehicles. Combining the advantages of the two is the current research focus of SLAM and semantic analysis. In this paper, we propose an improved SLAM and semantic reconstruction method based on the fusion of Lidar and monocular vision. We fuse the semantic image with the low-resolution 3D Lidar point clouds and generate dense semantic depth maps. Through visual odometry, ORB feature points with depth information are selected to improve positioning accuracy. Our method uses parallel threads to aggregate 3D semantic point clouds while positioning the unmanned vehicle. Experiments are conducted on the public CityScapes and KITTI Visual Odometry datasets, and the results show that compared with the ORB-SLAM2 and DynaSLAM, our positioning error is approximately reduced by 87%; compared with the DEMO and DVL-SLAM, our positioning accuracy improves in most sequences. Our 3D reconstruction quality is better than DynSLAM and contains semantic information. The proposed method has engineering application value in the unmanned vehicles field. |
format | Online Article Text |
id | pubmed-9920633 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-99206332023-02-12 SLAM and 3D Semantic Reconstruction Based on the Fusion of Lidar and Monocular Vision Lou, Lu Li, Yitian Zhang, Qi Wei, Hanbing Sensors (Basel) Article Monocular camera and Lidar are the two most commonly used sensors in unmanned vehicles. Combining the advantages of the two is the current research focus of SLAM and semantic analysis. In this paper, we propose an improved SLAM and semantic reconstruction method based on the fusion of Lidar and monocular vision. We fuse the semantic image with the low-resolution 3D Lidar point clouds and generate dense semantic depth maps. Through visual odometry, ORB feature points with depth information are selected to improve positioning accuracy. Our method uses parallel threads to aggregate 3D semantic point clouds while positioning the unmanned vehicle. Experiments are conducted on the public CityScapes and KITTI Visual Odometry datasets, and the results show that compared with the ORB-SLAM2 and DynaSLAM, our positioning error is approximately reduced by 87%; compared with the DEMO and DVL-SLAM, our positioning accuracy improves in most sequences. Our 3D reconstruction quality is better than DynSLAM and contains semantic information. The proposed method has engineering application value in the unmanned vehicles field. MDPI 2023-01-29 /pmc/articles/PMC9920633/ /pubmed/36772544 http://dx.doi.org/10.3390/s23031502 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Lou, Lu Li, Yitian Zhang, Qi Wei, Hanbing SLAM and 3D Semantic Reconstruction Based on the Fusion of Lidar and Monocular Vision |
title | SLAM and 3D Semantic Reconstruction Based on the Fusion of Lidar and Monocular Vision |
title_full | SLAM and 3D Semantic Reconstruction Based on the Fusion of Lidar and Monocular Vision |
title_fullStr | SLAM and 3D Semantic Reconstruction Based on the Fusion of Lidar and Monocular Vision |
title_full_unstemmed | SLAM and 3D Semantic Reconstruction Based on the Fusion of Lidar and Monocular Vision |
title_short | SLAM and 3D Semantic Reconstruction Based on the Fusion of Lidar and Monocular Vision |
title_sort | slam and 3d semantic reconstruction based on the fusion of lidar and monocular vision |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9920633/ https://www.ncbi.nlm.nih.gov/pubmed/36772544 http://dx.doi.org/10.3390/s23031502 |
work_keys_str_mv | AT loulu slamand3dsemanticreconstructionbasedonthefusionoflidarandmonocularvision AT liyitian slamand3dsemanticreconstructionbasedonthefusionoflidarandmonocularvision AT zhangqi slamand3dsemanticreconstructionbasedonthefusionoflidarandmonocularvision AT weihanbing slamand3dsemanticreconstructionbasedonthefusionoflidarandmonocularvision |