Cargando…

MDS-Net: Multi-Scale Depth Stratification 3D Object Detection from Monocular Images

Monocular 3D object detection is very challenging in autonomous driving due to the lack of depth information. This paper proposes a one-stage monocular 3D object detection network (MDS Net), which uses the anchor-free method to detect 3D objects in a per-pixel prediction. Firstly, a novel depth-base...

Descripción completa

Detalles Bibliográficos
Autores principales: Xie, Zhouzhen, Song, Yuying, Wu, Jingxuan, Li, Zecheng, Song, Chunyi, Xu, Zhiwei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9415185/
https://www.ncbi.nlm.nih.gov/pubmed/36015965
http://dx.doi.org/10.3390/s22166197
_version_ 1784776169651961856
author Xie, Zhouzhen
Song, Yuying
Wu, Jingxuan
Li, Zecheng
Song, Chunyi
Xu, Zhiwei
author_facet Xie, Zhouzhen
Song, Yuying
Wu, Jingxuan
Li, Zecheng
Song, Chunyi
Xu, Zhiwei
author_sort Xie, Zhouzhen
collection PubMed
description Monocular 3D object detection is very challenging in autonomous driving due to the lack of depth information. This paper proposes a one-stage monocular 3D object detection network (MDS Net), which uses the anchor-free method to detect 3D objects in a per-pixel prediction. Firstly, a novel depth-based stratification structure is developed to improve the network’s ability of depth prediction, which exploits the mathematical relationship between the size and the depth in the image of an object based on the pinhole model. Secondly, a new angle loss function is developed to further improve both the accuracy of the angle prediction and the convergence speed of training. An optimized Soft-NMS is finally applied in the post-processing stage to adjust the confidence score of the candidate boxes. Experiment results on the KITTI benchmark demonstrate that the proposed MDS-Net outperforms the existing monocular 3D detection methods in both tasks of 3D detection and BEV detection while fulfilling real-time requirements.
format Online
Article
Text
id pubmed-9415185
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-94151852022-08-27 MDS-Net: Multi-Scale Depth Stratification 3D Object Detection from Monocular Images Xie, Zhouzhen Song, Yuying Wu, Jingxuan Li, Zecheng Song, Chunyi Xu, Zhiwei Sensors (Basel) Article Monocular 3D object detection is very challenging in autonomous driving due to the lack of depth information. This paper proposes a one-stage monocular 3D object detection network (MDS Net), which uses the anchor-free method to detect 3D objects in a per-pixel prediction. Firstly, a novel depth-based stratification structure is developed to improve the network’s ability of depth prediction, which exploits the mathematical relationship between the size and the depth in the image of an object based on the pinhole model. Secondly, a new angle loss function is developed to further improve both the accuracy of the angle prediction and the convergence speed of training. An optimized Soft-NMS is finally applied in the post-processing stage to adjust the confidence score of the candidate boxes. Experiment results on the KITTI benchmark demonstrate that the proposed MDS-Net outperforms the existing monocular 3D detection methods in both tasks of 3D detection and BEV detection while fulfilling real-time requirements. MDPI 2022-08-18 /pmc/articles/PMC9415185/ /pubmed/36015965 http://dx.doi.org/10.3390/s22166197 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Xie, Zhouzhen
Song, Yuying
Wu, Jingxuan
Li, Zecheng
Song, Chunyi
Xu, Zhiwei
MDS-Net: Multi-Scale Depth Stratification 3D Object Detection from Monocular Images
title MDS-Net: Multi-Scale Depth Stratification 3D Object Detection from Monocular Images
title_full MDS-Net: Multi-Scale Depth Stratification 3D Object Detection from Monocular Images
title_fullStr MDS-Net: Multi-Scale Depth Stratification 3D Object Detection from Monocular Images
title_full_unstemmed MDS-Net: Multi-Scale Depth Stratification 3D Object Detection from Monocular Images
title_short MDS-Net: Multi-Scale Depth Stratification 3D Object Detection from Monocular Images
title_sort mds-net: multi-scale depth stratification 3d object detection from monocular images
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9415185/
https://www.ncbi.nlm.nih.gov/pubmed/36015965
http://dx.doi.org/10.3390/s22166197
work_keys_str_mv AT xiezhouzhen mdsnetmultiscaledepthstratification3dobjectdetectionfrommonocularimages
AT songyuying mdsnetmultiscaledepthstratification3dobjectdetectionfrommonocularimages
AT wujingxuan mdsnetmultiscaledepthstratification3dobjectdetectionfrommonocularimages
AT lizecheng mdsnetmultiscaledepthstratification3dobjectdetectionfrommonocularimages
AT songchunyi mdsnetmultiscaledepthstratification3dobjectdetectionfrommonocularimages
AT xuzhiwei mdsnetmultiscaledepthstratification3dobjectdetectionfrommonocularimages