Cargando…

Visual Object Recognition with 3D-Aware Features in KITTI Urban Scenes

Driver assistance systems and autonomous robotics rely on the deployment of several sensors for environment perception. Compared to LiDAR systems, the inexpensive vision sensors can capture the 3D scene as perceived by a driver in terms of appearance and depth cues. Indeed, providing 3D image unders...

Descripción completa

Detalles Bibliográficos
Autores principales: Yebes, J. Javier, Bergasa, Luis M., García-Garrido, Miguel Ángel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4431302/
https://www.ncbi.nlm.nih.gov/pubmed/25903553
http://dx.doi.org/10.3390/s150409228
_version_ 1782371320012472320
author Yebes, J. Javier
Bergasa, Luis M.
García-Garrido, Miguel Ángel
author_facet Yebes, J. Javier
Bergasa, Luis M.
García-Garrido, Miguel Ángel
author_sort Yebes, J. Javier
collection PubMed
description Driver assistance systems and autonomous robotics rely on the deployment of several sensors for environment perception. Compared to LiDAR systems, the inexpensive vision sensors can capture the 3D scene as perceived by a driver in terms of appearance and depth cues. Indeed, providing 3D image understanding capabilities to vehicles is an essential target in order to infer scene semantics in urban environments. One of the challenges that arises from the navigation task in naturalistic urban scenarios is the detection of road participants (e.g., cyclists, pedestrians and vehicles). In this regard, this paper tackles the detection and orientation estimation of cars, pedestrians and cyclists, employing the challenging and naturalistic KITTI images. This work proposes 3D-aware features computed from stereo color images in order to capture the appearance and depth peculiarities of the objects in road scenes. The successful part-based object detector, known as DPM, is extended to learn richer models from the 2.5D data (color and disparity), while also carrying out a detailed analysis of the training pipeline. A large set of experiments evaluate the proposals, and the best performing approach is ranked on the KITTI website. Indeed, this is the first work that reports results with stereo data for the KITTI object challenge, achieving increased detection ratios for the classes car and cyclist compared to a baseline DPM.
format Online
Article
Text
id pubmed-4431302
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-44313022015-05-19 Visual Object Recognition with 3D-Aware Features in KITTI Urban Scenes Yebes, J. Javier Bergasa, Luis M. García-Garrido, Miguel Ángel Sensors (Basel) Article Driver assistance systems and autonomous robotics rely on the deployment of several sensors for environment perception. Compared to LiDAR systems, the inexpensive vision sensors can capture the 3D scene as perceived by a driver in terms of appearance and depth cues. Indeed, providing 3D image understanding capabilities to vehicles is an essential target in order to infer scene semantics in urban environments. One of the challenges that arises from the navigation task in naturalistic urban scenarios is the detection of road participants (e.g., cyclists, pedestrians and vehicles). In this regard, this paper tackles the detection and orientation estimation of cars, pedestrians and cyclists, employing the challenging and naturalistic KITTI images. This work proposes 3D-aware features computed from stereo color images in order to capture the appearance and depth peculiarities of the objects in road scenes. The successful part-based object detector, known as DPM, is extended to learn richer models from the 2.5D data (color and disparity), while also carrying out a detailed analysis of the training pipeline. A large set of experiments evaluate the proposals, and the best performing approach is ranked on the KITTI website. Indeed, this is the first work that reports results with stereo data for the KITTI object challenge, achieving increased detection ratios for the classes car and cyclist compared to a baseline DPM. MDPI 2015-04-20 /pmc/articles/PMC4431302/ /pubmed/25903553 http://dx.doi.org/10.3390/s150409228 Text en © 2015 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Yebes, J. Javier
Bergasa, Luis M.
García-Garrido, Miguel Ángel
Visual Object Recognition with 3D-Aware Features in KITTI Urban Scenes
title Visual Object Recognition with 3D-Aware Features in KITTI Urban Scenes
title_full Visual Object Recognition with 3D-Aware Features in KITTI Urban Scenes
title_fullStr Visual Object Recognition with 3D-Aware Features in KITTI Urban Scenes
title_full_unstemmed Visual Object Recognition with 3D-Aware Features in KITTI Urban Scenes
title_short Visual Object Recognition with 3D-Aware Features in KITTI Urban Scenes
title_sort visual object recognition with 3d-aware features in kitti urban scenes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4431302/
https://www.ncbi.nlm.nih.gov/pubmed/25903553
http://dx.doi.org/10.3390/s150409228
work_keys_str_mv AT yebesjjavier visualobjectrecognitionwith3dawarefeaturesinkittiurbanscenes
AT bergasaluism visualobjectrecognitionwith3dawarefeaturesinkittiurbanscenes
AT garciagarridomiguelangel visualobjectrecognitionwith3dawarefeaturesinkittiurbanscenes