Cargando…

Exploring RGB+Depth Fusion for Real-Time Object Detection

In this paper, we investigate whether fusing depth information on top of normal RGB data for camera-based object detection can help to increase the performance of current state-of-the-art single-shot detection networks. Indeed, depth sensing is easily acquired using depth cameras such as a Kinect or...

Descripción completa

Detalles Bibliográficos
Autores principales: Ophoff, Tanguy, Van Beeck, Kristof, Goedemé, Toon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6412390/
https://www.ncbi.nlm.nih.gov/pubmed/30791476
http://dx.doi.org/10.3390/s19040866
_version_ 1783402594300854272
author Ophoff, Tanguy
Van Beeck, Kristof
Goedemé, Toon
author_facet Ophoff, Tanguy
Van Beeck, Kristof
Goedemé, Toon
author_sort Ophoff, Tanguy
collection PubMed
description In this paper, we investigate whether fusing depth information on top of normal RGB data for camera-based object detection can help to increase the performance of current state-of-the-art single-shot detection networks. Indeed, depth sensing is easily acquired using depth cameras such as a Kinect or stereo setups. We investigate the optimal manner to perform this sensor fusion with a special focus on lightweight single-pass convolutional neural network (CNN) architectures, enabling real-time processing on limited hardware. For this, we implement a network architecture allowing us to parameterize at which network layer both information sources are fused together. We performed exhaustive experiments to determine the optimal fusion point in the network, from which we can conclude that fusing towards the mid to late layers provides the best results. Our best fusion models significantly outperform the baseline RGB network in both accuracy and localization of the detections.
format Online
Article
Text
id pubmed-6412390
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-64123902019-04-03 Exploring RGB+Depth Fusion for Real-Time Object Detection Ophoff, Tanguy Van Beeck, Kristof Goedemé, Toon Sensors (Basel) Article In this paper, we investigate whether fusing depth information on top of normal RGB data for camera-based object detection can help to increase the performance of current state-of-the-art single-shot detection networks. Indeed, depth sensing is easily acquired using depth cameras such as a Kinect or stereo setups. We investigate the optimal manner to perform this sensor fusion with a special focus on lightweight single-pass convolutional neural network (CNN) architectures, enabling real-time processing on limited hardware. For this, we implement a network architecture allowing us to parameterize at which network layer both information sources are fused together. We performed exhaustive experiments to determine the optimal fusion point in the network, from which we can conclude that fusing towards the mid to late layers provides the best results. Our best fusion models significantly outperform the baseline RGB network in both accuracy and localization of the detections. MDPI 2019-02-19 /pmc/articles/PMC6412390/ /pubmed/30791476 http://dx.doi.org/10.3390/s19040866 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Ophoff, Tanguy
Van Beeck, Kristof
Goedemé, Toon
Exploring RGB+Depth Fusion for Real-Time Object Detection
title Exploring RGB+Depth Fusion for Real-Time Object Detection
title_full Exploring RGB+Depth Fusion for Real-Time Object Detection
title_fullStr Exploring RGB+Depth Fusion for Real-Time Object Detection
title_full_unstemmed Exploring RGB+Depth Fusion for Real-Time Object Detection
title_short Exploring RGB+Depth Fusion for Real-Time Object Detection
title_sort exploring rgb+depth fusion for real-time object detection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6412390/
https://www.ncbi.nlm.nih.gov/pubmed/30791476
http://dx.doi.org/10.3390/s19040866
work_keys_str_mv AT ophofftanguy exploringrgbdepthfusionforrealtimeobjectdetection
AT vanbeeckkristof exploringrgbdepthfusionforrealtimeobjectdetection
AT goedemetoon exploringrgbdepthfusionforrealtimeobjectdetection