Cargando…

Faster R-CNN for Robust Pedestrian Detection Using Semantic Segmentation Network

Convolutional neural networks (CNN) have enabled significant improvements in pedestrian detection owing to the strong representation ability of the CNN features. However, it is generally difficult to reduce false positives on hard negative samples such as tree leaves, traffic lights, poles, etc. Som...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Tianrui, Stathaki, Tania
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6182048/
https://www.ncbi.nlm.nih.gov/pubmed/30344486
http://dx.doi.org/10.3389/fnbot.2018.00064
_version_ 1783362481828134912
author Liu, Tianrui
Stathaki, Tania
author_facet Liu, Tianrui
Stathaki, Tania
author_sort Liu, Tianrui
collection PubMed
description Convolutional neural networks (CNN) have enabled significant improvements in pedestrian detection owing to the strong representation ability of the CNN features. However, it is generally difficult to reduce false positives on hard negative samples such as tree leaves, traffic lights, poles, etc. Some of these hard negatives can be removed by making use of high level semantic vision cues. In this paper, we propose a region-based CNN method which makes use of semantic cues for better pedestrian detection. Our method extends the Faster R-CNN detection framework by adding a branch of network for semantic image segmentation. The semantic network aims to compute complementary higher level semantic features to be integrated with the convolutional features. We make use of multi-resolution feature maps extracted from different network layers in order to ensure good detection accuracy for pedestrians at different scales. Boosted forest is used for training the integrated features in a cascaded manner for hard negatives mining. Experiments on the Caltech pedestrian dataset show improvements on detection accuracy with the semantic network. With the deep VGG16 model, our pedestrian detection method achieves robust detection performance on the Caltech dataset.
format Online
Article
Text
id pubmed-6182048
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-61820482018-10-19 Faster R-CNN for Robust Pedestrian Detection Using Semantic Segmentation Network Liu, Tianrui Stathaki, Tania Front Neurorobot Robotics and AI Convolutional neural networks (CNN) have enabled significant improvements in pedestrian detection owing to the strong representation ability of the CNN features. However, it is generally difficult to reduce false positives on hard negative samples such as tree leaves, traffic lights, poles, etc. Some of these hard negatives can be removed by making use of high level semantic vision cues. In this paper, we propose a region-based CNN method which makes use of semantic cues for better pedestrian detection. Our method extends the Faster R-CNN detection framework by adding a branch of network for semantic image segmentation. The semantic network aims to compute complementary higher level semantic features to be integrated with the convolutional features. We make use of multi-resolution feature maps extracted from different network layers in order to ensure good detection accuracy for pedestrians at different scales. Boosted forest is used for training the integrated features in a cascaded manner for hard negatives mining. Experiments on the Caltech pedestrian dataset show improvements on detection accuracy with the semantic network. With the deep VGG16 model, our pedestrian detection method achieves robust detection performance on the Caltech dataset. Frontiers Media S.A. 2018-10-05 /pmc/articles/PMC6182048/ /pubmed/30344486 http://dx.doi.org/10.3389/fnbot.2018.00064 Text en Copyright © 2018 Liu and Stathaki. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Robotics and AI
Liu, Tianrui
Stathaki, Tania
Faster R-CNN for Robust Pedestrian Detection Using Semantic Segmentation Network
title Faster R-CNN for Robust Pedestrian Detection Using Semantic Segmentation Network
title_full Faster R-CNN for Robust Pedestrian Detection Using Semantic Segmentation Network
title_fullStr Faster R-CNN for Robust Pedestrian Detection Using Semantic Segmentation Network
title_full_unstemmed Faster R-CNN for Robust Pedestrian Detection Using Semantic Segmentation Network
title_short Faster R-CNN for Robust Pedestrian Detection Using Semantic Segmentation Network
title_sort faster r-cnn for robust pedestrian detection using semantic segmentation network
topic Robotics and AI
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6182048/
https://www.ncbi.nlm.nih.gov/pubmed/30344486
http://dx.doi.org/10.3389/fnbot.2018.00064
work_keys_str_mv AT liutianrui fasterrcnnforrobustpedestriandetectionusingsemanticsegmentationnetwork
AT stathakitania fasterrcnnforrobustpedestriandetectionusingsemanticsegmentationnetwork