Cargando…
Faster R-CNN for Robust Pedestrian Detection Using Semantic Segmentation Network
Convolutional neural networks (CNN) have enabled significant improvements in pedestrian detection owing to the strong representation ability of the CNN features. However, it is generally difficult to reduce false positives on hard negative samples such as tree leaves, traffic lights, poles, etc. Som...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6182048/ https://www.ncbi.nlm.nih.gov/pubmed/30344486 http://dx.doi.org/10.3389/fnbot.2018.00064 |
_version_ | 1783362481828134912 |
---|---|
author | Liu, Tianrui Stathaki, Tania |
author_facet | Liu, Tianrui Stathaki, Tania |
author_sort | Liu, Tianrui |
collection | PubMed |
description | Convolutional neural networks (CNN) have enabled significant improvements in pedestrian detection owing to the strong representation ability of the CNN features. However, it is generally difficult to reduce false positives on hard negative samples such as tree leaves, traffic lights, poles, etc. Some of these hard negatives can be removed by making use of high level semantic vision cues. In this paper, we propose a region-based CNN method which makes use of semantic cues for better pedestrian detection. Our method extends the Faster R-CNN detection framework by adding a branch of network for semantic image segmentation. The semantic network aims to compute complementary higher level semantic features to be integrated with the convolutional features. We make use of multi-resolution feature maps extracted from different network layers in order to ensure good detection accuracy for pedestrians at different scales. Boosted forest is used for training the integrated features in a cascaded manner for hard negatives mining. Experiments on the Caltech pedestrian dataset show improvements on detection accuracy with the semantic network. With the deep VGG16 model, our pedestrian detection method achieves robust detection performance on the Caltech dataset. |
format | Online Article Text |
id | pubmed-6182048 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-61820482018-10-19 Faster R-CNN for Robust Pedestrian Detection Using Semantic Segmentation Network Liu, Tianrui Stathaki, Tania Front Neurorobot Robotics and AI Convolutional neural networks (CNN) have enabled significant improvements in pedestrian detection owing to the strong representation ability of the CNN features. However, it is generally difficult to reduce false positives on hard negative samples such as tree leaves, traffic lights, poles, etc. Some of these hard negatives can be removed by making use of high level semantic vision cues. In this paper, we propose a region-based CNN method which makes use of semantic cues for better pedestrian detection. Our method extends the Faster R-CNN detection framework by adding a branch of network for semantic image segmentation. The semantic network aims to compute complementary higher level semantic features to be integrated with the convolutional features. We make use of multi-resolution feature maps extracted from different network layers in order to ensure good detection accuracy for pedestrians at different scales. Boosted forest is used for training the integrated features in a cascaded manner for hard negatives mining. Experiments on the Caltech pedestrian dataset show improvements on detection accuracy with the semantic network. With the deep VGG16 model, our pedestrian detection method achieves robust detection performance on the Caltech dataset. Frontiers Media S.A. 2018-10-05 /pmc/articles/PMC6182048/ /pubmed/30344486 http://dx.doi.org/10.3389/fnbot.2018.00064 Text en Copyright © 2018 Liu and Stathaki. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Robotics and AI Liu, Tianrui Stathaki, Tania Faster R-CNN for Robust Pedestrian Detection Using Semantic Segmentation Network |
title | Faster R-CNN for Robust Pedestrian Detection Using Semantic Segmentation Network |
title_full | Faster R-CNN for Robust Pedestrian Detection Using Semantic Segmentation Network |
title_fullStr | Faster R-CNN for Robust Pedestrian Detection Using Semantic Segmentation Network |
title_full_unstemmed | Faster R-CNN for Robust Pedestrian Detection Using Semantic Segmentation Network |
title_short | Faster R-CNN for Robust Pedestrian Detection Using Semantic Segmentation Network |
title_sort | faster r-cnn for robust pedestrian detection using semantic segmentation network |
topic | Robotics and AI |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6182048/ https://www.ncbi.nlm.nih.gov/pubmed/30344486 http://dx.doi.org/10.3389/fnbot.2018.00064 |
work_keys_str_mv | AT liutianrui fasterrcnnforrobustpedestriandetectionusingsemanticsegmentationnetwork AT stathakitania fasterrcnnforrobustpedestriandetectionusingsemanticsegmentationnetwork |