Cargando…

Anchor Generation Optimization and Region of Interest Assignment for Vehicle Detection

Region proposal network (RPN) based object detection, such as Faster Regions with CNN (Faster R-CNN), has gained considerable attention due to its high accuracy and fast speed. However, it has room for improvements when used in special application situations, such as the on-board vehicle detection....

Descripción completa

Detalles Bibliográficos
Autores principales:	Wang, Ye, Liu, Zhenyi, Deng, Weiwen
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2019
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6427343/ https://www.ncbi.nlm.nih.gov/pubmed/30832452 http://dx.doi.org/10.3390/s19051089

_version_	1783405188104585216
author	Wang, Ye Liu, Zhenyi Deng, Weiwen
author_facet	Wang, Ye Liu, Zhenyi Deng, Weiwen
author_sort	Wang, Ye
collection	PubMed
description	Region proposal network (RPN) based object detection, such as Faster Regions with CNN (Faster R-CNN), has gained considerable attention due to its high accuracy and fast speed. However, it has room for improvements when used in special application situations, such as the on-board vehicle detection. Original RPN locates multiscale anchors uniformly on each pixel of the last feature map and classifies whether an anchor is part of the foreground or background with one pixel in the last feature map. The receptive field of each pixel in the last feature map is fixed in the original faster R-CNN and does not coincide with the anchor size. Hence, only a certain part can be seen for large vehicles and too much useless information is contained in the feature for small vehicles. This reduces detection accuracy. Furthermore, the perspective projection results in the vehicle bounding box size becoming related to the bounding box position, thereby reducing the effectiveness and accuracy of the uniform anchor generation method. This reduces both detection accuracy and computing speed. After the region proposal stage, many regions of interest (ROI) are generated. The ROI pooling layer projects an ROI to the last feature map and forms a new feature map with a fixed size for final classification and box regression. The number of feature map pixels in the projected region can also influence the detection performance but this is not accurately controlled in former works. In this paper, the original faster R-CNN is optimized, especially for the on-board vehicle detection. This paper tries to solve these above-mentioned problems. The proposed method is tested on the KITTI dataset and the result shows a significant improvement without too many tricky parameter adjustments and training skills. The proposed method can also be used on other objects with obvious foreshortening effects, such as on-board pedestrian detection. The basic idea of the proposed method does not rely on concrete implementation and thus, most deep learning based object detectors with multiscale feature maps can be optimized with it.
format	Online Article Text
id	pubmed-6427343
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-64273432019-04-15 Anchor Generation Optimization and Region of Interest Assignment for Vehicle Detection Wang, Ye Liu, Zhenyi Deng, Weiwen Sensors (Basel) Article Region proposal network (RPN) based object detection, such as Faster Regions with CNN (Faster R-CNN), has gained considerable attention due to its high accuracy and fast speed. However, it has room for improvements when used in special application situations, such as the on-board vehicle detection. Original RPN locates multiscale anchors uniformly on each pixel of the last feature map and classifies whether an anchor is part of the foreground or background with one pixel in the last feature map. The receptive field of each pixel in the last feature map is fixed in the original faster R-CNN and does not coincide with the anchor size. Hence, only a certain part can be seen for large vehicles and too much useless information is contained in the feature for small vehicles. This reduces detection accuracy. Furthermore, the perspective projection results in the vehicle bounding box size becoming related to the bounding box position, thereby reducing the effectiveness and accuracy of the uniform anchor generation method. This reduces both detection accuracy and computing speed. After the region proposal stage, many regions of interest (ROI) are generated. The ROI pooling layer projects an ROI to the last feature map and forms a new feature map with a fixed size for final classification and box regression. The number of feature map pixels in the projected region can also influence the detection performance but this is not accurately controlled in former works. In this paper, the original faster R-CNN is optimized, especially for the on-board vehicle detection. This paper tries to solve these above-mentioned problems. The proposed method is tested on the KITTI dataset and the result shows a significant improvement without too many tricky parameter adjustments and training skills. The proposed method can also be used on other objects with obvious foreshortening effects, such as on-board pedestrian detection. The basic idea of the proposed method does not rely on concrete implementation and thus, most deep learning based object detectors with multiscale feature maps can be optimized with it. MDPI 2019-03-03 /pmc/articles/PMC6427343/ /pubmed/30832452 http://dx.doi.org/10.3390/s19051089 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Wang, Ye Liu, Zhenyi Deng, Weiwen Anchor Generation Optimization and Region of Interest Assignment for Vehicle Detection
title	Anchor Generation Optimization and Region of Interest Assignment for Vehicle Detection
title_full	Anchor Generation Optimization and Region of Interest Assignment for Vehicle Detection
title_fullStr	Anchor Generation Optimization and Region of Interest Assignment for Vehicle Detection
title_full_unstemmed	Anchor Generation Optimization and Region of Interest Assignment for Vehicle Detection
title_short	Anchor Generation Optimization and Region of Interest Assignment for Vehicle Detection
title_sort	anchor generation optimization and region of interest assignment for vehicle detection
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6427343/ https://www.ncbi.nlm.nih.gov/pubmed/30832452 http://dx.doi.org/10.3390/s19051089
work_keys_str_mv	AT wangye anchorgenerationoptimizationandregionofinterestassignmentforvehicledetection AT liuzhenyi anchorgenerationoptimizationandregionofinterestassignmentforvehicledetection AT dengweiwen anchorgenerationoptimizationandregionofinterestassignmentforvehicledetection

Anchor Generation Optimization and Region of Interest Assignment for Vehicle Detection

Ejemplares similares