Cargando…

An Enhanced Region Proposal Network for object detection using deep learning method

Faster Region-based Convolutional Network (Faster R-CNN) is a state-of-the-art object detection method. However, the object detection effect of Faster R-CNN is not good based on the Region Proposal Network (RPN). Inspired by RPN of Faster R-CNN, we propose a novel proposal generation method called E...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Yu Peng, Li, Ying, Wang, Gang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6147513/
https://www.ncbi.nlm.nih.gov/pubmed/30235238
http://dx.doi.org/10.1371/journal.pone.0203897
_version_ 1783356565136343040
author Chen, Yu Peng
Li, Ying
Wang, Gang
author_facet Chen, Yu Peng
Li, Ying
Wang, Gang
author_sort Chen, Yu Peng
collection PubMed
description Faster Region-based Convolutional Network (Faster R-CNN) is a state-of-the-art object detection method. However, the object detection effect of Faster R-CNN is not good based on the Region Proposal Network (RPN). Inspired by RPN of Faster R-CNN, we propose a novel proposal generation method called Enhanced Region Proposal Network (ERPN). Four improvements are presented in ERPN. Firstly, our proposed deconvolutional feature pyramid network (DFPN) is introduced to improve the quality of region proposals. Secondly, novel anchor boxes are designed with interspersed scales and adaptive aspect ratios. Thereafter, the capability of object localization is increased. Thirdly, a particle swarm optimization (PSO) based support vector machine (SVM), termed PSO-SVM, is developed to distinguish the positive and negative anchor boxes. Fourthly, the classification part of multi-task loss function in RPN is improved. Consequently, the effect of classification loss is strengthened. In this study, our proposed ERPN is compared with five object detection methods on both PASCAL VOC and COCO data sets. For the VGG-16 model, our ERPN obtains 78.6% mAP on VOC 2007 data set, 74.4% mAP on VOC 2012 data set and 31.7% on COCO data set. The performance of ERPN is the best among the comparison object detection methods. Furthermore, the detection speed of ERPN is 5.8 fps. Additionally, ERPN obtains good effect on small object detection.
format Online
Article
Text
id pubmed-6147513
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-61475132018-10-08 An Enhanced Region Proposal Network for object detection using deep learning method Chen, Yu Peng Li, Ying Wang, Gang PLoS One Research Article Faster Region-based Convolutional Network (Faster R-CNN) is a state-of-the-art object detection method. However, the object detection effect of Faster R-CNN is not good based on the Region Proposal Network (RPN). Inspired by RPN of Faster R-CNN, we propose a novel proposal generation method called Enhanced Region Proposal Network (ERPN). Four improvements are presented in ERPN. Firstly, our proposed deconvolutional feature pyramid network (DFPN) is introduced to improve the quality of region proposals. Secondly, novel anchor boxes are designed with interspersed scales and adaptive aspect ratios. Thereafter, the capability of object localization is increased. Thirdly, a particle swarm optimization (PSO) based support vector machine (SVM), termed PSO-SVM, is developed to distinguish the positive and negative anchor boxes. Fourthly, the classification part of multi-task loss function in RPN is improved. Consequently, the effect of classification loss is strengthened. In this study, our proposed ERPN is compared with five object detection methods on both PASCAL VOC and COCO data sets. For the VGG-16 model, our ERPN obtains 78.6% mAP on VOC 2007 data set, 74.4% mAP on VOC 2012 data set and 31.7% on COCO data set. The performance of ERPN is the best among the comparison object detection methods. Furthermore, the detection speed of ERPN is 5.8 fps. Additionally, ERPN obtains good effect on small object detection. Public Library of Science 2018-09-20 /pmc/articles/PMC6147513/ /pubmed/30235238 http://dx.doi.org/10.1371/journal.pone.0203897 Text en © 2018 Chen et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Chen, Yu Peng
Li, Ying
Wang, Gang
An Enhanced Region Proposal Network for object detection using deep learning method
title An Enhanced Region Proposal Network for object detection using deep learning method
title_full An Enhanced Region Proposal Network for object detection using deep learning method
title_fullStr An Enhanced Region Proposal Network for object detection using deep learning method
title_full_unstemmed An Enhanced Region Proposal Network for object detection using deep learning method
title_short An Enhanced Region Proposal Network for object detection using deep learning method
title_sort enhanced region proposal network for object detection using deep learning method
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6147513/
https://www.ncbi.nlm.nih.gov/pubmed/30235238
http://dx.doi.org/10.1371/journal.pone.0203897
work_keys_str_mv AT chenyupeng anenhancedregionproposalnetworkforobjectdetectionusingdeeplearningmethod
AT liying anenhancedregionproposalnetworkforobjectdetectionusingdeeplearningmethod
AT wanggang anenhancedregionproposalnetworkforobjectdetectionusingdeeplearningmethod
AT chenyupeng enhancedregionproposalnetworkforobjectdetectionusingdeeplearningmethod
AT liying enhancedregionproposalnetworkforobjectdetectionusingdeeplearningmethod
AT wanggang enhancedregionproposalnetworkforobjectdetectionusingdeeplearningmethod