Cargando…
An Enhanced Region Proposal Network for object detection using deep learning method
Faster Region-based Convolutional Network (Faster R-CNN) is a state-of-the-art object detection method. However, the object detection effect of Faster R-CNN is not good based on the Region Proposal Network (RPN). Inspired by RPN of Faster R-CNN, we propose a novel proposal generation method called E...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6147513/ https://www.ncbi.nlm.nih.gov/pubmed/30235238 http://dx.doi.org/10.1371/journal.pone.0203897 |
_version_ | 1783356565136343040 |
---|---|
author | Chen, Yu Peng Li, Ying Wang, Gang |
author_facet | Chen, Yu Peng Li, Ying Wang, Gang |
author_sort | Chen, Yu Peng |
collection | PubMed |
description | Faster Region-based Convolutional Network (Faster R-CNN) is a state-of-the-art object detection method. However, the object detection effect of Faster R-CNN is not good based on the Region Proposal Network (RPN). Inspired by RPN of Faster R-CNN, we propose a novel proposal generation method called Enhanced Region Proposal Network (ERPN). Four improvements are presented in ERPN. Firstly, our proposed deconvolutional feature pyramid network (DFPN) is introduced to improve the quality of region proposals. Secondly, novel anchor boxes are designed with interspersed scales and adaptive aspect ratios. Thereafter, the capability of object localization is increased. Thirdly, a particle swarm optimization (PSO) based support vector machine (SVM), termed PSO-SVM, is developed to distinguish the positive and negative anchor boxes. Fourthly, the classification part of multi-task loss function in RPN is improved. Consequently, the effect of classification loss is strengthened. In this study, our proposed ERPN is compared with five object detection methods on both PASCAL VOC and COCO data sets. For the VGG-16 model, our ERPN obtains 78.6% mAP on VOC 2007 data set, 74.4% mAP on VOC 2012 data set and 31.7% on COCO data set. The performance of ERPN is the best among the comparison object detection methods. Furthermore, the detection speed of ERPN is 5.8 fps. Additionally, ERPN obtains good effect on small object detection. |
format | Online Article Text |
id | pubmed-6147513 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-61475132018-10-08 An Enhanced Region Proposal Network for object detection using deep learning method Chen, Yu Peng Li, Ying Wang, Gang PLoS One Research Article Faster Region-based Convolutional Network (Faster R-CNN) is a state-of-the-art object detection method. However, the object detection effect of Faster R-CNN is not good based on the Region Proposal Network (RPN). Inspired by RPN of Faster R-CNN, we propose a novel proposal generation method called Enhanced Region Proposal Network (ERPN). Four improvements are presented in ERPN. Firstly, our proposed deconvolutional feature pyramid network (DFPN) is introduced to improve the quality of region proposals. Secondly, novel anchor boxes are designed with interspersed scales and adaptive aspect ratios. Thereafter, the capability of object localization is increased. Thirdly, a particle swarm optimization (PSO) based support vector machine (SVM), termed PSO-SVM, is developed to distinguish the positive and negative anchor boxes. Fourthly, the classification part of multi-task loss function in RPN is improved. Consequently, the effect of classification loss is strengthened. In this study, our proposed ERPN is compared with five object detection methods on both PASCAL VOC and COCO data sets. For the VGG-16 model, our ERPN obtains 78.6% mAP on VOC 2007 data set, 74.4% mAP on VOC 2012 data set and 31.7% on COCO data set. The performance of ERPN is the best among the comparison object detection methods. Furthermore, the detection speed of ERPN is 5.8 fps. Additionally, ERPN obtains good effect on small object detection. Public Library of Science 2018-09-20 /pmc/articles/PMC6147513/ /pubmed/30235238 http://dx.doi.org/10.1371/journal.pone.0203897 Text en © 2018 Chen et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Chen, Yu Peng Li, Ying Wang, Gang An Enhanced Region Proposal Network for object detection using deep learning method |
title | An Enhanced Region Proposal Network for object detection using deep learning method |
title_full | An Enhanced Region Proposal Network for object detection using deep learning method |
title_fullStr | An Enhanced Region Proposal Network for object detection using deep learning method |
title_full_unstemmed | An Enhanced Region Proposal Network for object detection using deep learning method |
title_short | An Enhanced Region Proposal Network for object detection using deep learning method |
title_sort | enhanced region proposal network for object detection using deep learning method |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6147513/ https://www.ncbi.nlm.nih.gov/pubmed/30235238 http://dx.doi.org/10.1371/journal.pone.0203897 |
work_keys_str_mv | AT chenyupeng anenhancedregionproposalnetworkforobjectdetectionusingdeeplearningmethod AT liying anenhancedregionproposalnetworkforobjectdetectionusingdeeplearningmethod AT wanggang anenhancedregionproposalnetworkforobjectdetectionusingdeeplearningmethod AT chenyupeng enhancedregionproposalnetworkforobjectdetectionusingdeeplearningmethod AT liying enhancedregionproposalnetworkforobjectdetectionusingdeeplearningmethod AT wanggang enhancedregionproposalnetworkforobjectdetectionusingdeeplearningmethod |