Cargando…

Object Detection Based on Faster R-CNN Algorithm with Skip Pooling and Fusion of Contextual Information

Deep learning is currently the mainstream method of object detection. Faster region-based convolutional neural network (Faster R-CNN) has a pivotal position in deep learning. It has impressive detection effects in ordinary scenes. However, under special conditions, there can still be unsatisfactory...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiao, Yi, Wang, Xinqing, Zhang, Peng, Meng, Fanjie, Shao, Faming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7582940/
https://www.ncbi.nlm.nih.gov/pubmed/32992739
http://dx.doi.org/10.3390/s20195490
_version_ 1783599306459054080
author Xiao, Yi
Wang, Xinqing
Zhang, Peng
Meng, Fanjie
Shao, Faming
author_facet Xiao, Yi
Wang, Xinqing
Zhang, Peng
Meng, Fanjie
Shao, Faming
author_sort Xiao, Yi
collection PubMed
description Deep learning is currently the mainstream method of object detection. Faster region-based convolutional neural network (Faster R-CNN) has a pivotal position in deep learning. It has impressive detection effects in ordinary scenes. However, under special conditions, there can still be unsatisfactory detection performance, such as the object having problems like occlusion, deformation, or small size. This paper proposes a novel and improved algorithm based on the Faster R-CNN framework combined with the Faster R-CNN algorithm with skip pooling and fusion of contextual information. This algorithm can improve the detection performance under special conditions on the basis of Faster R-CNN. The improvement mainly has three parts: The first part adds a context information feature extraction model after the conv5_3 of the convolutional layer; the second part adds skip pooling so that the former can fully obtain the contextual information of the object, especially for situations where the object is occluded and deformed; and the third part replaces the region proposal network (RPN) with a more efficient guided anchor RPN (GA-RPN), which can maintain the recall rate while improving the detection performance. The latter can obtain more detailed information from different feature layers of the deep neural network algorithm, and is especially aimed at scenes with small objects. Compared with Faster R-CNN, you only look once series (such as: YOLOv3), single shot detector (such as: SSD512), and other object detection algorithms, the algorithm proposed in this paper has an average improvement of 6.857% on the mean average precision (mAP) evaluation index while maintaining a certain recall rate. This strongly proves that the proposed method has higher detection rate and detection efficiency in this case.
format Online
Article
Text
id pubmed-7582940
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75829402020-10-28 Object Detection Based on Faster R-CNN Algorithm with Skip Pooling and Fusion of Contextual Information Xiao, Yi Wang, Xinqing Zhang, Peng Meng, Fanjie Shao, Faming Sensors (Basel) Article Deep learning is currently the mainstream method of object detection. Faster region-based convolutional neural network (Faster R-CNN) has a pivotal position in deep learning. It has impressive detection effects in ordinary scenes. However, under special conditions, there can still be unsatisfactory detection performance, such as the object having problems like occlusion, deformation, or small size. This paper proposes a novel and improved algorithm based on the Faster R-CNN framework combined with the Faster R-CNN algorithm with skip pooling and fusion of contextual information. This algorithm can improve the detection performance under special conditions on the basis of Faster R-CNN. The improvement mainly has three parts: The first part adds a context information feature extraction model after the conv5_3 of the convolutional layer; the second part adds skip pooling so that the former can fully obtain the contextual information of the object, especially for situations where the object is occluded and deformed; and the third part replaces the region proposal network (RPN) with a more efficient guided anchor RPN (GA-RPN), which can maintain the recall rate while improving the detection performance. The latter can obtain more detailed information from different feature layers of the deep neural network algorithm, and is especially aimed at scenes with small objects. Compared with Faster R-CNN, you only look once series (such as: YOLOv3), single shot detector (such as: SSD512), and other object detection algorithms, the algorithm proposed in this paper has an average improvement of 6.857% on the mean average precision (mAP) evaluation index while maintaining a certain recall rate. This strongly proves that the proposed method has higher detection rate and detection efficiency in this case. MDPI 2020-09-25 /pmc/articles/PMC7582940/ /pubmed/32992739 http://dx.doi.org/10.3390/s20195490 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Xiao, Yi
Wang, Xinqing
Zhang, Peng
Meng, Fanjie
Shao, Faming
Object Detection Based on Faster R-CNN Algorithm with Skip Pooling and Fusion of Contextual Information
title Object Detection Based on Faster R-CNN Algorithm with Skip Pooling and Fusion of Contextual Information
title_full Object Detection Based on Faster R-CNN Algorithm with Skip Pooling and Fusion of Contextual Information
title_fullStr Object Detection Based on Faster R-CNN Algorithm with Skip Pooling and Fusion of Contextual Information
title_full_unstemmed Object Detection Based on Faster R-CNN Algorithm with Skip Pooling and Fusion of Contextual Information
title_short Object Detection Based on Faster R-CNN Algorithm with Skip Pooling and Fusion of Contextual Information
title_sort object detection based on faster r-cnn algorithm with skip pooling and fusion of contextual information
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7582940/
https://www.ncbi.nlm.nih.gov/pubmed/32992739
http://dx.doi.org/10.3390/s20195490
work_keys_str_mv AT xiaoyi objectdetectionbasedonfasterrcnnalgorithmwithskippoolingandfusionofcontextualinformation
AT wangxinqing objectdetectionbasedonfasterrcnnalgorithmwithskippoolingandfusionofcontextualinformation
AT zhangpeng objectdetectionbasedonfasterrcnnalgorithmwithskippoolingandfusionofcontextualinformation
AT mengfanjie objectdetectionbasedonfasterrcnnalgorithmwithskippoolingandfusionofcontextualinformation
AT shaofaming objectdetectionbasedonfasterrcnnalgorithmwithskippoolingandfusionofcontextualinformation