Cargando…

Pedestrian Detection with Multi-View Convolution Fusion Algorithm

In recent years, the pedestrian detection technology of a single 2D image has been dramatically improved. When the scene becomes very crowded, the detection performance will deteriorate seriously and cannot meet the requirements of autonomous driving perception. With the introduction of the multi-vi...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Yuhong, Han, Chunyan, Zhang, Lin, Gao, Xin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8870950/
https://www.ncbi.nlm.nih.gov/pubmed/35205460
http://dx.doi.org/10.3390/e24020165
_version_ 1784656879633301504
author Liu, Yuhong
Han, Chunyan
Zhang, Lin
Gao, Xin
author_facet Liu, Yuhong
Han, Chunyan
Zhang, Lin
Gao, Xin
author_sort Liu, Yuhong
collection PubMed
description In recent years, the pedestrian detection technology of a single 2D image has been dramatically improved. When the scene becomes very crowded, the detection performance will deteriorate seriously and cannot meet the requirements of autonomous driving perception. With the introduction of the multi-view method, the task of pedestrian detection in crowded or fuzzy scenes has been significantly improved and has become a widely used method in autonomous driving. In this paper, we construct a double-branch feature fusion structure, the first branch adopts a lightweight structure, the second branch further extracts features and gets the feature map obtained from each layer. At the same time, the receptive field is enlarged by expanding convolution. To improve the speed of the model, the keypoint is used instead of the entire object for regression without an NMS post-processing operation. Meanwhile, the whole model can be learned from end to end. Even in the presence of many people, the method can still perform better on accuracy and speed. In the standard of Wildtrack and MultiviewX dataset, the accuracy and running speed both perform better than the state-of-the-art model, which has great practical significance in the autonomous driving field.
format Online
Article
Text
id pubmed-8870950
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-88709502022-02-25 Pedestrian Detection with Multi-View Convolution Fusion Algorithm Liu, Yuhong Han, Chunyan Zhang, Lin Gao, Xin Entropy (Basel) Article In recent years, the pedestrian detection technology of a single 2D image has been dramatically improved. When the scene becomes very crowded, the detection performance will deteriorate seriously and cannot meet the requirements of autonomous driving perception. With the introduction of the multi-view method, the task of pedestrian detection in crowded or fuzzy scenes has been significantly improved and has become a widely used method in autonomous driving. In this paper, we construct a double-branch feature fusion structure, the first branch adopts a lightweight structure, the second branch further extracts features and gets the feature map obtained from each layer. At the same time, the receptive field is enlarged by expanding convolution. To improve the speed of the model, the keypoint is used instead of the entire object for regression without an NMS post-processing operation. Meanwhile, the whole model can be learned from end to end. Even in the presence of many people, the method can still perform better on accuracy and speed. In the standard of Wildtrack and MultiviewX dataset, the accuracy and running speed both perform better than the state-of-the-art model, which has great practical significance in the autonomous driving field. MDPI 2022-01-22 /pmc/articles/PMC8870950/ /pubmed/35205460 http://dx.doi.org/10.3390/e24020165 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Liu, Yuhong
Han, Chunyan
Zhang, Lin
Gao, Xin
Pedestrian Detection with Multi-View Convolution Fusion Algorithm
title Pedestrian Detection with Multi-View Convolution Fusion Algorithm
title_full Pedestrian Detection with Multi-View Convolution Fusion Algorithm
title_fullStr Pedestrian Detection with Multi-View Convolution Fusion Algorithm
title_full_unstemmed Pedestrian Detection with Multi-View Convolution Fusion Algorithm
title_short Pedestrian Detection with Multi-View Convolution Fusion Algorithm
title_sort pedestrian detection with multi-view convolution fusion algorithm
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8870950/
https://www.ncbi.nlm.nih.gov/pubmed/35205460
http://dx.doi.org/10.3390/e24020165
work_keys_str_mv AT liuyuhong pedestriandetectionwithmultiviewconvolutionfusionalgorithm
AT hanchunyan pedestriandetectionwithmultiviewconvolutionfusionalgorithm
AT zhanglin pedestriandetectionwithmultiviewconvolutionfusionalgorithm
AT gaoxin pedestriandetectionwithmultiviewconvolutionfusionalgorithm