Cargando…
Pedestrian Detection with Multi-View Convolution Fusion Algorithm
In recent years, the pedestrian detection technology of a single 2D image has been dramatically improved. When the scene becomes very crowded, the detection performance will deteriorate seriously and cannot meet the requirements of autonomous driving perception. With the introduction of the multi-vi...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8870950/ https://www.ncbi.nlm.nih.gov/pubmed/35205460 http://dx.doi.org/10.3390/e24020165 |
_version_ | 1784656879633301504 |
---|---|
author | Liu, Yuhong Han, Chunyan Zhang, Lin Gao, Xin |
author_facet | Liu, Yuhong Han, Chunyan Zhang, Lin Gao, Xin |
author_sort | Liu, Yuhong |
collection | PubMed |
description | In recent years, the pedestrian detection technology of a single 2D image has been dramatically improved. When the scene becomes very crowded, the detection performance will deteriorate seriously and cannot meet the requirements of autonomous driving perception. With the introduction of the multi-view method, the task of pedestrian detection in crowded or fuzzy scenes has been significantly improved and has become a widely used method in autonomous driving. In this paper, we construct a double-branch feature fusion structure, the first branch adopts a lightweight structure, the second branch further extracts features and gets the feature map obtained from each layer. At the same time, the receptive field is enlarged by expanding convolution. To improve the speed of the model, the keypoint is used instead of the entire object for regression without an NMS post-processing operation. Meanwhile, the whole model can be learned from end to end. Even in the presence of many people, the method can still perform better on accuracy and speed. In the standard of Wildtrack and MultiviewX dataset, the accuracy and running speed both perform better than the state-of-the-art model, which has great practical significance in the autonomous driving field. |
format | Online Article Text |
id | pubmed-8870950 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-88709502022-02-25 Pedestrian Detection with Multi-View Convolution Fusion Algorithm Liu, Yuhong Han, Chunyan Zhang, Lin Gao, Xin Entropy (Basel) Article In recent years, the pedestrian detection technology of a single 2D image has been dramatically improved. When the scene becomes very crowded, the detection performance will deteriorate seriously and cannot meet the requirements of autonomous driving perception. With the introduction of the multi-view method, the task of pedestrian detection in crowded or fuzzy scenes has been significantly improved and has become a widely used method in autonomous driving. In this paper, we construct a double-branch feature fusion structure, the first branch adopts a lightweight structure, the second branch further extracts features and gets the feature map obtained from each layer. At the same time, the receptive field is enlarged by expanding convolution. To improve the speed of the model, the keypoint is used instead of the entire object for regression without an NMS post-processing operation. Meanwhile, the whole model can be learned from end to end. Even in the presence of many people, the method can still perform better on accuracy and speed. In the standard of Wildtrack and MultiviewX dataset, the accuracy and running speed both perform better than the state-of-the-art model, which has great practical significance in the autonomous driving field. MDPI 2022-01-22 /pmc/articles/PMC8870950/ /pubmed/35205460 http://dx.doi.org/10.3390/e24020165 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Liu, Yuhong Han, Chunyan Zhang, Lin Gao, Xin Pedestrian Detection with Multi-View Convolution Fusion Algorithm |
title | Pedestrian Detection with Multi-View Convolution Fusion Algorithm |
title_full | Pedestrian Detection with Multi-View Convolution Fusion Algorithm |
title_fullStr | Pedestrian Detection with Multi-View Convolution Fusion Algorithm |
title_full_unstemmed | Pedestrian Detection with Multi-View Convolution Fusion Algorithm |
title_short | Pedestrian Detection with Multi-View Convolution Fusion Algorithm |
title_sort | pedestrian detection with multi-view convolution fusion algorithm |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8870950/ https://www.ncbi.nlm.nih.gov/pubmed/35205460 http://dx.doi.org/10.3390/e24020165 |
work_keys_str_mv | AT liuyuhong pedestriandetectionwithmultiviewconvolutionfusionalgorithm AT hanchunyan pedestriandetectionwithmultiviewconvolutionfusionalgorithm AT zhanglin pedestriandetectionwithmultiviewconvolutionfusionalgorithm AT gaoxin pedestriandetectionwithmultiviewconvolutionfusionalgorithm |