Cargando…

A New Deep Model for Detecting Multiple Moving Targets in Real Traffic Scenarios: Machine Vision-Based Vehicles

When performing multiple target detection, it is difficult to detect small and occluded targets in complex traffic scenes. To this end, an improved YOLOv4 detection method is proposed in this work. Firstly, the network structure of the original YOLOv4 is adjusted, and the 4× down-sampling feature ma...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Xiaowei, Xiong, Hao, Zhan, Liu, Królczyk, Grzegorz, Stanislawski, Rafal, Gardoni, Paolo, Li, Zhixiong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9144427/
https://www.ncbi.nlm.nih.gov/pubmed/35632150
http://dx.doi.org/10.3390/s22103742
_version_ 1784716045492158464
author Xu, Xiaowei
Xiong, Hao
Zhan, Liu
Królczyk, Grzegorz
Stanislawski, Rafal
Gardoni, Paolo
Li, Zhixiong
author_facet Xu, Xiaowei
Xiong, Hao
Zhan, Liu
Królczyk, Grzegorz
Stanislawski, Rafal
Gardoni, Paolo
Li, Zhixiong
author_sort Xu, Xiaowei
collection PubMed
description When performing multiple target detection, it is difficult to detect small and occluded targets in complex traffic scenes. To this end, an improved YOLOv4 detection method is proposed in this work. Firstly, the network structure of the original YOLOv4 is adjusted, and the 4× down-sampling feature map of the backbone network is introduced into the neck network of the YOLOv4 model to splice the feature map with 8× down-sampling to form a four-scale detection structure, which enhances the fusion of deep and shallow semantics information of the feature map to improve the detection accuracy of small targets. Then, the convolutional block attention module (CBAM) is added to the model neck network to enhance the learning ability for features in space and on channels. Lastly, the detection rate of the occluded target is improved by using the soft non-maximum suppression (Soft-NMS) algorithm based on the distance intersection over union (DIoU) to avoid deleting the bounding boxes. On the KITTI dataset, experimental evaluation is performed and the analysis results demonstrate that the proposed detection model can effectively improve the multiple target detection accuracy, and the mean average accuracy (mAP) of the improved YOLOv4 model reaches 81.23%, which is 3.18% higher than the original YOLOv4; and the computation speed of the proposed model reaches 47.32 FPS. Compared with existing popular detection models, the proposed model produces higher detection accuracy and computation speed.
format Online
Article
Text
id pubmed-9144427
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-91444272022-05-29 A New Deep Model for Detecting Multiple Moving Targets in Real Traffic Scenarios: Machine Vision-Based Vehicles Xu, Xiaowei Xiong, Hao Zhan, Liu Królczyk, Grzegorz Stanislawski, Rafal Gardoni, Paolo Li, Zhixiong Sensors (Basel) Article When performing multiple target detection, it is difficult to detect small and occluded targets in complex traffic scenes. To this end, an improved YOLOv4 detection method is proposed in this work. Firstly, the network structure of the original YOLOv4 is adjusted, and the 4× down-sampling feature map of the backbone network is introduced into the neck network of the YOLOv4 model to splice the feature map with 8× down-sampling to form a four-scale detection structure, which enhances the fusion of deep and shallow semantics information of the feature map to improve the detection accuracy of small targets. Then, the convolutional block attention module (CBAM) is added to the model neck network to enhance the learning ability for features in space and on channels. Lastly, the detection rate of the occluded target is improved by using the soft non-maximum suppression (Soft-NMS) algorithm based on the distance intersection over union (DIoU) to avoid deleting the bounding boxes. On the KITTI dataset, experimental evaluation is performed and the analysis results demonstrate that the proposed detection model can effectively improve the multiple target detection accuracy, and the mean average accuracy (mAP) of the improved YOLOv4 model reaches 81.23%, which is 3.18% higher than the original YOLOv4; and the computation speed of the proposed model reaches 47.32 FPS. Compared with existing popular detection models, the proposed model produces higher detection accuracy and computation speed. MDPI 2022-05-14 /pmc/articles/PMC9144427/ /pubmed/35632150 http://dx.doi.org/10.3390/s22103742 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Xu, Xiaowei
Xiong, Hao
Zhan, Liu
Królczyk, Grzegorz
Stanislawski, Rafal
Gardoni, Paolo
Li, Zhixiong
A New Deep Model for Detecting Multiple Moving Targets in Real Traffic Scenarios: Machine Vision-Based Vehicles
title A New Deep Model for Detecting Multiple Moving Targets in Real Traffic Scenarios: Machine Vision-Based Vehicles
title_full A New Deep Model for Detecting Multiple Moving Targets in Real Traffic Scenarios: Machine Vision-Based Vehicles
title_fullStr A New Deep Model for Detecting Multiple Moving Targets in Real Traffic Scenarios: Machine Vision-Based Vehicles
title_full_unstemmed A New Deep Model for Detecting Multiple Moving Targets in Real Traffic Scenarios: Machine Vision-Based Vehicles
title_short A New Deep Model for Detecting Multiple Moving Targets in Real Traffic Scenarios: Machine Vision-Based Vehicles
title_sort new deep model for detecting multiple moving targets in real traffic scenarios: machine vision-based vehicles
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9144427/
https://www.ncbi.nlm.nih.gov/pubmed/35632150
http://dx.doi.org/10.3390/s22103742
work_keys_str_mv AT xuxiaowei anewdeepmodelfordetectingmultiplemovingtargetsinrealtrafficscenariosmachinevisionbasedvehicles
AT xionghao anewdeepmodelfordetectingmultiplemovingtargetsinrealtrafficscenariosmachinevisionbasedvehicles
AT zhanliu anewdeepmodelfordetectingmultiplemovingtargetsinrealtrafficscenariosmachinevisionbasedvehicles
AT krolczykgrzegorz anewdeepmodelfordetectingmultiplemovingtargetsinrealtrafficscenariosmachinevisionbasedvehicles
AT stanislawskirafal anewdeepmodelfordetectingmultiplemovingtargetsinrealtrafficscenariosmachinevisionbasedvehicles
AT gardonipaolo anewdeepmodelfordetectingmultiplemovingtargetsinrealtrafficscenariosmachinevisionbasedvehicles
AT lizhixiong anewdeepmodelfordetectingmultiplemovingtargetsinrealtrafficscenariosmachinevisionbasedvehicles
AT xuxiaowei newdeepmodelfordetectingmultiplemovingtargetsinrealtrafficscenariosmachinevisionbasedvehicles
AT xionghao newdeepmodelfordetectingmultiplemovingtargetsinrealtrafficscenariosmachinevisionbasedvehicles
AT zhanliu newdeepmodelfordetectingmultiplemovingtargetsinrealtrafficscenariosmachinevisionbasedvehicles
AT krolczykgrzegorz newdeepmodelfordetectingmultiplemovingtargetsinrealtrafficscenariosmachinevisionbasedvehicles
AT stanislawskirafal newdeepmodelfordetectingmultiplemovingtargetsinrealtrafficscenariosmachinevisionbasedvehicles
AT gardonipaolo newdeepmodelfordetectingmultiplemovingtargetsinrealtrafficscenariosmachinevisionbasedvehicles
AT lizhixiong newdeepmodelfordetectingmultiplemovingtargetsinrealtrafficscenariosmachinevisionbasedvehicles