VV-YOLO: A Vehicle View Object Detection Model Based on Improved YOLOv4

Vehicle view object detection technology is the key to the environment perception modules of autonomous vehicles, which is crucial for driving safety. In view of the characteristics of complex scenes, such as dim light, occlusion, and long distance, an improved YOLOv4-based vehicle view object detec...

Bibliographic Details
Main Authors: Wang, Yinan; Guan, Yingzhou; Liu, Hanxu; Jin, Lisheng; Li, Xinwei; Guo, Baicang; Zhang, Zhe
Format: Online Article Text
Language: English
Published: MDPI 2023
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10099275/
https://www.ncbi.nlm.nih.gov/pubmed/37050441
http://dx.doi.org/10.3390/s23073385
author Wang, Yinan
Guan, Yingzhou
Liu, Hanxu
Jin, Lisheng
Li, Xinwei
Guo, Baicang
Zhang, Zhe
collection PubMed
description Vehicle view object detection technology is the key to the environment perception modules of autonomous vehicles, which is crucial for driving safety. In view of the characteristics of complex scenes, such as dim light, occlusion, and long distance, an improved YOLOv4-based vehicle view object detection model, VV-YOLO, is proposed in this paper. The VV-YOLO model adopts the implementation mode based on anchor frames. In the anchor frame clustering, the improved K-means++ algorithm is used to reduce the possibility of instability in anchor frame clustering results caused by the random selection of a cluster center, so that the model can obtain a reasonable original anchor frame. Firstly, the CA-PAN network was designed by adding a coordinate attention mechanism, which was used in the neck network of the VV-YOLO model; the multidimensional modeling of image feature channel relationships was realized; and the extraction effect of complex image features was improved. Secondly, in order to ensure the sufficiency of model training, the loss function of the VV-YOLO model was reconstructed based on the focus function, which alleviated the problem of training imbalance caused by the unbalanced distribution of training data. Finally, the KITTI dataset was selected as the test set to conduct the index quantification experiment. The results showed that the precision and average precision of the VV-YOLO model were 90.68% and 80.01%, respectively, which were 6.88% and 3.44% higher than those of the YOLOv4 model, and the model’s calculation time on the same hardware platform did not increase significantly. In addition to testing on the KITTI dataset, we also selected the BDD100K dataset and typical complex traffic scene data collected in the field to conduct a visual comparison test of the results, and then the validity and robustness of the VV-YOLO model were verified.
format Online
Article
Text
id pubmed-10099275
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-10099275 2023-04-14 VV-YOLO: A Vehicle View Object Detection Model Based on Improved YOLOv4 Wang, Yinan Guan, Yingzhou Liu, Hanxu Jin, Lisheng Li, Xinwei Guo, Baicang Zhang, Zhe Sensors (Basel) Article MDPI 2023-03-23 /pmc/articles/PMC10099275/ /pubmed/37050441 http://dx.doi.org/10.3390/s23073385 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/ Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
title VV-YOLO: A Vehicle View Object Detection Model Based on Improved YOLOv4
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10099275/
https://www.ncbi.nlm.nih.gov/pubmed/37050441
http://dx.doi.org/10.3390/s23073385