Cargando…

VV-YOLO: A Vehicle View Object Detection Model Based on Improved YOLOv4

Vehicle view object detection technology is the key to the environment perception modules of autonomous vehicles, which is crucial for driving safety. In view of the characteristics of complex scenes, such as dim light, occlusion, and long distance, an improved YOLOv4-based vehicle view object detec...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wang, Yinan, Guan, Yingzhou, Liu, Hanxu, Jin, Lisheng, Li, Xinwei, Guo, Baicang, Zhang, Zhe
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10099275/ https://www.ncbi.nlm.nih.gov/pubmed/37050441 http://dx.doi.org/10.3390/s23073385

_version_	1785025022787584000
author	Wang, Yinan Guan, Yingzhou Liu, Hanxu Jin, Lisheng Li, Xinwei Guo, Baicang Zhang, Zhe
author_facet	Wang, Yinan Guan, Yingzhou Liu, Hanxu Jin, Lisheng Li, Xinwei Guo, Baicang Zhang, Zhe
author_sort	Wang, Yinan
collection	PubMed
description	Vehicle view object detection technology is the key to the environment perception modules of autonomous vehicles, which is crucial for driving safety. In view of the characteristics of complex scenes, such as dim light, occlusion, and long distance, an improved YOLOv4-based vehicle view object detection model, VV-YOLO, is proposed in this paper. The VV-YOLO model adopts the implementation mode based on anchor frames. In the anchor frame clustering, the improved K-means++ algorithm is used to reduce the possibility of instability in anchor frame clustering results caused by the random selection of a cluster center, so that the model can obtain a reasonable original anchor frame. Firstly, the CA-PAN network was designed by adding a coordinate attention mechanism, which was used in the neck network of the VV-YOLO model; the multidimensional modeling of image feature channel relationships was realized; and the extraction effect of complex image features was improved. Secondly, in order to ensure the sufficiency of model training, the loss function of the VV-YOLO model was reconstructed based on the focus function, which alleviated the problem of training imbalance caused by the unbalanced distribution of training data. Finally, the KITTI dataset was selected as the test set to conduct the index quantification experiment. The results showed that the precision and average precision of the VV-YOLO model were 90.68% and 80.01%, respectively, which were 6.88% and 3.44% higher than those of the YOLOv4 model, and the model’s calculation time on the same hardware platform did not increase significantly. In addition to testing on the KITTI dataset, we also selected the BDD100K dataset and typical complex traffic scene data collected in the field to conduct a visual comparison test of the results, and then the validity and robustness of the VV-YOLO model were verified.
format	Online Article Text
id	pubmed-10099275
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-100992752023-04-14 VV-YOLO: A Vehicle View Object Detection Model Based on Improved YOLOv4 Wang, Yinan Guan, Yingzhou Liu, Hanxu Jin, Lisheng Li, Xinwei Guo, Baicang Zhang, Zhe Sensors (Basel) Article Vehicle view object detection technology is the key to the environment perception modules of autonomous vehicles, which is crucial for driving safety. In view of the characteristics of complex scenes, such as dim light, occlusion, and long distance, an improved YOLOv4-based vehicle view object detection model, VV-YOLO, is proposed in this paper. The VV-YOLO model adopts the implementation mode based on anchor frames. In the anchor frame clustering, the improved K-means++ algorithm is used to reduce the possibility of instability in anchor frame clustering results caused by the random selection of a cluster center, so that the model can obtain a reasonable original anchor frame. Firstly, the CA-PAN network was designed by adding a coordinate attention mechanism, which was used in the neck network of the VV-YOLO model; the multidimensional modeling of image feature channel relationships was realized; and the extraction effect of complex image features was improved. Secondly, in order to ensure the sufficiency of model training, the loss function of the VV-YOLO model was reconstructed based on the focus function, which alleviated the problem of training imbalance caused by the unbalanced distribution of training data. Finally, the KITTI dataset was selected as the test set to conduct the index quantification experiment. The results showed that the precision and average precision of the VV-YOLO model were 90.68% and 80.01%, respectively, which were 6.88% and 3.44% higher than those of the YOLOv4 model, and the model’s calculation time on the same hardware platform did not increase significantly. In addition to testing on the KITTI dataset, we also selected the BDD100K dataset and typical complex traffic scene data collected in the field to conduct a visual comparison test of the results, and then the validity and robustness of the VV-YOLO model were verified. MDPI 2023-03-23 /pmc/articles/PMC10099275/ /pubmed/37050441 http://dx.doi.org/10.3390/s23073385 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Wang, Yinan Guan, Yingzhou Liu, Hanxu Jin, Lisheng Li, Xinwei Guo, Baicang Zhang, Zhe VV-YOLO: A Vehicle View Object Detection Model Based on Improved YOLOv4
title	VV-YOLO: A Vehicle View Object Detection Model Based on Improved YOLOv4
title_full	VV-YOLO: A Vehicle View Object Detection Model Based on Improved YOLOv4
title_fullStr	VV-YOLO: A Vehicle View Object Detection Model Based on Improved YOLOv4
title_full_unstemmed	VV-YOLO: A Vehicle View Object Detection Model Based on Improved YOLOv4
title_short	VV-YOLO: A Vehicle View Object Detection Model Based on Improved YOLOv4
title_sort	vv-yolo: a vehicle view object detection model based on improved yolov4
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10099275/ https://www.ncbi.nlm.nih.gov/pubmed/37050441 http://dx.doi.org/10.3390/s23073385
work_keys_str_mv	AT wangyinan vvyoloavehicleviewobjectdetectionmodelbasedonimprovedyolov4 AT guanyingzhou vvyoloavehicleviewobjectdetectionmodelbasedonimprovedyolov4 AT liuhanxu vvyoloavehicleviewobjectdetectionmodelbasedonimprovedyolov4 AT jinlisheng vvyoloavehicleviewobjectdetectionmodelbasedonimprovedyolov4 AT lixinwei vvyoloavehicleviewobjectdetectionmodelbasedonimprovedyolov4 AT guobaicang vvyoloavehicleviewobjectdetectionmodelbasedonimprovedyolov4 AT zhangzhe vvyoloavehicleviewobjectdetectionmodelbasedonimprovedyolov4

VV-YOLO: A Vehicle View Object Detection Model Based on Improved YOLOv4

Ejemplares similares