Cargando…

An object detection algorithm combining self-attention and YOLOv4 in traffic scene

Automobile intelligence is the trend for modern automobiles, of which environment perception is the key technology of intelligent automobile research. For autonomous vehicles, the detection of object information, such as vehicles and pedestrians in traffic scenes is crucial to improving driving safe...

Descripción completa

Detalles Bibliográficos
Autores principales: Lu, Kewei, Zhao, Fengkui, Xu, Xiaomei, Zhang, Yong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10194927/
https://www.ncbi.nlm.nih.gov/pubmed/37200376
http://dx.doi.org/10.1371/journal.pone.0285654
_version_ 1785044120269488128
author Lu, Kewei
Zhao, Fengkui
Xu, Xiaomei
Zhang, Yong
author_facet Lu, Kewei
Zhao, Fengkui
Xu, Xiaomei
Zhang, Yong
author_sort Lu, Kewei
collection PubMed
description Automobile intelligence is the trend for modern automobiles, of which environment perception is the key technology of intelligent automobile research. For autonomous vehicles, the detection of object information, such as vehicles and pedestrians in traffic scenes is crucial to improving driving safety. However, in the actual traffic scene, there are many special conditions such as object occlusion, small objects, and bad weather, which will affect the accuracy of object detection. In this research, the SwinT-YOLOv4 algorithm is proposed for detecting objects in traffic scenes, which is based on the YOLOv4 algorithm. Compared with a Convolutional neural network (CNN), the vision transformer is more powerful at extracting vision features of objects in the image. The CNN-based backbone in YOLOv4 is replaced by the Swin Transformer in the proposed algorithm. The feature-fusing neck and predicting head of YOLOv4 is remained. The proposed model was trained and evaluated in the COCO dataset. Experiments show that our method can significantly improve the accuracy of object detection under special conditions. Equipped with our method, the object detection precision for cars and person is improved by 1.75%, and the detection precision for car and person reach 89.04% and 94.16%, respectively.
format Online
Article
Text
id pubmed-10194927
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-101949272023-05-19 An object detection algorithm combining self-attention and YOLOv4 in traffic scene Lu, Kewei Zhao, Fengkui Xu, Xiaomei Zhang, Yong PLoS One Research Article Automobile intelligence is the trend for modern automobiles, of which environment perception is the key technology of intelligent automobile research. For autonomous vehicles, the detection of object information, such as vehicles and pedestrians in traffic scenes is crucial to improving driving safety. However, in the actual traffic scene, there are many special conditions such as object occlusion, small objects, and bad weather, which will affect the accuracy of object detection. In this research, the SwinT-YOLOv4 algorithm is proposed for detecting objects in traffic scenes, which is based on the YOLOv4 algorithm. Compared with a Convolutional neural network (CNN), the vision transformer is more powerful at extracting vision features of objects in the image. The CNN-based backbone in YOLOv4 is replaced by the Swin Transformer in the proposed algorithm. The feature-fusing neck and predicting head of YOLOv4 is remained. The proposed model was trained and evaluated in the COCO dataset. Experiments show that our method can significantly improve the accuracy of object detection under special conditions. Equipped with our method, the object detection precision for cars and person is improved by 1.75%, and the detection precision for car and person reach 89.04% and 94.16%, respectively. Public Library of Science 2023-05-18 /pmc/articles/PMC10194927/ /pubmed/37200376 http://dx.doi.org/10.1371/journal.pone.0285654 Text en © 2023 Lu et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Lu, Kewei
Zhao, Fengkui
Xu, Xiaomei
Zhang, Yong
An object detection algorithm combining self-attention and YOLOv4 in traffic scene
title An object detection algorithm combining self-attention and YOLOv4 in traffic scene
title_full An object detection algorithm combining self-attention and YOLOv4 in traffic scene
title_fullStr An object detection algorithm combining self-attention and YOLOv4 in traffic scene
title_full_unstemmed An object detection algorithm combining self-attention and YOLOv4 in traffic scene
title_short An object detection algorithm combining self-attention and YOLOv4 in traffic scene
title_sort object detection algorithm combining self-attention and yolov4 in traffic scene
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10194927/
https://www.ncbi.nlm.nih.gov/pubmed/37200376
http://dx.doi.org/10.1371/journal.pone.0285654
work_keys_str_mv AT lukewei anobjectdetectionalgorithmcombiningselfattentionandyolov4intrafficscene
AT zhaofengkui anobjectdetectionalgorithmcombiningselfattentionandyolov4intrafficscene
AT xuxiaomei anobjectdetectionalgorithmcombiningselfattentionandyolov4intrafficscene
AT zhangyong anobjectdetectionalgorithmcombiningselfattentionandyolov4intrafficscene
AT lukewei objectdetectionalgorithmcombiningselfattentionandyolov4intrafficscene
AT zhaofengkui objectdetectionalgorithmcombiningselfattentionandyolov4intrafficscene
AT xuxiaomei objectdetectionalgorithmcombiningselfattentionandyolov4intrafficscene
AT zhangyong objectdetectionalgorithmcombiningselfattentionandyolov4intrafficscene