Cargando…

Multi-object detection for crowded road scene based on ML-AFP of YOLOv5

Aiming at the problem of multi-object detection such as target occlusion and tiny targets in road scenes, this paper proposes an improved YOLOv5 multi-object detection model based on ML-AFP (multi-level aggregation feature perception) mechanism. Since tiny targets such as non-motor vehicle and pedes...

Descripción completa

Detalles Bibliográficos
Autores principales:	Li, Yiming, Wu, Kaiwen, Kang, Wenshuo, Zhou, Yuhui, Di, Fan
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10570361/ https://www.ncbi.nlm.nih.gov/pubmed/37828051 http://dx.doi.org/10.1038/s41598-023-43458-3

_version_	1785119749146935296
author	Li, Yiming Wu, Kaiwen Kang, Wenshuo Zhou, Yuhui Di, Fan
author_facet	Li, Yiming Wu, Kaiwen Kang, Wenshuo Zhou, Yuhui Di, Fan
author_sort	Li, Yiming
collection	PubMed
description	Aiming at the problem of multi-object detection such as target occlusion and tiny targets in road scenes, this paper proposes an improved YOLOv5 multi-object detection model based on ML-AFP (multi-level aggregation feature perception) mechanism. Since tiny targets such as non-motor vehicle and pedestrians are not easily detected, this paper adds a micro target detection layer and a double head mechanism to improve the detection ability of tiny targets. Varifocal loss is used to achieve a more accurate ranking in the process of non-maximum suppression to solve the problem of target occlusion, and this paper also proposes a ML-AFP mechanism. The adaptive fusion of spatial feature information at different scales improves the expression ability of network model features, and improves the detection accuracy of the model as a whole. Our experimental results on multiple challenging datasets such as KITTI, BDD100K, and show that the accuracy, recall rate and mAP value of the proposed model are greatly improved, which solves the problem of multi-object detection in crowded road scenes.
format	Online Article Text
id	pubmed-10570361
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-105703612023-10-14 Multi-object detection for crowded road scene based on ML-AFP of YOLOv5 Li, Yiming Wu, Kaiwen Kang, Wenshuo Zhou, Yuhui Di, Fan Sci Rep Article Aiming at the problem of multi-object detection such as target occlusion and tiny targets in road scenes, this paper proposes an improved YOLOv5 multi-object detection model based on ML-AFP (multi-level aggregation feature perception) mechanism. Since tiny targets such as non-motor vehicle and pedestrians are not easily detected, this paper adds a micro target detection layer and a double head mechanism to improve the detection ability of tiny targets. Varifocal loss is used to achieve a more accurate ranking in the process of non-maximum suppression to solve the problem of target occlusion, and this paper also proposes a ML-AFP mechanism. The adaptive fusion of spatial feature information at different scales improves the expression ability of network model features, and improves the detection accuracy of the model as a whole. Our experimental results on multiple challenging datasets such as KITTI, BDD100K, and show that the accuracy, recall rate and mAP value of the proposed model are greatly improved, which solves the problem of multi-object detection in crowded road scenes. Nature Publishing Group UK 2023-10-12 /pmc/articles/PMC10570361/ /pubmed/37828051 http://dx.doi.org/10.1038/s41598-023-43458-3 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Article Li, Yiming Wu, Kaiwen Kang, Wenshuo Zhou, Yuhui Di, Fan Multi-object detection for crowded road scene based on ML-AFP of YOLOv5
title	Multi-object detection for crowded road scene based on ML-AFP of YOLOv5
title_full	Multi-object detection for crowded road scene based on ML-AFP of YOLOv5
title_fullStr	Multi-object detection for crowded road scene based on ML-AFP of YOLOv5
title_full_unstemmed	Multi-object detection for crowded road scene based on ML-AFP of YOLOv5
title_short	Multi-object detection for crowded road scene based on ML-AFP of YOLOv5
title_sort	multi-object detection for crowded road scene based on ml-afp of yolov5
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10570361/ https://www.ncbi.nlm.nih.gov/pubmed/37828051 http://dx.doi.org/10.1038/s41598-023-43458-3
work_keys_str_mv	AT liyiming multiobjectdetectionforcrowdedroadscenebasedonmlafpofyolov5 AT wukaiwen multiobjectdetectionforcrowdedroadscenebasedonmlafpofyolov5 AT kangwenshuo multiobjectdetectionforcrowdedroadscenebasedonmlafpofyolov5 AT zhouyuhui multiobjectdetectionforcrowdedroadscenebasedonmlafpofyolov5 AT difan multiobjectdetectionforcrowdedroadscenebasedonmlafpofyolov5

Multi-object detection for crowded road scene based on ML-AFP of YOLOv5

Ejemplares similares