Cargando…

Phased Feature Extraction Network for Vehicle Search Tasks Based on Cross-Camera for Vehicle–Road Collaborative Perception

The objective of vehicle search is to locate and identify vehicles in uncropped, real-world images, which is the combination of two tasks: vehicle detection and re-identification (Re-ID). As an emerging research topic, vehicle search plays a significant role in the perception of cooperative autonomo...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wang, Hai, Niu, Yaqing, Chen, Long, Li, Yicheng, Luo, Tong
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10611384/ https://www.ncbi.nlm.nih.gov/pubmed/37896723 http://dx.doi.org/10.3390/s23208630

_version_	1785128478718296064
author	Wang, Hai Niu, Yaqing Chen, Long Li, Yicheng Luo, Tong
author_facet	Wang, Hai Niu, Yaqing Chen, Long Li, Yicheng Luo, Tong
author_sort	Wang, Hai
collection	PubMed
description	The objective of vehicle search is to locate and identify vehicles in uncropped, real-world images, which is the combination of two tasks: vehicle detection and re-identification (Re-ID). As an emerging research topic, vehicle search plays a significant role in the perception of cooperative autonomous vehicles and road driving in the distant future and has become a trend in the future development of intelligent driving. However, there is no suitable dataset for this study. The Tsinghua University DAIR-V2X dataset is utilized to create the first cross-camera vehicle search dataset, DAIR-V2XSearch, which combines the cameras at both ends of the vehicle and the road in real-world scenes. The primary purpose of the current search network is to address the pedestrian issue. Due to varying task scenarios, it is necessary to re-establish the network in order to resolve the problem of vast differences in different perspectives caused by vehicle searches. A phased feature extraction network (PFE-Net) is proposed as a solution to the cross-camera vehicle search problem. Initially, the anchor-free YOLOX framework is selected as the backbone network, which not only improves the network’s performance but also eliminates the fuzzy situation in which multiple anchor boxes correspond to a single vehicle ID in the Re-ID branch. Second, for the vehicle Re-ID branch, a camera grouping module is proposed to effectively address issues such as sudden changes in perspective and disparities in shooting under different cameras. Finally, a cross-level feature fusion module is designed to enhance the model’s ability to extract subtle vehicle features and the Re-ID’s precision. Experiments demonstrate that our proposed PFE-Net achieves the highest precision in the DAIR-V2XSearch dataset.
format	Online Article Text
id	pubmed-10611384
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-106113842023-10-28 Phased Feature Extraction Network for Vehicle Search Tasks Based on Cross-Camera for Vehicle–Road Collaborative Perception Wang, Hai Niu, Yaqing Chen, Long Li, Yicheng Luo, Tong Sensors (Basel) Article The objective of vehicle search is to locate and identify vehicles in uncropped, real-world images, which is the combination of two tasks: vehicle detection and re-identification (Re-ID). As an emerging research topic, vehicle search plays a significant role in the perception of cooperative autonomous vehicles and road driving in the distant future and has become a trend in the future development of intelligent driving. However, there is no suitable dataset for this study. The Tsinghua University DAIR-V2X dataset is utilized to create the first cross-camera vehicle search dataset, DAIR-V2XSearch, which combines the cameras at both ends of the vehicle and the road in real-world scenes. The primary purpose of the current search network is to address the pedestrian issue. Due to varying task scenarios, it is necessary to re-establish the network in order to resolve the problem of vast differences in different perspectives caused by vehicle searches. A phased feature extraction network (PFE-Net) is proposed as a solution to the cross-camera vehicle search problem. Initially, the anchor-free YOLOX framework is selected as the backbone network, which not only improves the network’s performance but also eliminates the fuzzy situation in which multiple anchor boxes correspond to a single vehicle ID in the Re-ID branch. Second, for the vehicle Re-ID branch, a camera grouping module is proposed to effectively address issues such as sudden changes in perspective and disparities in shooting under different cameras. Finally, a cross-level feature fusion module is designed to enhance the model’s ability to extract subtle vehicle features and the Re-ID’s precision. Experiments demonstrate that our proposed PFE-Net achieves the highest precision in the DAIR-V2XSearch dataset. MDPI 2023-10-22 /pmc/articles/PMC10611384/ /pubmed/37896723 http://dx.doi.org/10.3390/s23208630 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Wang, Hai Niu, Yaqing Chen, Long Li, Yicheng Luo, Tong Phased Feature Extraction Network for Vehicle Search Tasks Based on Cross-Camera for Vehicle–Road Collaborative Perception
title	Phased Feature Extraction Network for Vehicle Search Tasks Based on Cross-Camera for Vehicle–Road Collaborative Perception
title_full	Phased Feature Extraction Network for Vehicle Search Tasks Based on Cross-Camera for Vehicle–Road Collaborative Perception
title_fullStr	Phased Feature Extraction Network for Vehicle Search Tasks Based on Cross-Camera for Vehicle–Road Collaborative Perception
title_full_unstemmed	Phased Feature Extraction Network for Vehicle Search Tasks Based on Cross-Camera for Vehicle–Road Collaborative Perception
title_short	Phased Feature Extraction Network for Vehicle Search Tasks Based on Cross-Camera for Vehicle–Road Collaborative Perception
title_sort	phased feature extraction network for vehicle search tasks based on cross-camera for vehicle–road collaborative perception
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10611384/ https://www.ncbi.nlm.nih.gov/pubmed/37896723 http://dx.doi.org/10.3390/s23208630
work_keys_str_mv	AT wanghai phasedfeatureextractionnetworkforvehiclesearchtasksbasedoncrosscameraforvehicleroadcollaborativeperception AT niuyaqing phasedfeatureextractionnetworkforvehiclesearchtasksbasedoncrosscameraforvehicleroadcollaborativeperception AT chenlong phasedfeatureextractionnetworkforvehiclesearchtasksbasedoncrosscameraforvehicleroadcollaborativeperception AT liyicheng phasedfeatureextractionnetworkforvehiclesearchtasksbasedoncrosscameraforvehicleroadcollaborativeperception AT luotong phasedfeatureextractionnetworkforvehiclesearchtasksbasedoncrosscameraforvehicleroadcollaborativeperception

Phased Feature Extraction Network for Vehicle Search Tasks Based on Cross-Camera for Vehicle–Road Collaborative Perception

Ejemplares similares