Cargando…

Phased Feature Extraction Network for Vehicle Search Tasks Based on Cross-Camera for Vehicle–Road Collaborative Perception

The objective of vehicle search is to locate and identify vehicles in uncropped, real-world images, which is the combination of two tasks: vehicle detection and re-identification (Re-ID). As an emerging research topic, vehicle search plays a significant role in the perception of cooperative autonomo...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Hai, Niu, Yaqing, Chen, Long, Li, Yicheng, Luo, Tong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10611384/
https://www.ncbi.nlm.nih.gov/pubmed/37896723
http://dx.doi.org/10.3390/s23208630
_version_ 1785128478718296064
author Wang, Hai
Niu, Yaqing
Chen, Long
Li, Yicheng
Luo, Tong
author_facet Wang, Hai
Niu, Yaqing
Chen, Long
Li, Yicheng
Luo, Tong
author_sort Wang, Hai
collection PubMed
description The objective of vehicle search is to locate and identify vehicles in uncropped, real-world images, which is the combination of two tasks: vehicle detection and re-identification (Re-ID). As an emerging research topic, vehicle search plays a significant role in the perception of cooperative autonomous vehicles and road driving in the distant future and has become a trend in the future development of intelligent driving. However, there is no suitable dataset for this study. The Tsinghua University DAIR-V2X dataset is utilized to create the first cross-camera vehicle search dataset, DAIR-V2XSearch, which combines the cameras at both ends of the vehicle and the road in real-world scenes. The primary purpose of the current search network is to address the pedestrian issue. Due to varying task scenarios, it is necessary to re-establish the network in order to resolve the problem of vast differences in different perspectives caused by vehicle searches. A phased feature extraction network (PFE-Net) is proposed as a solution to the cross-camera vehicle search problem. Initially, the anchor-free YOLOX framework is selected as the backbone network, which not only improves the network’s performance but also eliminates the fuzzy situation in which multiple anchor boxes correspond to a single vehicle ID in the Re-ID branch. Second, for the vehicle Re-ID branch, a camera grouping module is proposed to effectively address issues such as sudden changes in perspective and disparities in shooting under different cameras. Finally, a cross-level feature fusion module is designed to enhance the model’s ability to extract subtle vehicle features and the Re-ID’s precision. Experiments demonstrate that our proposed PFE-Net achieves the highest precision in the DAIR-V2XSearch dataset.
format Online
Article
Text
id pubmed-10611384
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-106113842023-10-28 Phased Feature Extraction Network for Vehicle Search Tasks Based on Cross-Camera for Vehicle–Road Collaborative Perception Wang, Hai Niu, Yaqing Chen, Long Li, Yicheng Luo, Tong Sensors (Basel) Article The objective of vehicle search is to locate and identify vehicles in uncropped, real-world images, which is the combination of two tasks: vehicle detection and re-identification (Re-ID). As an emerging research topic, vehicle search plays a significant role in the perception of cooperative autonomous vehicles and road driving in the distant future and has become a trend in the future development of intelligent driving. However, there is no suitable dataset for this study. The Tsinghua University DAIR-V2X dataset is utilized to create the first cross-camera vehicle search dataset, DAIR-V2XSearch, which combines the cameras at both ends of the vehicle and the road in real-world scenes. The primary purpose of the current search network is to address the pedestrian issue. Due to varying task scenarios, it is necessary to re-establish the network in order to resolve the problem of vast differences in different perspectives caused by vehicle searches. A phased feature extraction network (PFE-Net) is proposed as a solution to the cross-camera vehicle search problem. Initially, the anchor-free YOLOX framework is selected as the backbone network, which not only improves the network’s performance but also eliminates the fuzzy situation in which multiple anchor boxes correspond to a single vehicle ID in the Re-ID branch. Second, for the vehicle Re-ID branch, a camera grouping module is proposed to effectively address issues such as sudden changes in perspective and disparities in shooting under different cameras. Finally, a cross-level feature fusion module is designed to enhance the model’s ability to extract subtle vehicle features and the Re-ID’s precision. Experiments demonstrate that our proposed PFE-Net achieves the highest precision in the DAIR-V2XSearch dataset. MDPI 2023-10-22 /pmc/articles/PMC10611384/ /pubmed/37896723 http://dx.doi.org/10.3390/s23208630 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wang, Hai
Niu, Yaqing
Chen, Long
Li, Yicheng
Luo, Tong
Phased Feature Extraction Network for Vehicle Search Tasks Based on Cross-Camera for Vehicle–Road Collaborative Perception
title Phased Feature Extraction Network for Vehicle Search Tasks Based on Cross-Camera for Vehicle–Road Collaborative Perception
title_full Phased Feature Extraction Network for Vehicle Search Tasks Based on Cross-Camera for Vehicle–Road Collaborative Perception
title_fullStr Phased Feature Extraction Network for Vehicle Search Tasks Based on Cross-Camera for Vehicle–Road Collaborative Perception
title_full_unstemmed Phased Feature Extraction Network for Vehicle Search Tasks Based on Cross-Camera for Vehicle–Road Collaborative Perception
title_short Phased Feature Extraction Network for Vehicle Search Tasks Based on Cross-Camera for Vehicle–Road Collaborative Perception
title_sort phased feature extraction network for vehicle search tasks based on cross-camera for vehicle–road collaborative perception
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10611384/
https://www.ncbi.nlm.nih.gov/pubmed/37896723
http://dx.doi.org/10.3390/s23208630
work_keys_str_mv AT wanghai phasedfeatureextractionnetworkforvehiclesearchtasksbasedoncrosscameraforvehicleroadcollaborativeperception
AT niuyaqing phasedfeatureextractionnetworkforvehiclesearchtasksbasedoncrosscameraforvehicleroadcollaborativeperception
AT chenlong phasedfeatureextractionnetworkforvehiclesearchtasksbasedoncrosscameraforvehicleroadcollaborativeperception
AT liyicheng phasedfeatureextractionnetworkforvehiclesearchtasksbasedoncrosscameraforvehicleroadcollaborativeperception
AT luotong phasedfeatureextractionnetworkforvehiclesearchtasksbasedoncrosscameraforvehicleroadcollaborativeperception