Cargando…

Graph Sampling-Based Multi-Stream Enhancement Network for Visible-Infrared Person Re-Identification

With the increasing demand for person re-identification (Re-ID) tasks, the need for all-day retrieval has become an inevitable trend. Nevertheless, single-modal Re-ID is no longer sufficient to meet this requirement, making Multi-Modal Data crucial in Re-ID. Consequently, a Visible-Infrared Person R...

Descripción completa

Detalles Bibliográficos
Autores principales:	Jiang, Jinhua, Xiao, Junjie, Wang, Renlin, Li, Tiansong, Zhang, Wenfeng, Ran, Ruisheng, Xiang, Sen
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10534846/ https://www.ncbi.nlm.nih.gov/pubmed/37766005 http://dx.doi.org/10.3390/s23187948

_version_	1785112490385866752
author	Jiang, Jinhua Xiao, Junjie Wang, Renlin Li, Tiansong Zhang, Wenfeng Ran, Ruisheng Xiang, Sen
author_facet	Jiang, Jinhua Xiao, Junjie Wang, Renlin Li, Tiansong Zhang, Wenfeng Ran, Ruisheng Xiang, Sen
author_sort	Jiang, Jinhua
collection	PubMed
description	With the increasing demand for person re-identification (Re-ID) tasks, the need for all-day retrieval has become an inevitable trend. Nevertheless, single-modal Re-ID is no longer sufficient to meet this requirement, making Multi-Modal Data crucial in Re-ID. Consequently, a Visible-Infrared Person Re-Identification (VI Re-ID) task is proposed, which aims to match pairs of person images from the visible and infrared modalities. The significant modality discrepancy between the modalities poses a major challenge. Existing VI Re-ID methods focus on cross-modal feature learning and modal transformation to alleviate the discrepancy but overlook the impact of person contour information. Contours exhibit modality invariance, which is vital for learning effective identity representations and cross-modal matching. In addition, due to the low intra-modal diversity in the visible modality, it is difficult to distinguish the boundaries between some hard samples. To address these issues, we propose the Graph Sampling-based Multi-stream Enhancement Network (GSMEN). Firstly, the Contour Expansion Module (CEM) incorporates the contour information of a person into the original samples, further reducing the modality discrepancy and leading to improved matching stability between image pairs of different modalities. Additionally, to better distinguish cross-modal hard sample pairs during the training process, an innovative Cross-modality Graph Sampler (CGS) is designed for sample selection before training. The CGS calculates the feature distance between samples from different modalities and groups similar samples into the same batch during the training process, effectively exploring the boundary relationships between hard classes in the cross-modal setting. Some experiments conducted on the SYSU-MM01 and RegDB datasets demonstrate the superiority of our proposed method. Specifically, in the VIS→IR task, the experimental results on the RegDB dataset achieve 93.69% for Rank-1 and 92.56% for mAP.
format	Online Article Text
id	pubmed-10534846
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-105348462023-09-29 Graph Sampling-Based Multi-Stream Enhancement Network for Visible-Infrared Person Re-Identification Jiang, Jinhua Xiao, Junjie Wang, Renlin Li, Tiansong Zhang, Wenfeng Ran, Ruisheng Xiang, Sen Sensors (Basel) Article With the increasing demand for person re-identification (Re-ID) tasks, the need for all-day retrieval has become an inevitable trend. Nevertheless, single-modal Re-ID is no longer sufficient to meet this requirement, making Multi-Modal Data crucial in Re-ID. Consequently, a Visible-Infrared Person Re-Identification (VI Re-ID) task is proposed, which aims to match pairs of person images from the visible and infrared modalities. The significant modality discrepancy between the modalities poses a major challenge. Existing VI Re-ID methods focus on cross-modal feature learning and modal transformation to alleviate the discrepancy but overlook the impact of person contour information. Contours exhibit modality invariance, which is vital for learning effective identity representations and cross-modal matching. In addition, due to the low intra-modal diversity in the visible modality, it is difficult to distinguish the boundaries between some hard samples. To address these issues, we propose the Graph Sampling-based Multi-stream Enhancement Network (GSMEN). Firstly, the Contour Expansion Module (CEM) incorporates the contour information of a person into the original samples, further reducing the modality discrepancy and leading to improved matching stability between image pairs of different modalities. Additionally, to better distinguish cross-modal hard sample pairs during the training process, an innovative Cross-modality Graph Sampler (CGS) is designed for sample selection before training. The CGS calculates the feature distance between samples from different modalities and groups similar samples into the same batch during the training process, effectively exploring the boundary relationships between hard classes in the cross-modal setting. Some experiments conducted on the SYSU-MM01 and RegDB datasets demonstrate the superiority of our proposed method. Specifically, in the VIS→IR task, the experimental results on the RegDB dataset achieve 93.69% for Rank-1 and 92.56% for mAP. MDPI 2023-09-18 /pmc/articles/PMC10534846/ /pubmed/37766005 http://dx.doi.org/10.3390/s23187948 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Jiang, Jinhua Xiao, Junjie Wang, Renlin Li, Tiansong Zhang, Wenfeng Ran, Ruisheng Xiang, Sen Graph Sampling-Based Multi-Stream Enhancement Network for Visible-Infrared Person Re-Identification
title	Graph Sampling-Based Multi-Stream Enhancement Network for Visible-Infrared Person Re-Identification
title_full	Graph Sampling-Based Multi-Stream Enhancement Network for Visible-Infrared Person Re-Identification
title_fullStr	Graph Sampling-Based Multi-Stream Enhancement Network for Visible-Infrared Person Re-Identification
title_full_unstemmed	Graph Sampling-Based Multi-Stream Enhancement Network for Visible-Infrared Person Re-Identification
title_short	Graph Sampling-Based Multi-Stream Enhancement Network for Visible-Infrared Person Re-Identification
title_sort	graph sampling-based multi-stream enhancement network for visible-infrared person re-identification
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10534846/ https://www.ncbi.nlm.nih.gov/pubmed/37766005 http://dx.doi.org/10.3390/s23187948
work_keys_str_mv	AT jiangjinhua graphsamplingbasedmultistreamenhancementnetworkforvisibleinfraredpersonreidentification AT xiaojunjie graphsamplingbasedmultistreamenhancementnetworkforvisibleinfraredpersonreidentification AT wangrenlin graphsamplingbasedmultistreamenhancementnetworkforvisibleinfraredpersonreidentification AT litiansong graphsamplingbasedmultistreamenhancementnetworkforvisibleinfraredpersonreidentification AT zhangwenfeng graphsamplingbasedmultistreamenhancementnetworkforvisibleinfraredpersonreidentification AT ranruisheng graphsamplingbasedmultistreamenhancementnetworkforvisibleinfraredpersonreidentification AT xiangsen graphsamplingbasedmultistreamenhancementnetworkforvisibleinfraredpersonreidentification

Graph Sampling-Based Multi-Stream Enhancement Network for Visible-Infrared Person Re-Identification

Ejemplares similares