Cargando…

Channel Exchanging for RGB-T Tracking

It is difficult to achieve all-weather visual object tracking in an open environment only utilizing single modality data input. Due to the complementarity of RGB and thermal infrared (TIR) data in various complex environments, a more robust object tracking framework can be obtained using video data...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhao, Long, Zhu, Meng, Ren, Honge, Xue, Lingjixuan
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8434326/ https://www.ncbi.nlm.nih.gov/pubmed/34502691 http://dx.doi.org/10.3390/s21175800

_version_	1783751572031799296
author	Zhao, Long Zhu, Meng Ren, Honge Xue, Lingjixuan
author_facet	Zhao, Long Zhu, Meng Ren, Honge Xue, Lingjixuan
author_sort	Zhao, Long
collection	PubMed
description	It is difficult to achieve all-weather visual object tracking in an open environment only utilizing single modality data input. Due to the complementarity of RGB and thermal infrared (TIR) data in various complex environments, a more robust object tracking framework can be obtained using video data of these two modalities. The fusion methods of RGB and TIR data are the core elements to determine the performance of the RGB-T object tracking method, and the existing RGB-T trackers have not solved this problem well. In order to solve the current low utilization of information intra single modality in aggregation-based methods and between two modalities in alignment-based methods, we used DiMP as the baseline tracker to design an RGB-T object tracking framework channel exchanging DiMP (CEDiMP) based on channel exchanging. CEDiMP achieves dynamic channel exchanging between sub-networks of different modes hardly adding any parameters during the feature fusion process. The expression ability of the deep features generated by our data fusion method based on channel exchanging is stronger. At the same time, in order to solve the poor generalization ability of the existing RGB-T object tracking methods and the poor ability in the long-term object tracking, more training of CEDiMP on the synthetic dataset LaSOT-RGBT is added. A large number of experiments demonstrate the effectiveness of the proposed model. CEDiMP achieves the best performance on two RGB-T object tracking benchmark datasets, GTOT and RGBT234, and performs outstandingly in the generalization testing.
format	Online Article Text
id	pubmed-8434326
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-84343262021-09-12 Channel Exchanging for RGB-T Tracking Zhao, Long Zhu, Meng Ren, Honge Xue, Lingjixuan Sensors (Basel) Article It is difficult to achieve all-weather visual object tracking in an open environment only utilizing single modality data input. Due to the complementarity of RGB and thermal infrared (TIR) data in various complex environments, a more robust object tracking framework can be obtained using video data of these two modalities. The fusion methods of RGB and TIR data are the core elements to determine the performance of the RGB-T object tracking method, and the existing RGB-T trackers have not solved this problem well. In order to solve the current low utilization of information intra single modality in aggregation-based methods and between two modalities in alignment-based methods, we used DiMP as the baseline tracker to design an RGB-T object tracking framework channel exchanging DiMP (CEDiMP) based on channel exchanging. CEDiMP achieves dynamic channel exchanging between sub-networks of different modes hardly adding any parameters during the feature fusion process. The expression ability of the deep features generated by our data fusion method based on channel exchanging is stronger. At the same time, in order to solve the poor generalization ability of the existing RGB-T object tracking methods and the poor ability in the long-term object tracking, more training of CEDiMP on the synthetic dataset LaSOT-RGBT is added. A large number of experiments demonstrate the effectiveness of the proposed model. CEDiMP achieves the best performance on two RGB-T object tracking benchmark datasets, GTOT and RGBT234, and performs outstandingly in the generalization testing. MDPI 2021-08-28 /pmc/articles/PMC8434326/ /pubmed/34502691 http://dx.doi.org/10.3390/s21175800 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Zhao, Long Zhu, Meng Ren, Honge Xue, Lingjixuan Channel Exchanging for RGB-T Tracking
title	Channel Exchanging for RGB-T Tracking
title_full	Channel Exchanging for RGB-T Tracking
title_fullStr	Channel Exchanging for RGB-T Tracking
title_full_unstemmed	Channel Exchanging for RGB-T Tracking
title_short	Channel Exchanging for RGB-T Tracking
title_sort	channel exchanging for rgb-t tracking
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8434326/ https://www.ncbi.nlm.nih.gov/pubmed/34502691 http://dx.doi.org/10.3390/s21175800
work_keys_str_mv	AT zhaolong channelexchangingforrgbttracking AT zhumeng channelexchangingforrgbttracking AT renhonge channelexchangingforrgbttracking AT xuelingjixuan channelexchangingforrgbttracking

Channel Exchanging for RGB-T Tracking

Ejemplares similares