Cargando…

Channel Exchanging for RGB-T Tracking

It is difficult to achieve all-weather visual object tracking in an open environment only utilizing single modality data input. Due to the complementarity of RGB and thermal infrared (TIR) data in various complex environments, a more robust object tracking framework can be obtained using video data...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Long, Zhu, Meng, Ren, Honge, Xue, Lingjixuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8434326/
https://www.ncbi.nlm.nih.gov/pubmed/34502691
http://dx.doi.org/10.3390/s21175800
_version_ 1783751572031799296
author Zhao, Long
Zhu, Meng
Ren, Honge
Xue, Lingjixuan
author_facet Zhao, Long
Zhu, Meng
Ren, Honge
Xue, Lingjixuan
author_sort Zhao, Long
collection PubMed
description It is difficult to achieve all-weather visual object tracking in an open environment only utilizing single modality data input. Due to the complementarity of RGB and thermal infrared (TIR) data in various complex environments, a more robust object tracking framework can be obtained using video data of these two modalities. The fusion methods of RGB and TIR data are the core elements to determine the performance of the RGB-T object tracking method, and the existing RGB-T trackers have not solved this problem well. In order to solve the current low utilization of information intra single modality in aggregation-based methods and between two modalities in alignment-based methods, we used DiMP as the baseline tracker to design an RGB-T object tracking framework channel exchanging DiMP (CEDiMP) based on channel exchanging. CEDiMP achieves dynamic channel exchanging between sub-networks of different modes hardly adding any parameters during the feature fusion process. The expression ability of the deep features generated by our data fusion method based on channel exchanging is stronger. At the same time, in order to solve the poor generalization ability of the existing RGB-T object tracking methods and the poor ability in the long-term object tracking, more training of CEDiMP on the synthetic dataset LaSOT-RGBT is added. A large number of experiments demonstrate the effectiveness of the proposed model. CEDiMP achieves the best performance on two RGB-T object tracking benchmark datasets, GTOT and RGBT234, and performs outstandingly in the generalization testing.
format Online
Article
Text
id pubmed-8434326
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-84343262021-09-12 Channel Exchanging for RGB-T Tracking Zhao, Long Zhu, Meng Ren, Honge Xue, Lingjixuan Sensors (Basel) Article It is difficult to achieve all-weather visual object tracking in an open environment only utilizing single modality data input. Due to the complementarity of RGB and thermal infrared (TIR) data in various complex environments, a more robust object tracking framework can be obtained using video data of these two modalities. The fusion methods of RGB and TIR data are the core elements to determine the performance of the RGB-T object tracking method, and the existing RGB-T trackers have not solved this problem well. In order to solve the current low utilization of information intra single modality in aggregation-based methods and between two modalities in alignment-based methods, we used DiMP as the baseline tracker to design an RGB-T object tracking framework channel exchanging DiMP (CEDiMP) based on channel exchanging. CEDiMP achieves dynamic channel exchanging between sub-networks of different modes hardly adding any parameters during the feature fusion process. The expression ability of the deep features generated by our data fusion method based on channel exchanging is stronger. At the same time, in order to solve the poor generalization ability of the existing RGB-T object tracking methods and the poor ability in the long-term object tracking, more training of CEDiMP on the synthetic dataset LaSOT-RGBT is added. A large number of experiments demonstrate the effectiveness of the proposed model. CEDiMP achieves the best performance on two RGB-T object tracking benchmark datasets, GTOT and RGBT234, and performs outstandingly in the generalization testing. MDPI 2021-08-28 /pmc/articles/PMC8434326/ /pubmed/34502691 http://dx.doi.org/10.3390/s21175800 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhao, Long
Zhu, Meng
Ren, Honge
Xue, Lingjixuan
Channel Exchanging for RGB-T Tracking
title Channel Exchanging for RGB-T Tracking
title_full Channel Exchanging for RGB-T Tracking
title_fullStr Channel Exchanging for RGB-T Tracking
title_full_unstemmed Channel Exchanging for RGB-T Tracking
title_short Channel Exchanging for RGB-T Tracking
title_sort channel exchanging for rgb-t tracking
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8434326/
https://www.ncbi.nlm.nih.gov/pubmed/34502691
http://dx.doi.org/10.3390/s21175800
work_keys_str_mv AT zhaolong channelexchangingforrgbttracking
AT zhumeng channelexchangingforrgbttracking
AT renhonge channelexchangingforrgbttracking
AT xuelingjixuan channelexchangingforrgbttracking