Cargando…
Channel Exchanging for RGB-T Tracking
It is difficult to achieve all-weather visual object tracking in an open environment only utilizing single modality data input. Due to the complementarity of RGB and thermal infrared (TIR) data in various complex environments, a more robust object tracking framework can be obtained using video data...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8434326/ https://www.ncbi.nlm.nih.gov/pubmed/34502691 http://dx.doi.org/10.3390/s21175800 |
_version_ | 1783751572031799296 |
---|---|
author | Zhao, Long Zhu, Meng Ren, Honge Xue, Lingjixuan |
author_facet | Zhao, Long Zhu, Meng Ren, Honge Xue, Lingjixuan |
author_sort | Zhao, Long |
collection | PubMed |
description | It is difficult to achieve all-weather visual object tracking in an open environment only utilizing single modality data input. Due to the complementarity of RGB and thermal infrared (TIR) data in various complex environments, a more robust object tracking framework can be obtained using video data of these two modalities. The fusion methods of RGB and TIR data are the core elements to determine the performance of the RGB-T object tracking method, and the existing RGB-T trackers have not solved this problem well. In order to solve the current low utilization of information intra single modality in aggregation-based methods and between two modalities in alignment-based methods, we used DiMP as the baseline tracker to design an RGB-T object tracking framework channel exchanging DiMP (CEDiMP) based on channel exchanging. CEDiMP achieves dynamic channel exchanging between sub-networks of different modes hardly adding any parameters during the feature fusion process. The expression ability of the deep features generated by our data fusion method based on channel exchanging is stronger. At the same time, in order to solve the poor generalization ability of the existing RGB-T object tracking methods and the poor ability in the long-term object tracking, more training of CEDiMP on the synthetic dataset LaSOT-RGBT is added. A large number of experiments demonstrate the effectiveness of the proposed model. CEDiMP achieves the best performance on two RGB-T object tracking benchmark datasets, GTOT and RGBT234, and performs outstandingly in the generalization testing. |
format | Online Article Text |
id | pubmed-8434326 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-84343262021-09-12 Channel Exchanging for RGB-T Tracking Zhao, Long Zhu, Meng Ren, Honge Xue, Lingjixuan Sensors (Basel) Article It is difficult to achieve all-weather visual object tracking in an open environment only utilizing single modality data input. Due to the complementarity of RGB and thermal infrared (TIR) data in various complex environments, a more robust object tracking framework can be obtained using video data of these two modalities. The fusion methods of RGB and TIR data are the core elements to determine the performance of the RGB-T object tracking method, and the existing RGB-T trackers have not solved this problem well. In order to solve the current low utilization of information intra single modality in aggregation-based methods and between two modalities in alignment-based methods, we used DiMP as the baseline tracker to design an RGB-T object tracking framework channel exchanging DiMP (CEDiMP) based on channel exchanging. CEDiMP achieves dynamic channel exchanging between sub-networks of different modes hardly adding any parameters during the feature fusion process. The expression ability of the deep features generated by our data fusion method based on channel exchanging is stronger. At the same time, in order to solve the poor generalization ability of the existing RGB-T object tracking methods and the poor ability in the long-term object tracking, more training of CEDiMP on the synthetic dataset LaSOT-RGBT is added. A large number of experiments demonstrate the effectiveness of the proposed model. CEDiMP achieves the best performance on two RGB-T object tracking benchmark datasets, GTOT and RGBT234, and performs outstandingly in the generalization testing. MDPI 2021-08-28 /pmc/articles/PMC8434326/ /pubmed/34502691 http://dx.doi.org/10.3390/s21175800 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Zhao, Long Zhu, Meng Ren, Honge Xue, Lingjixuan Channel Exchanging for RGB-T Tracking |
title | Channel Exchanging for RGB-T Tracking |
title_full | Channel Exchanging for RGB-T Tracking |
title_fullStr | Channel Exchanging for RGB-T Tracking |
title_full_unstemmed | Channel Exchanging for RGB-T Tracking |
title_short | Channel Exchanging for RGB-T Tracking |
title_sort | channel exchanging for rgb-t tracking |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8434326/ https://www.ncbi.nlm.nih.gov/pubmed/34502691 http://dx.doi.org/10.3390/s21175800 |
work_keys_str_mv | AT zhaolong channelexchangingforrgbttracking AT zhumeng channelexchangingforrgbttracking AT renhonge channelexchangingforrgbttracking AT xuelingjixuan channelexchangingforrgbttracking |