Cargando…
Multi-Scale Feature Interactive Fusion Network for RGBT Tracking
The fusion tracking of RGB and thermal infrared image (RGBT) is paid wide attention to due to their complementary advantages. Currently, most algorithms obtain modality weights through attention mechanisms to integrate multi-modalities information. They do not fully exploit the multi-scale informati...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10098685/ https://www.ncbi.nlm.nih.gov/pubmed/37050470 http://dx.doi.org/10.3390/s23073410 |
_version_ | 1785024869477384192 |
---|---|
author | Xiao, Xianbing Xiong, Xingzhong Meng, Fanqin Chen, Zhen |
author_facet | Xiao, Xianbing Xiong, Xingzhong Meng, Fanqin Chen, Zhen |
author_sort | Xiao, Xianbing |
collection | PubMed |
description | The fusion tracking of RGB and thermal infrared image (RGBT) is paid wide attention to due to their complementary advantages. Currently, most algorithms obtain modality weights through attention mechanisms to integrate multi-modalities information. They do not fully exploit the multi-scale information and ignore the rich contextual information among features, which limits the tracking performance to some extent. To solve this problem, this work proposes a new multi-scale feature interactive fusion network (MSIFNet) for RGBT tracking. Specifically, we use different convolution branches for multi-scale feature extraction and aggregate them through the feature selection module adaptively. At the same time, a Transformer interactive fusion module is proposed to build long-distance dependencies and enhance semantic representation further. Finally, a global feature fusion module is designed to adjust the global information adaptively. Numerous experiments on publicly available GTOT, RGBT234, and LasHeR datasets show that our algorithm outperforms the current mainstream tracking algorithms. |
format | Online Article Text |
id | pubmed-10098685 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-100986852023-04-14 Multi-Scale Feature Interactive Fusion Network for RGBT Tracking Xiao, Xianbing Xiong, Xingzhong Meng, Fanqin Chen, Zhen Sensors (Basel) Article The fusion tracking of RGB and thermal infrared image (RGBT) is paid wide attention to due to their complementary advantages. Currently, most algorithms obtain modality weights through attention mechanisms to integrate multi-modalities information. They do not fully exploit the multi-scale information and ignore the rich contextual information among features, which limits the tracking performance to some extent. To solve this problem, this work proposes a new multi-scale feature interactive fusion network (MSIFNet) for RGBT tracking. Specifically, we use different convolution branches for multi-scale feature extraction and aggregate them through the feature selection module adaptively. At the same time, a Transformer interactive fusion module is proposed to build long-distance dependencies and enhance semantic representation further. Finally, a global feature fusion module is designed to adjust the global information adaptively. Numerous experiments on publicly available GTOT, RGBT234, and LasHeR datasets show that our algorithm outperforms the current mainstream tracking algorithms. MDPI 2023-03-24 /pmc/articles/PMC10098685/ /pubmed/37050470 http://dx.doi.org/10.3390/s23073410 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Xiao, Xianbing Xiong, Xingzhong Meng, Fanqin Chen, Zhen Multi-Scale Feature Interactive Fusion Network for RGBT Tracking |
title | Multi-Scale Feature Interactive Fusion Network for RGBT Tracking |
title_full | Multi-Scale Feature Interactive Fusion Network for RGBT Tracking |
title_fullStr | Multi-Scale Feature Interactive Fusion Network for RGBT Tracking |
title_full_unstemmed | Multi-Scale Feature Interactive Fusion Network for RGBT Tracking |
title_short | Multi-Scale Feature Interactive Fusion Network for RGBT Tracking |
title_sort | multi-scale feature interactive fusion network for rgbt tracking |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10098685/ https://www.ncbi.nlm.nih.gov/pubmed/37050470 http://dx.doi.org/10.3390/s23073410 |
work_keys_str_mv | AT xiaoxianbing multiscalefeatureinteractivefusionnetworkforrgbttracking AT xiongxingzhong multiscalefeatureinteractivefusionnetworkforrgbttracking AT mengfanqin multiscalefeatureinteractivefusionnetworkforrgbttracking AT chenzhen multiscalefeatureinteractivefusionnetworkforrgbttracking |