Cargando…

Multi-Scale Feature Interactive Fusion Network for RGBT Tracking

The fusion tracking of RGB and thermal infrared image (RGBT) is paid wide attention to due to their complementary advantages. Currently, most algorithms obtain modality weights through attention mechanisms to integrate multi-modalities information. They do not fully exploit the multi-scale informati...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiao, Xianbing, Xiong, Xingzhong, Meng, Fanqin, Chen, Zhen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10098685/
https://www.ncbi.nlm.nih.gov/pubmed/37050470
http://dx.doi.org/10.3390/s23073410
_version_ 1785024869477384192
author Xiao, Xianbing
Xiong, Xingzhong
Meng, Fanqin
Chen, Zhen
author_facet Xiao, Xianbing
Xiong, Xingzhong
Meng, Fanqin
Chen, Zhen
author_sort Xiao, Xianbing
collection PubMed
description The fusion tracking of RGB and thermal infrared image (RGBT) is paid wide attention to due to their complementary advantages. Currently, most algorithms obtain modality weights through attention mechanisms to integrate multi-modalities information. They do not fully exploit the multi-scale information and ignore the rich contextual information among features, which limits the tracking performance to some extent. To solve this problem, this work proposes a new multi-scale feature interactive fusion network (MSIFNet) for RGBT tracking. Specifically, we use different convolution branches for multi-scale feature extraction and aggregate them through the feature selection module adaptively. At the same time, a Transformer interactive fusion module is proposed to build long-distance dependencies and enhance semantic representation further. Finally, a global feature fusion module is designed to adjust the global information adaptively. Numerous experiments on publicly available GTOT, RGBT234, and LasHeR datasets show that our algorithm outperforms the current mainstream tracking algorithms.
format Online
Article
Text
id pubmed-10098685
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-100986852023-04-14 Multi-Scale Feature Interactive Fusion Network for RGBT Tracking Xiao, Xianbing Xiong, Xingzhong Meng, Fanqin Chen, Zhen Sensors (Basel) Article The fusion tracking of RGB and thermal infrared image (RGBT) is paid wide attention to due to their complementary advantages. Currently, most algorithms obtain modality weights through attention mechanisms to integrate multi-modalities information. They do not fully exploit the multi-scale information and ignore the rich contextual information among features, which limits the tracking performance to some extent. To solve this problem, this work proposes a new multi-scale feature interactive fusion network (MSIFNet) for RGBT tracking. Specifically, we use different convolution branches for multi-scale feature extraction and aggregate them through the feature selection module adaptively. At the same time, a Transformer interactive fusion module is proposed to build long-distance dependencies and enhance semantic representation further. Finally, a global feature fusion module is designed to adjust the global information adaptively. Numerous experiments on publicly available GTOT, RGBT234, and LasHeR datasets show that our algorithm outperforms the current mainstream tracking algorithms. MDPI 2023-03-24 /pmc/articles/PMC10098685/ /pubmed/37050470 http://dx.doi.org/10.3390/s23073410 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Xiao, Xianbing
Xiong, Xingzhong
Meng, Fanqin
Chen, Zhen
Multi-Scale Feature Interactive Fusion Network for RGBT Tracking
title Multi-Scale Feature Interactive Fusion Network for RGBT Tracking
title_full Multi-Scale Feature Interactive Fusion Network for RGBT Tracking
title_fullStr Multi-Scale Feature Interactive Fusion Network for RGBT Tracking
title_full_unstemmed Multi-Scale Feature Interactive Fusion Network for RGBT Tracking
title_short Multi-Scale Feature Interactive Fusion Network for RGBT Tracking
title_sort multi-scale feature interactive fusion network for rgbt tracking
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10098685/
https://www.ncbi.nlm.nih.gov/pubmed/37050470
http://dx.doi.org/10.3390/s23073410
work_keys_str_mv AT xiaoxianbing multiscalefeatureinteractivefusionnetworkforrgbttracking
AT xiongxingzhong multiscalefeatureinteractivefusionnetworkforrgbttracking
AT mengfanqin multiscalefeatureinteractivefusionnetworkforrgbttracking
AT chenzhen multiscalefeatureinteractivefusionnetworkforrgbttracking