Cargando…

Siamese hierarchical feature fusion transformer for efficient tracking

Object tracking is a fundamental task in computer vision. Recent years, most of the tracking algorithms are based on deep networks. Trackers with deeper backbones are computationally expensive and can hardly meet the real-time requirements on edge platforms. Lightweight networks are widely used to t...

Descripción completa

Detalles Bibliográficos
Autores principales: Dai, Jiahai, Fu, Yunhao, Wang, Songxin, Chang, Yuchun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9752039/
https://www.ncbi.nlm.nih.gov/pubmed/36531916
http://dx.doi.org/10.3389/fnbot.2022.1082346
_version_ 1784850621141090304
author Dai, Jiahai
Fu, Yunhao
Wang, Songxin
Chang, Yuchun
author_facet Dai, Jiahai
Fu, Yunhao
Wang, Songxin
Chang, Yuchun
author_sort Dai, Jiahai
collection PubMed
description Object tracking is a fundamental task in computer vision. Recent years, most of the tracking algorithms are based on deep networks. Trackers with deeper backbones are computationally expensive and can hardly meet the real-time requirements on edge platforms. Lightweight networks are widely used to tackle this issue, but the features extracted by a lightweight backbone are inadequate for discriminating the object from the background in complex scenarios, especially for small objects tracking task. In this paper, we adopted a lightweight backbone and extracted features from multiple levels. A hierarchical feature fusion transformer (HFFT) was designed to mine the interdependencies of multi-level features in a novel model—SiamHFFT. Therefore, our tracker can exploit comprehensive feature representations in an end-to-end manner, and the proposed model is capable of handling small target tracking in complex scenarios on a CPU at a rate of 29 FPS. Comprehensive experimental results on UAV123, UAV123@10fps, LaSOT, VOT2020, and GOT-10k benchmarks with multiple trackers demonstrate the effectiveness and efficiency of SiamHFFT. In particular, our SiamHFFT achieves good performance both in accuracy and speed, which has practical implications in terms of improving small object tracking performance in the real world.
format Online
Article
Text
id pubmed-9752039
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-97520392022-12-16 Siamese hierarchical feature fusion transformer for efficient tracking Dai, Jiahai Fu, Yunhao Wang, Songxin Chang, Yuchun Front Neurorobot Neuroscience Object tracking is a fundamental task in computer vision. Recent years, most of the tracking algorithms are based on deep networks. Trackers with deeper backbones are computationally expensive and can hardly meet the real-time requirements on edge platforms. Lightweight networks are widely used to tackle this issue, but the features extracted by a lightweight backbone are inadequate for discriminating the object from the background in complex scenarios, especially for small objects tracking task. In this paper, we adopted a lightweight backbone and extracted features from multiple levels. A hierarchical feature fusion transformer (HFFT) was designed to mine the interdependencies of multi-level features in a novel model—SiamHFFT. Therefore, our tracker can exploit comprehensive feature representations in an end-to-end manner, and the proposed model is capable of handling small target tracking in complex scenarios on a CPU at a rate of 29 FPS. Comprehensive experimental results on UAV123, UAV123@10fps, LaSOT, VOT2020, and GOT-10k benchmarks with multiple trackers demonstrate the effectiveness and efficiency of SiamHFFT. In particular, our SiamHFFT achieves good performance both in accuracy and speed, which has practical implications in terms of improving small object tracking performance in the real world. Frontiers Media S.A. 2022-12-01 /pmc/articles/PMC9752039/ /pubmed/36531916 http://dx.doi.org/10.3389/fnbot.2022.1082346 Text en Copyright © 2022 Dai, Fu, Wang and Chang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Dai, Jiahai
Fu, Yunhao
Wang, Songxin
Chang, Yuchun
Siamese hierarchical feature fusion transformer for efficient tracking
title Siamese hierarchical feature fusion transformer for efficient tracking
title_full Siamese hierarchical feature fusion transformer for efficient tracking
title_fullStr Siamese hierarchical feature fusion transformer for efficient tracking
title_full_unstemmed Siamese hierarchical feature fusion transformer for efficient tracking
title_short Siamese hierarchical feature fusion transformer for efficient tracking
title_sort siamese hierarchical feature fusion transformer for efficient tracking
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9752039/
https://www.ncbi.nlm.nih.gov/pubmed/36531916
http://dx.doi.org/10.3389/fnbot.2022.1082346
work_keys_str_mv AT daijiahai siamesehierarchicalfeaturefusiontransformerforefficienttracking
AT fuyunhao siamesehierarchicalfeaturefusiontransformerforefficienttracking
AT wangsongxin siamesehierarchicalfeaturefusiontransformerforefficienttracking
AT changyuchun siamesehierarchicalfeaturefusiontransformerforefficienttracking