Cargando…
Siamese hierarchical feature fusion transformer for efficient tracking
Object tracking is a fundamental task in computer vision. Recent years, most of the tracking algorithms are based on deep networks. Trackers with deeper backbones are computationally expensive and can hardly meet the real-time requirements on edge platforms. Lightweight networks are widely used to t...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9752039/ https://www.ncbi.nlm.nih.gov/pubmed/36531916 http://dx.doi.org/10.3389/fnbot.2022.1082346 |
_version_ | 1784850621141090304 |
---|---|
author | Dai, Jiahai Fu, Yunhao Wang, Songxin Chang, Yuchun |
author_facet | Dai, Jiahai Fu, Yunhao Wang, Songxin Chang, Yuchun |
author_sort | Dai, Jiahai |
collection | PubMed |
description | Object tracking is a fundamental task in computer vision. Recent years, most of the tracking algorithms are based on deep networks. Trackers with deeper backbones are computationally expensive and can hardly meet the real-time requirements on edge platforms. Lightweight networks are widely used to tackle this issue, but the features extracted by a lightweight backbone are inadequate for discriminating the object from the background in complex scenarios, especially for small objects tracking task. In this paper, we adopted a lightweight backbone and extracted features from multiple levels. A hierarchical feature fusion transformer (HFFT) was designed to mine the interdependencies of multi-level features in a novel model—SiamHFFT. Therefore, our tracker can exploit comprehensive feature representations in an end-to-end manner, and the proposed model is capable of handling small target tracking in complex scenarios on a CPU at a rate of 29 FPS. Comprehensive experimental results on UAV123, UAV123@10fps, LaSOT, VOT2020, and GOT-10k benchmarks with multiple trackers demonstrate the effectiveness and efficiency of SiamHFFT. In particular, our SiamHFFT achieves good performance both in accuracy and speed, which has practical implications in terms of improving small object tracking performance in the real world. |
format | Online Article Text |
id | pubmed-9752039 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-97520392022-12-16 Siamese hierarchical feature fusion transformer for efficient tracking Dai, Jiahai Fu, Yunhao Wang, Songxin Chang, Yuchun Front Neurorobot Neuroscience Object tracking is a fundamental task in computer vision. Recent years, most of the tracking algorithms are based on deep networks. Trackers with deeper backbones are computationally expensive and can hardly meet the real-time requirements on edge platforms. Lightweight networks are widely used to tackle this issue, but the features extracted by a lightweight backbone are inadequate for discriminating the object from the background in complex scenarios, especially for small objects tracking task. In this paper, we adopted a lightweight backbone and extracted features from multiple levels. A hierarchical feature fusion transformer (HFFT) was designed to mine the interdependencies of multi-level features in a novel model—SiamHFFT. Therefore, our tracker can exploit comprehensive feature representations in an end-to-end manner, and the proposed model is capable of handling small target tracking in complex scenarios on a CPU at a rate of 29 FPS. Comprehensive experimental results on UAV123, UAV123@10fps, LaSOT, VOT2020, and GOT-10k benchmarks with multiple trackers demonstrate the effectiveness and efficiency of SiamHFFT. In particular, our SiamHFFT achieves good performance both in accuracy and speed, which has practical implications in terms of improving small object tracking performance in the real world. Frontiers Media S.A. 2022-12-01 /pmc/articles/PMC9752039/ /pubmed/36531916 http://dx.doi.org/10.3389/fnbot.2022.1082346 Text en Copyright © 2022 Dai, Fu, Wang and Chang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Neuroscience Dai, Jiahai Fu, Yunhao Wang, Songxin Chang, Yuchun Siamese hierarchical feature fusion transformer for efficient tracking |
title | Siamese hierarchical feature fusion transformer for efficient tracking |
title_full | Siamese hierarchical feature fusion transformer for efficient tracking |
title_fullStr | Siamese hierarchical feature fusion transformer for efficient tracking |
title_full_unstemmed | Siamese hierarchical feature fusion transformer for efficient tracking |
title_short | Siamese hierarchical feature fusion transformer for efficient tracking |
title_sort | siamese hierarchical feature fusion transformer for efficient tracking |
topic | Neuroscience |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9752039/ https://www.ncbi.nlm.nih.gov/pubmed/36531916 http://dx.doi.org/10.3389/fnbot.2022.1082346 |
work_keys_str_mv | AT daijiahai siamesehierarchicalfeaturefusiontransformerforefficienttracking AT fuyunhao siamesehierarchicalfeaturefusiontransformerforefficienttracking AT wangsongxin siamesehierarchicalfeaturefusiontransformerforefficienttracking AT changyuchun siamesehierarchicalfeaturefusiontransformerforefficienttracking |