Cargando…

Learning Enhanced Feature Responses for Visual Object Tracking

Visual object tracking is an important topic in computer vision, which has successfully utilized pretrained convolutional neural networks, such as VGG and ResNet. However, the features extracted by these pretrained models are high dimensional, and the redundant feature channels reduce target localiz...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Runqing, Fan, Chunxiao, Ming, Yue
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8847016/
https://www.ncbi.nlm.nih.gov/pubmed/35178074
http://dx.doi.org/10.1155/2022/1241687
_version_ 1784651960336515072
author Zhang, Runqing
Fan, Chunxiao
Ming, Yue
author_facet Zhang, Runqing
Fan, Chunxiao
Ming, Yue
author_sort Zhang, Runqing
collection PubMed
description Visual object tracking is an important topic in computer vision, which has successfully utilized pretrained convolutional neural networks, such as VGG and ResNet. However, the features extracted by these pretrained models are high dimensional, and the redundant feature channels reduce target localization and scale estimation precision, leading to tracking drifting. In this paper, a novel visual object tracking method, called learning enhanced feature responses tracking (LEFRT), is proposed, which adopts the target-specific features to enhance target localization and scale estimation responses. First, a channel attention module, called target-specific network (TSNet), is presented to reduce the redundant feature channels. Secondly, the scale estimation network (SCENet) is introduced to extract spatial structural features to generate a more precise response for the scale estimation. Extensive experiments on six tracking benchmarks, including LaSOT, GOT-10k, TrackingNet, OTB-2013, OTB-2015, and TC-128, demonstrate that the proposed algorithm can effectively improve the precision and speed of visual object tracking. LEFRT achieves 90.4% precision and a 71.2% success rate on the OTB-2015 dataset, improving the tracking methods based on the pretrained features.
format Online
Article
Text
id pubmed-8847016
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-88470162022-02-16 Learning Enhanced Feature Responses for Visual Object Tracking Zhang, Runqing Fan, Chunxiao Ming, Yue Comput Intell Neurosci Research Article Visual object tracking is an important topic in computer vision, which has successfully utilized pretrained convolutional neural networks, such as VGG and ResNet. However, the features extracted by these pretrained models are high dimensional, and the redundant feature channels reduce target localization and scale estimation precision, leading to tracking drifting. In this paper, a novel visual object tracking method, called learning enhanced feature responses tracking (LEFRT), is proposed, which adopts the target-specific features to enhance target localization and scale estimation responses. First, a channel attention module, called target-specific network (TSNet), is presented to reduce the redundant feature channels. Secondly, the scale estimation network (SCENet) is introduced to extract spatial structural features to generate a more precise response for the scale estimation. Extensive experiments on six tracking benchmarks, including LaSOT, GOT-10k, TrackingNet, OTB-2013, OTB-2015, and TC-128, demonstrate that the proposed algorithm can effectively improve the precision and speed of visual object tracking. LEFRT achieves 90.4% precision and a 71.2% success rate on the OTB-2015 dataset, improving the tracking methods based on the pretrained features. Hindawi 2022-02-08 /pmc/articles/PMC8847016/ /pubmed/35178074 http://dx.doi.org/10.1155/2022/1241687 Text en Copyright © 2022 Runqing Zhang et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhang, Runqing
Fan, Chunxiao
Ming, Yue
Learning Enhanced Feature Responses for Visual Object Tracking
title Learning Enhanced Feature Responses for Visual Object Tracking
title_full Learning Enhanced Feature Responses for Visual Object Tracking
title_fullStr Learning Enhanced Feature Responses for Visual Object Tracking
title_full_unstemmed Learning Enhanced Feature Responses for Visual Object Tracking
title_short Learning Enhanced Feature Responses for Visual Object Tracking
title_sort learning enhanced feature responses for visual object tracking
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8847016/
https://www.ncbi.nlm.nih.gov/pubmed/35178074
http://dx.doi.org/10.1155/2022/1241687
work_keys_str_mv AT zhangrunqing learningenhancedfeatureresponsesforvisualobjecttracking
AT fanchunxiao learningenhancedfeatureresponsesforvisualobjecttracking
AT mingyue learningenhancedfeatureresponsesforvisualobjecttracking