Cargando…

Learning Enhanced Feature Responses for Visual Object Tracking

Visual object tracking is an important topic in computer vision, which has successfully utilized pretrained convolutional neural networks, such as VGG and ResNet. However, the features extracted by these pretrained models are high dimensional, and the redundant feature channels reduce target localiz...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhang, Runqing, Fan, Chunxiao, Ming, Yue
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Hindawi 2022
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8847016/ https://www.ncbi.nlm.nih.gov/pubmed/35178074 http://dx.doi.org/10.1155/2022/1241687

_version_	1784651960336515072
author	Zhang, Runqing Fan, Chunxiao Ming, Yue
author_facet	Zhang, Runqing Fan, Chunxiao Ming, Yue
author_sort	Zhang, Runqing
collection	PubMed
description	Visual object tracking is an important topic in computer vision, which has successfully utilized pretrained convolutional neural networks, such as VGG and ResNet. However, the features extracted by these pretrained models are high dimensional, and the redundant feature channels reduce target localization and scale estimation precision, leading to tracking drifting. In this paper, a novel visual object tracking method, called learning enhanced feature responses tracking (LEFRT), is proposed, which adopts the target-specific features to enhance target localization and scale estimation responses. First, a channel attention module, called target-specific network (TSNet), is presented to reduce the redundant feature channels. Secondly, the scale estimation network (SCENet) is introduced to extract spatial structural features to generate a more precise response for the scale estimation. Extensive experiments on six tracking benchmarks, including LaSOT, GOT-10k, TrackingNet, OTB-2013, OTB-2015, and TC-128, demonstrate that the proposed algorithm can effectively improve the precision and speed of visual object tracking. LEFRT achieves 90.4% precision and a 71.2% success rate on the OTB-2015 dataset, improving the tracking methods based on the pretrained features.
format	Online Article Text
id	pubmed-8847016
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Hindawi
record_format	MEDLINE/PubMed
spelling	pubmed-88470162022-02-16 Learning Enhanced Feature Responses for Visual Object Tracking Zhang, Runqing Fan, Chunxiao Ming, Yue Comput Intell Neurosci Research Article Visual object tracking is an important topic in computer vision, which has successfully utilized pretrained convolutional neural networks, such as VGG and ResNet. However, the features extracted by these pretrained models are high dimensional, and the redundant feature channels reduce target localization and scale estimation precision, leading to tracking drifting. In this paper, a novel visual object tracking method, called learning enhanced feature responses tracking (LEFRT), is proposed, which adopts the target-specific features to enhance target localization and scale estimation responses. First, a channel attention module, called target-specific network (TSNet), is presented to reduce the redundant feature channels. Secondly, the scale estimation network (SCENet) is introduced to extract spatial structural features to generate a more precise response for the scale estimation. Extensive experiments on six tracking benchmarks, including LaSOT, GOT-10k, TrackingNet, OTB-2013, OTB-2015, and TC-128, demonstrate that the proposed algorithm can effectively improve the precision and speed of visual object tracking. LEFRT achieves 90.4% precision and a 71.2% success rate on the OTB-2015 dataset, improving the tracking methods based on the pretrained features. Hindawi 2022-02-08 /pmc/articles/PMC8847016/ /pubmed/35178074 http://dx.doi.org/10.1155/2022/1241687 Text en Copyright © 2022 Runqing Zhang et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Zhang, Runqing Fan, Chunxiao Ming, Yue Learning Enhanced Feature Responses for Visual Object Tracking
title	Learning Enhanced Feature Responses for Visual Object Tracking
title_full	Learning Enhanced Feature Responses for Visual Object Tracking
title_fullStr	Learning Enhanced Feature Responses for Visual Object Tracking
title_full_unstemmed	Learning Enhanced Feature Responses for Visual Object Tracking
title_short	Learning Enhanced Feature Responses for Visual Object Tracking
title_sort	learning enhanced feature responses for visual object tracking
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8847016/ https://www.ncbi.nlm.nih.gov/pubmed/35178074 http://dx.doi.org/10.1155/2022/1241687
work_keys_str_mv	AT zhangrunqing learningenhancedfeatureresponsesforvisualobjecttracking AT fanchunxiao learningenhancedfeatureresponsesforvisualobjecttracking AT mingyue learningenhancedfeatureresponsesforvisualobjecttracking

Learning Enhanced Feature Responses for Visual Object Tracking

Ejemplares similares