Cargando…
Learning Enhanced Feature Responses for Visual Object Tracking
Visual object tracking is an important topic in computer vision, which has successfully utilized pretrained convolutional neural networks, such as VGG and ResNet. However, the features extracted by these pretrained models are high dimensional, and the redundant feature channels reduce target localiz...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8847016/ https://www.ncbi.nlm.nih.gov/pubmed/35178074 http://dx.doi.org/10.1155/2022/1241687 |
_version_ | 1784651960336515072 |
---|---|
author | Zhang, Runqing Fan, Chunxiao Ming, Yue |
author_facet | Zhang, Runqing Fan, Chunxiao Ming, Yue |
author_sort | Zhang, Runqing |
collection | PubMed |
description | Visual object tracking is an important topic in computer vision, which has successfully utilized pretrained convolutional neural networks, such as VGG and ResNet. However, the features extracted by these pretrained models are high dimensional, and the redundant feature channels reduce target localization and scale estimation precision, leading to tracking drifting. In this paper, a novel visual object tracking method, called learning enhanced feature responses tracking (LEFRT), is proposed, which adopts the target-specific features to enhance target localization and scale estimation responses. First, a channel attention module, called target-specific network (TSNet), is presented to reduce the redundant feature channels. Secondly, the scale estimation network (SCENet) is introduced to extract spatial structural features to generate a more precise response for the scale estimation. Extensive experiments on six tracking benchmarks, including LaSOT, GOT-10k, TrackingNet, OTB-2013, OTB-2015, and TC-128, demonstrate that the proposed algorithm can effectively improve the precision and speed of visual object tracking. LEFRT achieves 90.4% precision and a 71.2% success rate on the OTB-2015 dataset, improving the tracking methods based on the pretrained features. |
format | Online Article Text |
id | pubmed-8847016 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-88470162022-02-16 Learning Enhanced Feature Responses for Visual Object Tracking Zhang, Runqing Fan, Chunxiao Ming, Yue Comput Intell Neurosci Research Article Visual object tracking is an important topic in computer vision, which has successfully utilized pretrained convolutional neural networks, such as VGG and ResNet. However, the features extracted by these pretrained models are high dimensional, and the redundant feature channels reduce target localization and scale estimation precision, leading to tracking drifting. In this paper, a novel visual object tracking method, called learning enhanced feature responses tracking (LEFRT), is proposed, which adopts the target-specific features to enhance target localization and scale estimation responses. First, a channel attention module, called target-specific network (TSNet), is presented to reduce the redundant feature channels. Secondly, the scale estimation network (SCENet) is introduced to extract spatial structural features to generate a more precise response for the scale estimation. Extensive experiments on six tracking benchmarks, including LaSOT, GOT-10k, TrackingNet, OTB-2013, OTB-2015, and TC-128, demonstrate that the proposed algorithm can effectively improve the precision and speed of visual object tracking. LEFRT achieves 90.4% precision and a 71.2% success rate on the OTB-2015 dataset, improving the tracking methods based on the pretrained features. Hindawi 2022-02-08 /pmc/articles/PMC8847016/ /pubmed/35178074 http://dx.doi.org/10.1155/2022/1241687 Text en Copyright © 2022 Runqing Zhang et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Zhang, Runqing Fan, Chunxiao Ming, Yue Learning Enhanced Feature Responses for Visual Object Tracking |
title | Learning Enhanced Feature Responses for Visual Object Tracking |
title_full | Learning Enhanced Feature Responses for Visual Object Tracking |
title_fullStr | Learning Enhanced Feature Responses for Visual Object Tracking |
title_full_unstemmed | Learning Enhanced Feature Responses for Visual Object Tracking |
title_short | Learning Enhanced Feature Responses for Visual Object Tracking |
title_sort | learning enhanced feature responses for visual object tracking |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8847016/ https://www.ncbi.nlm.nih.gov/pubmed/35178074 http://dx.doi.org/10.1155/2022/1241687 |
work_keys_str_mv | AT zhangrunqing learningenhancedfeatureresponsesforvisualobjecttracking AT fanchunxiao learningenhancedfeatureresponsesforvisualobjecttracking AT mingyue learningenhancedfeatureresponsesforvisualobjecttracking |