A Visual Tracker Offering More Solutions
Most trackers focus solely on robustness and accuracy. Visual tracking, however, is a long-term problem with a high time limitation. A tracker that is robust, accurate, with long-term sustainability and real-time processing, is of high research value and practical significance. In this paper, we com...
Main Authors: | Zhao, Long; Ishag Mahmoud, Mubarak Adam; Ren, Honge; Zhu, Meng |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2020 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7570860/ https://www.ncbi.nlm.nih.gov/pubmed/32961752 http://dx.doi.org/10.3390/s20185374 |
_version_ | 1783597044560035840 |
---|---|
author | Zhao, Long Ishag Mahmoud, Mubarak Adam Ren, Honge Zhu, Meng |
author_facet | Zhao, Long Ishag Mahmoud, Mubarak Adam Ren, Honge Zhu, Meng |
author_sort | Zhao, Long |
collection | PubMed |
description | Most trackers focus solely on robustness and accuracy. Visual tracking, however, is a long-term problem with a high time limitation. A tracker that is robust, accurate, with long-term sustainability and real-time processing, is of high research value and practical significance. In this paper, we comprehensively consider these requirements in order to propose a new, state-of-the-art tracker with an excellent performance. EfficientNet-B0 is adopted for the first time via neural architecture search technology as the backbone network for the tracking task. This improves the network feature extraction ability and significantly reduces the number of parameters required for the tracker backbone network. In addition, maximal Distance Intersection-over-Union is set as the target estimation method, enhancing network stability and increasing the offline training convergence rate. Channel and spatial dual attention mechanisms are employed in the target classification module to improve the discrimination of the trackers. Furthermore, the conjugate gradient optimization strategy increases the speed of the online learning target classification module. A two-stage search method combined with a screening module is proposed to enable the tracker to cope with sudden target movement and reappearance following a brief disappearance. Our proposed method has an obvious speed advantage compared with pure global searching and achieves an optimal performance on OTB2015, VOT2016, VOT2018-LT, UAV-123 and LaSOT while running at over 50 FPS. |
format | Online Article Text |
id | pubmed-7570860 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-7570860 2020-10-28 A Visual Tracker Offering More Solutions Zhao, Long Ishag Mahmoud, Mubarak Adam Ren, Honge Zhu, Meng Sensors (Basel) Article MDPI 2020-09-19 /pmc/articles/PMC7570860/ /pubmed/32961752 http://dx.doi.org/10.3390/s20185374 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Zhao, Long Ishag Mahmoud, Mubarak Adam Ren, Honge Zhu, Meng A Visual Tracker Offering More Solutions |
title | A Visual Tracker Offering More Solutions |
title_full | A Visual Tracker Offering More Solutions |
title_fullStr | A Visual Tracker Offering More Solutions |
title_full_unstemmed | A Visual Tracker Offering More Solutions |
title_short | A Visual Tracker Offering More Solutions |
title_sort | visual tracker offering more solutions |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7570860/ https://www.ncbi.nlm.nih.gov/pubmed/32961752 http://dx.doi.org/10.3390/s20185374 |
work_keys_str_mv | AT zhaolong avisualtrackerofferingmoresolutions AT ishagmahmoudmubarakadam avisualtrackerofferingmoresolutions AT renhonge avisualtrackerofferingmoresolutions AT zhumeng avisualtrackerofferingmoresolutions AT zhaolong visualtrackerofferingmoresolutions AT ishagmahmoudmubarakadam visualtrackerofferingmoresolutions AT renhonge visualtrackerofferingmoresolutions AT zhumeng visualtrackerofferingmoresolutions |
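
The description field above names Distance Intersection-over-Union (DIoU) as the quantity used for target estimation. As a rough illustration only, the sketch below computes the commonly used DIoU definition (IoU minus the squared center distance divided by the squared diagonal of the smallest enclosing box) for two axis-aligned boxes. The function name `diou` and the `(x1, y1, x2, y2)` box layout are assumptions made for this example; the record does not describe the authors' implementation or how the tracker maximizes this value.

```python
def diou(box_a, box_b):
    """Distance-IoU of two axis-aligned boxes given as (x1, y1, x2, y2).

    Illustrative sketch of the standard DIoU definition; the function name
    and box layout are assumptions, not taken from the catalogued paper.
    """
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Intersection area (zero when the boxes do not overlap).
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h

    # Union area and plain IoU.
    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter
    iou = inter / union if union > 0 else 0.0

    # Squared distance between the two box centers.
    center_dist_sq = ((ax1 + ax2) / 2 - (bx1 + bx2) / 2) ** 2 \
        + ((ay1 + ay2) / 2 - (by1 + by2) / 2) ** 2

    # Squared diagonal of the smallest box enclosing both inputs.
    enclose_w = max(ax2, bx2) - min(ax1, bx1)
    enclose_h = max(ay2, by2) - min(ay1, by1)
    diag_sq = enclose_w ** 2 + enclose_h ** 2

    if diag_sq == 0:
        return iou
    return iou - center_dist_sq / diag_sq


if __name__ == "__main__":
    print(diou((0, 0, 2, 2), (0, 0, 2, 2)))  # identical boxes
    print(diou((0, 0, 2, 2), (1, 1, 3, 3)))  # partial overlap
    print(diou((0, 0, 1, 1), (2, 2, 3, 3)))  # disjoint boxes
```

Unlike plain IoU, which is zero for every pair of disjoint boxes, this value keeps decreasing as the centers move apart, so it still ranks non-overlapping candidates.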