A Visual Tracker Offering More Solutions

Most trackers focus solely on robustness and accuracy. Visual tracking, however, is a long-term problem with tight time constraints. A tracker that is robust and accurate, sustains long-term operation, and runs in real time is of high research value and practical significance. In this paper, we com...

Bibliographic Details
Main Authors: Zhao, Long, Ishag Mahmoud, Mubarak Adam, Ren, Honge, Zhu, Meng
Format: Online Article Text
Language: English
Published: MDPI 2020
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7570860/
https://www.ncbi.nlm.nih.gov/pubmed/32961752
http://dx.doi.org/10.3390/s20185374
author Zhao, Long
Ishag Mahmoud, Mubarak Adam
Ren, Honge
Zhu, Meng
collection PubMed
description Most trackers focus solely on robustness and accuracy. Visual tracking, however, is a long-term problem with tight time constraints. A tracker that is robust and accurate, sustains long-term operation, and runs in real time is of high research value and practical significance. In this paper, we comprehensively consider these requirements and propose a new, state-of-the-art tracker with excellent performance. EfficientNet-B0, a network designed through neural architecture search, is adopted for the first time as the backbone network for the tracking task. This improves the network's feature extraction ability and significantly reduces the number of parameters required for the tracker backbone network. In addition, maximal Distance Intersection-over-Union is set as the target estimation method, enhancing network stability and increasing the offline training convergence rate. Channel and spatial dual attention mechanisms are employed in the target classification module to improve the discriminative ability of the tracker. Furthermore, the conjugate gradient optimization strategy increases the speed of the online learning target classification module. A two-stage search method combined with a screening module is proposed to enable the tracker to cope with sudden target movement and reappearance following a brief disappearance. Our proposed method has an obvious speed advantage compared with pure global searching and achieves optimal performance on OTB2015, VOT2016, VOT2018-LT, UAV-123 and LaSOT while running at over 50 FPS.
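The description names Distance Intersection-over-Union (DIoU) as the quantity used for target estimation. As a rough illustration only, the sketch below computes the standard DIoU score between two axis-aligned boxes: the ordinary IoU minus the squared distance between box centers, normalized by the squared diagonal of the smallest enclosing box. The function name `diou` and the `(x1, y1, x2, y2)` box layout are assumptions made for this example; the record does not describe how the authors' network predicts or maximizes this score.

```python
def diou(box_a, box_b):
    """Distance-IoU of two axis-aligned boxes.

    Boxes are (x1, y1, x2, y2) with x1 < x2 and y1 < y2.
    This layout is an assumption for illustration, not taken from the paper.
    """
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Intersection area (zero when the boxes do not overlap).
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h

    # Plain IoU.
    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter
    iou = inter / union if union > 0 else 0.0

    # Squared Euclidean distance between the two box centers.
    rho2 = ((ax1 + ax2) / 2 - (bx1 + bx2) / 2) ** 2 \
         + ((ay1 + ay2) / 2 - (by1 + by2) / 2) ** 2

    # Squared diagonal of the smallest box enclosing both inputs.
    enc_w = max(ax2, bx2) - min(ax1, bx1)
    enc_h = max(ay2, by2) - min(ay1, by1)
    c2 = enc_w ** 2 + enc_h ** 2

    # DIoU = IoU - rho^2 / c^2; it stays informative even when IoU is zero.
    return iou - rho2 / c2 if c2 > 0 else iou
```

Unlike plain IoU, the score still varies with the center distance when the boxes do not overlap, which is one plausible reason such a measure could help training converge faster, as the description reports.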
format Online
Article
Text
id pubmed-7570860
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-7570860 2020-10-28 A Visual Tracker Offering More Solutions Zhao, Long Ishag Mahmoud, Mubarak Adam Ren, Honge Zhu, Meng Sensors (Basel) Article MDPI 2020-09-19 /pmc/articles/PMC7570860/ /pubmed/32961752 http://dx.doi.org/10.3390/s20185374 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).