Cargando…

Learning Geometry Information of Target for Visual Object Tracking with Siamese Networks

Recently, Siamese architecture has been widely used in the field of visual tracking, and has achieved great success. Most Siamese network based trackers aggregate the target information of two branches by cross-correlation. However, since the location of the sampling points in the search feature are...

Descripción completa

Detalles Bibliográficos
Autores principales:	Chen, Hang, Zhang, Weiguo, Yan, Danghui
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8659980/ https://www.ncbi.nlm.nih.gov/pubmed/34883790 http://dx.doi.org/10.3390/s21237790

_version_	1784613092664016896
author	Chen, Hang Zhang, Weiguo Yan, Danghui
author_facet	Chen, Hang Zhang, Weiguo Yan, Danghui
author_sort	Chen, Hang
collection	PubMed
description	Recently, Siamese architecture has been widely used in the field of visual tracking, and has achieved great success. Most Siamese network based trackers aggregate the target information of two branches by cross-correlation. However, since the location of the sampling points in the search feature area is pre-fixed in cross-correlation operation, these trackers suffer from either background noise influence or missing foreground information. Moreover, the cross-correlation between the template and the search area neglects the geometry information of the target. In this paper, we propose a Siamese deformable cross-correlation network to model the geometric structure of target and improve the performance of visual tracking. We propose to learn an offset field end-to-end in cross-correlation. With the guidance of the offset field, the sampling in the search image area can adapt to the deformation of the target, and realize the modeling of the geometric structure of the target. We further propose an online classification sub-network to model the variation of target appearance and enhance the robustness of the tracker. Extensive experiments are conducted on four challenging benchmarks, including OTB2015, VOT2018, VOT2019 and UAV123. The results demonstrate that our tracker achieves state-of-the-art performance.
format	Online Article Text
id	pubmed-8659980
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-86599802021-12-10 Learning Geometry Information of Target for Visual Object Tracking with Siamese Networks Chen, Hang Zhang, Weiguo Yan, Danghui Sensors (Basel) Article Recently, Siamese architecture has been widely used in the field of visual tracking, and has achieved great success. Most Siamese network based trackers aggregate the target information of two branches by cross-correlation. However, since the location of the sampling points in the search feature area is pre-fixed in cross-correlation operation, these trackers suffer from either background noise influence or missing foreground information. Moreover, the cross-correlation between the template and the search area neglects the geometry information of the target. In this paper, we propose a Siamese deformable cross-correlation network to model the geometric structure of target and improve the performance of visual tracking. We propose to learn an offset field end-to-end in cross-correlation. With the guidance of the offset field, the sampling in the search image area can adapt to the deformation of the target, and realize the modeling of the geometric structure of the target. We further propose an online classification sub-network to model the variation of target appearance and enhance the robustness of the tracker. Extensive experiments are conducted on four challenging benchmarks, including OTB2015, VOT2018, VOT2019 and UAV123. The results demonstrate that our tracker achieves state-of-the-art performance. MDPI 2021-11-23 /pmc/articles/PMC8659980/ /pubmed/34883790 http://dx.doi.org/10.3390/s21237790 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Chen, Hang Zhang, Weiguo Yan, Danghui Learning Geometry Information of Target for Visual Object Tracking with Siamese Networks
title	Learning Geometry Information of Target for Visual Object Tracking with Siamese Networks
title_full	Learning Geometry Information of Target for Visual Object Tracking with Siamese Networks
title_fullStr	Learning Geometry Information of Target for Visual Object Tracking with Siamese Networks
title_full_unstemmed	Learning Geometry Information of Target for Visual Object Tracking with Siamese Networks
title_short	Learning Geometry Information of Target for Visual Object Tracking with Siamese Networks
title_sort	learning geometry information of target for visual object tracking with siamese networks
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8659980/ https://www.ncbi.nlm.nih.gov/pubmed/34883790 http://dx.doi.org/10.3390/s21237790
work_keys_str_mv	AT chenhang learninggeometryinformationoftargetforvisualobjecttrackingwithsiamesenetworks AT zhangweiguo learninggeometryinformationoftargetforvisualobjecttrackingwithsiamesenetworks AT yandanghui learninggeometryinformationoftargetforvisualobjecttrackingwithsiamesenetworks

Learning Geometry Information of Target for Visual Object Tracking with Siamese Networks

Ejemplares similares