
Learning Soft Mask Based Feature Fusion with Channel and Spatial Attention for Robust Visual Object Tracking

We propose to improve visual object tracking by introducing a soft-mask-based low-level feature fusion technique. The technique is further strengthened by integrating channel and spatial attention mechanisms. The proposed approach is integrated within a Siamese framework to demonstrate...


Bibliographic Details
Main authors:	Fiaz, Mustansar; Mahmood, Arif; Jung, Soon Ki
Format:	Online Article Text
Language:	English
Published:	MDPI 2020
Subjects:	Article
Online access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7412361/ https://www.ncbi.nlm.nih.gov/pubmed/32698339 http://dx.doi.org/10.3390/s20144021

collection	PubMed
description	We propose to improve visual object tracking by introducing a soft-mask-based low-level feature fusion technique. The technique is further strengthened by integrating channel and spatial attention mechanisms, and the approach is integrated within a Siamese framework to demonstrate its effectiveness for visual object tracking. The soft mask gives more importance to the target regions than to other regions, enabling effective target feature representation and increasing discriminative power. The low-level feature fusion improves the tracker's robustness against distractors. The channel attention identifies the more discriminative channels for better target representation. The spatial attention complements the soft-mask-based approach to better localize target objects in challenging tracking scenarios. We evaluated the proposed approach on five publicly available benchmark datasets and performed extensive comparisons with 39 state-of-the-art tracking algorithms. The proposed tracker demonstrates excellent performance compared to existing state-of-the-art trackers.
format	Online Article Text
id	pubmed-7412361
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-7412361 2020-08-26 Sensors (Basel) Article MDPI 2020-07-20 /pmc/articles/PMC7412361/ /pubmed/32698339 http://dx.doi.org/10.3390/s20144021 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
title	Learning Soft Mask Based Feature Fusion with Channel and Spatial Attention for Robust Visual Object Tracking
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7412361/ https://www.ncbi.nlm.nih.gov/pubmed/32698339 http://dx.doi.org/10.3390/s20144021
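
As a rough illustration of the pipeline named in the description field (a soft mask that favors the target region, followed by channel and spatial attention), the following is a minimal sketch and not the authors' implementation. This record does not say how the soft mask is constructed or how the attention modules are parameterized, so everything here is an assumption: the constant `background_weight`, the box-shaped mask, and the parameter-free sigmoid gates stand in for what would be learned layers in a real tracker. All function names are hypothetical.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def soft_mask(height, width, box, background_weight=0.3):
    """Assumed mask: weight 1.0 inside the target box, a smaller constant outside."""
    top, left, bottom, right = box
    return [[1.0 if top <= r < bottom and left <= c < right else background_weight
             for c in range(width)] for r in range(height)]

def apply_mask(features, mask):
    """Multiply every channel of a [channel][row][col] feature map by the mask."""
    return [[[v * m for v, m in zip(row, mask_row)]
             for row, mask_row in zip(channel, mask)] for channel in features]

def channel_attention(features):
    """Gate each channel by the sigmoid of its global average (no learned weights)."""
    out = []
    for channel in features:
        count = len(channel) * len(channel[0])
        gate = sigmoid(sum(sum(row) for row in channel) / count)
        out.append([[v * gate for v in row] for row in channel])
    return out

def spatial_attention(features):
    """Gate each location by the sigmoid of its mean across channels."""
    n = len(features)
    h, w = len(features[0]), len(features[0][0])
    gate = [[sigmoid(sum(ch[r][c] for ch in features) / n) for c in range(w)]
            for r in range(h)]
    return [[[ch[r][c] * gate[r][c] for c in range(w)] for r in range(h)]
            for ch in features]

# Toy input: two 4x4 channels with constant activations.
features = [[[1.0] * 4 for _ in range(4)], [[2.0] * 4 for _ in range(4)]]
mask = soft_mask(4, 4, box=(1, 1, 3, 3))
masked = apply_mask(features, mask)
attended = spatial_attention(channel_attention(masked))
```

In this toy setting the locations inside the box end up with larger responses than the background after both attention steps, which is the qualitative effect the abstract attributes to combining the soft mask with attention.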
