Cargando…

Learning Soft Mask Based Feature Fusion with Channel and Spatial Attention for Robust Visual Object Tracking

We propose to improve the visual object tracking by introducing a soft mask based low-level feature fusion technique. The proposed technique is further strengthened by integrating channel and spatial attention mechanisms. The proposed approach is integrated within a Siamese framework to demonstrate...

Descripción completa

Detalles Bibliográficos
Autores principales: Fiaz, Mustansar, Mahmood, Arif, Jung, Soon Ki
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7412361/
https://www.ncbi.nlm.nih.gov/pubmed/32698339
http://dx.doi.org/10.3390/s20144021
_version_ 1783568589928792064
author Fiaz, Mustansar
Mahmood, Arif
Jung, Soon Ki
author_facet Fiaz, Mustansar
Mahmood, Arif
Jung, Soon Ki
author_sort Fiaz, Mustansar
collection PubMed
description We propose to improve the visual object tracking by introducing a soft mask based low-level feature fusion technique. The proposed technique is further strengthened by integrating channel and spatial attention mechanisms. The proposed approach is integrated within a Siamese framework to demonstrate its effectiveness for visual object tracking. The proposed soft mask is used to give more importance to the target regions as compared to the other regions to enable effective target feature representation and to increase discriminative power. The low-level feature fusion improves the tracker robustness against distractors. The channel attention is used to identify more discriminative channels for better target representation. The spatial attention complements the soft mask based approach to better localize the target objects in challenging tracking scenarios. We evaluated our proposed approach over five publicly available benchmark datasets and performed extensive comparisons with 39 state-of-the-art tracking algorithms. The proposed tracker demonstrates excellent performance compared to the existing state-of-the-art trackers.
format Online
Article
Text
id pubmed-7412361
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-74123612020-08-26 Learning Soft Mask Based Feature Fusion with Channel and Spatial Attention for Robust Visual Object Tracking Fiaz, Mustansar Mahmood, Arif Jung, Soon Ki Sensors (Basel) Article We propose to improve the visual object tracking by introducing a soft mask based low-level feature fusion technique. The proposed technique is further strengthened by integrating channel and spatial attention mechanisms. The proposed approach is integrated within a Siamese framework to demonstrate its effectiveness for visual object tracking. The proposed soft mask is used to give more importance to the target regions as compared to the other regions to enable effective target feature representation and to increase discriminative power. The low-level feature fusion improves the tracker robustness against distractors. The channel attention is used to identify more discriminative channels for better target representation. The spatial attention complements the soft mask based approach to better localize the target objects in challenging tracking scenarios. We evaluated our proposed approach over five publicly available benchmark datasets and performed extensive comparisons with 39 state-of-the-art tracking algorithms. The proposed tracker demonstrates excellent performance compared to the existing state-of-the-art trackers. MDPI 2020-07-20 /pmc/articles/PMC7412361/ /pubmed/32698339 http://dx.doi.org/10.3390/s20144021 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Fiaz, Mustansar
Mahmood, Arif
Jung, Soon Ki
Learning Soft Mask Based Feature Fusion with Channel and Spatial Attention for Robust Visual Object Tracking
title Learning Soft Mask Based Feature Fusion with Channel and Spatial Attention for Robust Visual Object Tracking
title_full Learning Soft Mask Based Feature Fusion with Channel and Spatial Attention for Robust Visual Object Tracking
title_fullStr Learning Soft Mask Based Feature Fusion with Channel and Spatial Attention for Robust Visual Object Tracking
title_full_unstemmed Learning Soft Mask Based Feature Fusion with Channel and Spatial Attention for Robust Visual Object Tracking
title_short Learning Soft Mask Based Feature Fusion with Channel and Spatial Attention for Robust Visual Object Tracking
title_sort learning soft mask based feature fusion with channel and spatial attention for robust visual object tracking
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7412361/
https://www.ncbi.nlm.nih.gov/pubmed/32698339
http://dx.doi.org/10.3390/s20144021
work_keys_str_mv AT fiazmustansar learningsoftmaskbasedfeaturefusionwithchannelandspatialattentionforrobustvisualobjecttracking
AT mahmoodarif learningsoftmaskbasedfeaturefusionwithchannelandspatialattentionforrobustvisualobjecttracking
AT jungsoonki learningsoftmaskbasedfeaturefusionwithchannelandspatialattentionforrobustvisualobjecttracking