Cargando…
Learning Soft Mask Based Feature Fusion with Channel and Spatial Attention for Robust Visual Object Tracking
We propose to improve the visual object tracking by introducing a soft mask based low-level feature fusion technique. The proposed technique is further strengthened by integrating channel and spatial attention mechanisms. The proposed approach is integrated within a Siamese framework to demonstrate...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7412361/ https://www.ncbi.nlm.nih.gov/pubmed/32698339 http://dx.doi.org/10.3390/s20144021 |
_version_ | 1783568589928792064 |
---|---|
author | Fiaz, Mustansar Mahmood, Arif Jung, Soon Ki |
author_facet | Fiaz, Mustansar Mahmood, Arif Jung, Soon Ki |
author_sort | Fiaz, Mustansar |
collection | PubMed |
description | We propose to improve the visual object tracking by introducing a soft mask based low-level feature fusion technique. The proposed technique is further strengthened by integrating channel and spatial attention mechanisms. The proposed approach is integrated within a Siamese framework to demonstrate its effectiveness for visual object tracking. The proposed soft mask is used to give more importance to the target regions as compared to the other regions to enable effective target feature representation and to increase discriminative power. The low-level feature fusion improves the tracker robustness against distractors. The channel attention is used to identify more discriminative channels for better target representation. The spatial attention complements the soft mask based approach to better localize the target objects in challenging tracking scenarios. We evaluated our proposed approach over five publicly available benchmark datasets and performed extensive comparisons with 39 state-of-the-art tracking algorithms. The proposed tracker demonstrates excellent performance compared to the existing state-of-the-art trackers. |
format | Online Article Text |
id | pubmed-7412361 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-74123612020-08-26 Learning Soft Mask Based Feature Fusion with Channel and Spatial Attention for Robust Visual Object Tracking Fiaz, Mustansar Mahmood, Arif Jung, Soon Ki Sensors (Basel) Article We propose to improve the visual object tracking by introducing a soft mask based low-level feature fusion technique. The proposed technique is further strengthened by integrating channel and spatial attention mechanisms. The proposed approach is integrated within a Siamese framework to demonstrate its effectiveness for visual object tracking. The proposed soft mask is used to give more importance to the target regions as compared to the other regions to enable effective target feature representation and to increase discriminative power. The low-level feature fusion improves the tracker robustness against distractors. The channel attention is used to identify more discriminative channels for better target representation. The spatial attention complements the soft mask based approach to better localize the target objects in challenging tracking scenarios. We evaluated our proposed approach over five publicly available benchmark datasets and performed extensive comparisons with 39 state-of-the-art tracking algorithms. The proposed tracker demonstrates excellent performance compared to the existing state-of-the-art trackers. MDPI 2020-07-20 /pmc/articles/PMC7412361/ /pubmed/32698339 http://dx.doi.org/10.3390/s20144021 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Fiaz, Mustansar Mahmood, Arif Jung, Soon Ki Learning Soft Mask Based Feature Fusion with Channel and Spatial Attention for Robust Visual Object Tracking |
title | Learning Soft Mask Based Feature Fusion with Channel and Spatial Attention for Robust Visual Object Tracking |
title_full | Learning Soft Mask Based Feature Fusion with Channel and Spatial Attention for Robust Visual Object Tracking |
title_fullStr | Learning Soft Mask Based Feature Fusion with Channel and Spatial Attention for Robust Visual Object Tracking |
title_full_unstemmed | Learning Soft Mask Based Feature Fusion with Channel and Spatial Attention for Robust Visual Object Tracking |
title_short | Learning Soft Mask Based Feature Fusion with Channel and Spatial Attention for Robust Visual Object Tracking |
title_sort | learning soft mask based feature fusion with channel and spatial attention for robust visual object tracking |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7412361/ https://www.ncbi.nlm.nih.gov/pubmed/32698339 http://dx.doi.org/10.3390/s20144021 |
work_keys_str_mv | AT fiazmustansar learningsoftmaskbasedfeaturefusionwithchannelandspatialattentionforrobustvisualobjecttracking AT mahmoodarif learningsoftmaskbasedfeaturefusionwithchannelandspatialattentionforrobustvisualobjecttracking AT jungsoonki learningsoftmaskbasedfeaturefusionwithchannelandspatialattentionforrobustvisualobjecttracking |