Cargando…
Visual Tracking Using Sparse Coding and Earth Mover's Distance
An efficient iterative Earth Mover's Distance (iEMD) algorithm for visual tracking is proposed in this paper. The Earth Mover's Distance (EMD) is used as the similarity measure to search for the optimal template candidates in feature-spatial space in a video sequence. The local sparse repr...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7805675/ https://www.ncbi.nlm.nih.gov/pubmed/33500974 http://dx.doi.org/10.3389/frobt.2018.00095 |
_version_ | 1783636354414936064 |
---|---|
author | Yao, Gang Dani, Ashwin |
author_facet | Yao, Gang Dani, Ashwin |
author_sort | Yao, Gang |
collection | PubMed |
description | An efficient iterative Earth Mover's Distance (iEMD) algorithm for visual tracking is proposed in this paper. The Earth Mover's Distance (EMD) is used as the similarity measure to search for the optimal template candidates in feature-spatial space in a video sequence. The local sparse representation is used as the appearance model for the iEMD tracker. The maximum-alignment-pooling method is used for constructing a sparse coding histogram which reduces the computational complexity of the EMD optimization. The template update algorithm based on the EMD is also presented. When the camera is mounted on a moving robot, e.g., a flying quadcopter, the camera could experience a sudden and rapid motion leading to large inter-frame movements. To ensure that the tracking algorithm converges, a gyro-aided extension of the iEMD tracker is presented, where synchronized gyroscope information is utilized to compensate for the rotation of the camera. The iEMD algorithm's performance is evaluated using eight publicly available videos from the CVPR 2013 dataset. The performance of the iEMD algorithm is compared with eight state-of-the-art tracking algorithms based on relative percentage overlap. Experimental results show that the iEMD algorithm performs robustly in the presence of illumination variation and deformation. The robustness of this algorithm for large inter-frame displacements is also illustrated. |
format | Online Article Text |
id | pubmed-7805675 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-78056752021-01-25 Visual Tracking Using Sparse Coding and Earth Mover's Distance Yao, Gang Dani, Ashwin Front Robot AI Robotics and AI An efficient iterative Earth Mover's Distance (iEMD) algorithm for visual tracking is proposed in this paper. The Earth Mover's Distance (EMD) is used as the similarity measure to search for the optimal template candidates in feature-spatial space in a video sequence. The local sparse representation is used as the appearance model for the iEMD tracker. The maximum-alignment-pooling method is used for constructing a sparse coding histogram which reduces the computational complexity of the EMD optimization. The template update algorithm based on the EMD is also presented. When the camera is mounted on a moving robot, e.g., a flying quadcopter, the camera could experience a sudden and rapid motion leading to large inter-frame movements. To ensure that the tracking algorithm converges, a gyro-aided extension of the iEMD tracker is presented, where synchronized gyroscope information is utilized to compensate for the rotation of the camera. The iEMD algorithm's performance is evaluated using eight publicly available videos from the CVPR 2013 dataset. The performance of the iEMD algorithm is compared with eight state-of-the-art tracking algorithms based on relative percentage overlap. Experimental results show that the iEMD algorithm performs robustly in the presence of illumination variation and deformation. The robustness of this algorithm for large inter-frame displacements is also illustrated. Frontiers Media S.A. 2018-08-22 /pmc/articles/PMC7805675/ /pubmed/33500974 http://dx.doi.org/10.3389/frobt.2018.00095 Text en Copyright © 2018 Yao and Dani. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Robotics and AI Yao, Gang Dani, Ashwin Visual Tracking Using Sparse Coding and Earth Mover's Distance |
title | Visual Tracking Using Sparse Coding and Earth Mover's Distance |
title_full | Visual Tracking Using Sparse Coding and Earth Mover's Distance |
title_fullStr | Visual Tracking Using Sparse Coding and Earth Mover's Distance |
title_full_unstemmed | Visual Tracking Using Sparse Coding and Earth Mover's Distance |
title_short | Visual Tracking Using Sparse Coding and Earth Mover's Distance |
title_sort | visual tracking using sparse coding and earth mover's distance |
topic | Robotics and AI |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7805675/ https://www.ncbi.nlm.nih.gov/pubmed/33500974 http://dx.doi.org/10.3389/frobt.2018.00095 |
work_keys_str_mv | AT yaogang visualtrackingusingsparsecodingandearthmoversdistance AT daniashwin visualtrackingusingsparsecodingandearthmoversdistance |