Cargando…

Visual Tracking Using Sparse Coding and Earth Mover's Distance

An efficient iterative Earth Mover's Distance (iEMD) algorithm for visual tracking is proposed in this paper. The Earth Mover's Distance (EMD) is used as the similarity measure to search for the optimal template candidates in feature-spatial space in a video sequence. The local sparse repr...

Descripción completa

Detalles Bibliográficos
Autores principales: Yao, Gang, Dani, Ashwin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7805675/
https://www.ncbi.nlm.nih.gov/pubmed/33500974
http://dx.doi.org/10.3389/frobt.2018.00095
_version_ 1783636354414936064
author Yao, Gang
Dani, Ashwin
author_facet Yao, Gang
Dani, Ashwin
author_sort Yao, Gang
collection PubMed
description An efficient iterative Earth Mover's Distance (iEMD) algorithm for visual tracking is proposed in this paper. The Earth Mover's Distance (EMD) is used as the similarity measure to search for the optimal template candidates in feature-spatial space in a video sequence. The local sparse representation is used as the appearance model for the iEMD tracker. The maximum-alignment-pooling method is used for constructing a sparse coding histogram which reduces the computational complexity of the EMD optimization. The template update algorithm based on the EMD is also presented. When the camera is mounted on a moving robot, e.g., a flying quadcopter, the camera could experience a sudden and rapid motion leading to large inter-frame movements. To ensure that the tracking algorithm converges, a gyro-aided extension of the iEMD tracker is presented, where synchronized gyroscope information is utilized to compensate for the rotation of the camera. The iEMD algorithm's performance is evaluated using eight publicly available videos from the CVPR 2013 dataset. The performance of the iEMD algorithm is compared with eight state-of-the-art tracking algorithms based on relative percentage overlap. Experimental results show that the iEMD algorithm performs robustly in the presence of illumination variation and deformation. The robustness of this algorithm for large inter-frame displacements is also illustrated.
format Online
Article
Text
id pubmed-7805675
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-78056752021-01-25 Visual Tracking Using Sparse Coding and Earth Mover's Distance Yao, Gang Dani, Ashwin Front Robot AI Robotics and AI An efficient iterative Earth Mover's Distance (iEMD) algorithm for visual tracking is proposed in this paper. The Earth Mover's Distance (EMD) is used as the similarity measure to search for the optimal template candidates in feature-spatial space in a video sequence. The local sparse representation is used as the appearance model for the iEMD tracker. The maximum-alignment-pooling method is used for constructing a sparse coding histogram which reduces the computational complexity of the EMD optimization. The template update algorithm based on the EMD is also presented. When the camera is mounted on a moving robot, e.g., a flying quadcopter, the camera could experience a sudden and rapid motion leading to large inter-frame movements. To ensure that the tracking algorithm converges, a gyro-aided extension of the iEMD tracker is presented, where synchronized gyroscope information is utilized to compensate for the rotation of the camera. The iEMD algorithm's performance is evaluated using eight publicly available videos from the CVPR 2013 dataset. The performance of the iEMD algorithm is compared with eight state-of-the-art tracking algorithms based on relative percentage overlap. Experimental results show that the iEMD algorithm performs robustly in the presence of illumination variation and deformation. The robustness of this algorithm for large inter-frame displacements is also illustrated. Frontiers Media S.A. 2018-08-22 /pmc/articles/PMC7805675/ /pubmed/33500974 http://dx.doi.org/10.3389/frobt.2018.00095 Text en Copyright © 2018 Yao and Dani. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Robotics and AI
Yao, Gang
Dani, Ashwin
Visual Tracking Using Sparse Coding and Earth Mover's Distance
title Visual Tracking Using Sparse Coding and Earth Mover's Distance
title_full Visual Tracking Using Sparse Coding and Earth Mover's Distance
title_fullStr Visual Tracking Using Sparse Coding and Earth Mover's Distance
title_full_unstemmed Visual Tracking Using Sparse Coding and Earth Mover's Distance
title_short Visual Tracking Using Sparse Coding and Earth Mover's Distance
title_sort visual tracking using sparse coding and earth mover's distance
topic Robotics and AI
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7805675/
https://www.ncbi.nlm.nih.gov/pubmed/33500974
http://dx.doi.org/10.3389/frobt.2018.00095
work_keys_str_mv AT yaogang visualtrackingusingsparsecodingandearthmoversdistance
AT daniashwin visualtrackingusingsparsecodingandearthmoversdistance