Cargando…

Benchmarking Deep Trackers on Aerial Videos

In recent years, deep learning-based visual object trackers have achieved state-of-the-art performance on several visual object tracking benchmarks. However, most tracking benchmarks are focused on ground level videos, whereas aerial tracking presents a new set of challenges. In this paper, we compa...

Descripción completa

Detalles Bibliográficos
Autores principales: Taufique, Abu Md Niamul, Minnehan, Breton, Savakis, Andreas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7014490/
https://www.ncbi.nlm.nih.gov/pubmed/31963879
http://dx.doi.org/10.3390/s20020547
_version_ 1783496642635235328
author Taufique, Abu Md Niamul
Minnehan, Breton
Savakis, Andreas
author_facet Taufique, Abu Md Niamul
Minnehan, Breton
Savakis, Andreas
author_sort Taufique, Abu Md Niamul
collection PubMed
description In recent years, deep learning-based visual object trackers have achieved state-of-the-art performance on several visual object tracking benchmarks. However, most tracking benchmarks are focused on ground level videos, whereas aerial tracking presents a new set of challenges. In this paper, we compare ten trackers based on deep learning techniques on four aerial datasets. We choose top performing trackers utilizing different approaches, specifically tracking by detection, discriminative correlation filters, Siamese networks and reinforcement learning. In our experiments, we use a subset of OTB2015 dataset with aerial style videos; the UAV123 dataset without synthetic sequences; the UAV20L dataset, which contains 20 long sequences; and DTB70 dataset as our benchmark datasets. We compare the advantages and disadvantages of different trackers in different tracking situations encountered in aerial data. Our findings indicate that the trackers perform significantly worse in aerial datasets compared to standard ground level videos. We attribute this effect to smaller target size, camera motion, significant camera rotation with respect to the target, out of view movement, and clutter in the form of occlusions or similar looking distractors near tracked object.
format Online
Article
Text
id pubmed-7014490
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-70144902020-03-09 Benchmarking Deep Trackers on Aerial Videos Taufique, Abu Md Niamul Minnehan, Breton Savakis, Andreas Sensors (Basel) Review In recent years, deep learning-based visual object trackers have achieved state-of-the-art performance on several visual object tracking benchmarks. However, most tracking benchmarks are focused on ground level videos, whereas aerial tracking presents a new set of challenges. In this paper, we compare ten trackers based on deep learning techniques on four aerial datasets. We choose top performing trackers utilizing different approaches, specifically tracking by detection, discriminative correlation filters, Siamese networks and reinforcement learning. In our experiments, we use a subset of OTB2015 dataset with aerial style videos; the UAV123 dataset without synthetic sequences; the UAV20L dataset, which contains 20 long sequences; and DTB70 dataset as our benchmark datasets. We compare the advantages and disadvantages of different trackers in different tracking situations encountered in aerial data. Our findings indicate that the trackers perform significantly worse in aerial datasets compared to standard ground level videos. We attribute this effect to smaller target size, camera motion, significant camera rotation with respect to the target, out of view movement, and clutter in the form of occlusions or similar looking distractors near tracked object. MDPI 2020-01-19 /pmc/articles/PMC7014490/ /pubmed/31963879 http://dx.doi.org/10.3390/s20020547 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Review
Taufique, Abu Md Niamul
Minnehan, Breton
Savakis, Andreas
Benchmarking Deep Trackers on Aerial Videos
title Benchmarking Deep Trackers on Aerial Videos
title_full Benchmarking Deep Trackers on Aerial Videos
title_fullStr Benchmarking Deep Trackers on Aerial Videos
title_full_unstemmed Benchmarking Deep Trackers on Aerial Videos
title_short Benchmarking Deep Trackers on Aerial Videos
title_sort benchmarking deep trackers on aerial videos
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7014490/
https://www.ncbi.nlm.nih.gov/pubmed/31963879
http://dx.doi.org/10.3390/s20020547
work_keys_str_mv AT taufiqueabumdniamul benchmarkingdeeptrackersonaerialvideos
AT minnehanbreton benchmarkingdeeptrackersonaerialvideos
AT savakisandreas benchmarkingdeeptrackersonaerialvideos