Cargando…

Deep Attention Models for Human Tracking Using RGBD

Visual tracking performance has long been limited by the lack of better appearance models. These models fail either where they tend to change rapidly, like in motion-based tracking, or where accurate information of the object may not be available, like in color camouflage (where background and foreg...

Descripción completa

Detalles Bibliográficos
Autores principales:	Rasoulidanesh, Maryamsadat, Yadav, Srishti, Herath, Sachini, Vaghei, Yasaman, Payandeh, Shahram
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2019
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6412970/ https://www.ncbi.nlm.nih.gov/pubmed/30781737 http://dx.doi.org/10.3390/s19040750

_version_	1783402729814622208
author	Rasoulidanesh, Maryamsadat Yadav, Srishti Herath, Sachini Vaghei, Yasaman Payandeh, Shahram
author_facet	Rasoulidanesh, Maryamsadat Yadav, Srishti Herath, Sachini Vaghei, Yasaman Payandeh, Shahram
author_sort	Rasoulidanesh, Maryamsadat
collection	PubMed
description	Visual tracking performance has long been limited by the lack of better appearance models. These models fail either where they tend to change rapidly, like in motion-based tracking, or where accurate information of the object may not be available, like in color camouflage (where background and foreground colors are similar). This paper proposes a robust, adaptive appearance model which works accurately in situations of color camouflage, even in the presence of complex natural objects. The proposed model includes depth as an additional feature in a hierarchical modular neural framework for online object tracking. The model adapts to the confusing appearance by identifying the stable property of depth between the target and the surrounding object(s). The depth complements the existing RGB features in scenarios when RGB features fail to adapt, hence becoming unstable over a long duration of time. The parameters of the model are learned efficiently in the Deep network, which consists of three modules: (1) The spatial attention layer, which discards the majority of the background by selecting a region containing the object of interest; (2) the appearance attention layer, which extracts appearance and spatial information about the tracked object; and (3) the state estimation layer, which enables the framework to predict future object appearance and location. Three different models were trained and tested to analyze the effect of depth along with RGB information. Also, a model is proposed to utilize only depth as a standalone input for tracking purposes. The proposed models were also evaluated in real-time using KinectV2 and showed very promising results. The results of our proposed network structures and their comparison with the state-of-the-art RGB tracking model demonstrate that adding depth significantly improves the accuracy of tracking in a more challenging environment (i.e., cluttered and camouflaged environments). Furthermore, the results of depth-based models showed that depth data can provide enough information for accurate tracking, even without RGB information.
format	Online Article Text
id	pubmed-6412970
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-64129702019-04-03 Deep Attention Models for Human Tracking Using RGBD Rasoulidanesh, Maryamsadat Yadav, Srishti Herath, Sachini Vaghei, Yasaman Payandeh, Shahram Sensors (Basel) Article Visual tracking performance has long been limited by the lack of better appearance models. These models fail either where they tend to change rapidly, like in motion-based tracking, or where accurate information of the object may not be available, like in color camouflage (where background and foreground colors are similar). This paper proposes a robust, adaptive appearance model which works accurately in situations of color camouflage, even in the presence of complex natural objects. The proposed model includes depth as an additional feature in a hierarchical modular neural framework for online object tracking. The model adapts to the confusing appearance by identifying the stable property of depth between the target and the surrounding object(s). The depth complements the existing RGB features in scenarios when RGB features fail to adapt, hence becoming unstable over a long duration of time. The parameters of the model are learned efficiently in the Deep network, which consists of three modules: (1) The spatial attention layer, which discards the majority of the background by selecting a region containing the object of interest; (2) the appearance attention layer, which extracts appearance and spatial information about the tracked object; and (3) the state estimation layer, which enables the framework to predict future object appearance and location. Three different models were trained and tested to analyze the effect of depth along with RGB information. Also, a model is proposed to utilize only depth as a standalone input for tracking purposes. The proposed models were also evaluated in real-time using KinectV2 and showed very promising results. The results of our proposed network structures and their comparison with the state-of-the-art RGB tracking model demonstrate that adding depth significantly improves the accuracy of tracking in a more challenging environment (i.e., cluttered and camouflaged environments). Furthermore, the results of depth-based models showed that depth data can provide enough information for accurate tracking, even without RGB information. MDPI 2019-02-13 /pmc/articles/PMC6412970/ /pubmed/30781737 http://dx.doi.org/10.3390/s19040750 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Rasoulidanesh, Maryamsadat Yadav, Srishti Herath, Sachini Vaghei, Yasaman Payandeh, Shahram Deep Attention Models for Human Tracking Using RGBD
title	Deep Attention Models for Human Tracking Using RGBD
title_full	Deep Attention Models for Human Tracking Using RGBD
title_fullStr	Deep Attention Models for Human Tracking Using RGBD
title_full_unstemmed	Deep Attention Models for Human Tracking Using RGBD
title_short	Deep Attention Models for Human Tracking Using RGBD
title_sort	deep attention models for human tracking using rgbd
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6412970/ https://www.ncbi.nlm.nih.gov/pubmed/30781737 http://dx.doi.org/10.3390/s19040750
work_keys_str_mv	AT rasoulidaneshmaryamsadat deepattentionmodelsforhumantrackingusingrgbd AT yadavsrishti deepattentionmodelsforhumantrackingusingrgbd AT herathsachini deepattentionmodelsforhumantrackingusingrgbd AT vagheiyasaman deepattentionmodelsforhumantrackingusingrgbd AT payandehshahram deepattentionmodelsforhumantrackingusingrgbd

Deep Attention Models for Human Tracking Using RGBD

Ejemplares similares