Cargando…

Multiple Objects Fusion Tracker Using a Matching Network for Adaptively Represented Instance Pairs

Multiple-object tracking is affected by various sources of distortion, such as occlusion, illumination variations and motion changes. Overcoming these distortions by tracking on RGB frames, such as shifting, has limitations because of material distortions caused by RGB frames. To overcome these dist...

Descripción completa

Detalles Bibliográficos
Autores principales: Oh, Sang-Il, Kang, Hang-Bong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5424760/
https://www.ncbi.nlm.nih.gov/pubmed/28420194
http://dx.doi.org/10.3390/s17040883
_version_ 1783235187241385984
author Oh, Sang-Il
Kang, Hang-Bong
author_facet Oh, Sang-Il
Kang, Hang-Bong
author_sort Oh, Sang-Il
collection PubMed
description Multiple-object tracking is affected by various sources of distortion, such as occlusion, illumination variations and motion changes. Overcoming these distortions by tracking on RGB frames, such as shifting, has limitations because of material distortions caused by RGB frames. To overcome these distortions, we propose a multiple-object fusion tracker (MOFT), which uses a combination of 3D point clouds and corresponding RGB frames. The MOFT uses a matching function initialized on large-scale external sequences to determine which candidates in the current frame match with the target object in the previous frame. After conducting tracking on a few frames, the initialized matching function is fine-tuned according to the appearance models of target objects. The fine-tuning process of the matching function is constructed as a structured form with diverse matching function branches. In general multiple object tracking situations, scale variations for a scene occur depending on the distance between the target objects and the sensors. If the target objects in various scales are equally represented with the same strategy, information losses will occur for any representation of the target objects. In this paper, the output map of the convolutional layer obtained from a pre-trained convolutional neural network is used to adaptively represent instances without information loss. In addition, MOFT fuses the tracking results obtained from each modality at the decision level to compensate the tracking failures of each modality using basic belief assignment, rather than fusing modalities by selectively using the features of each modality. Experimental results indicate that the proposed tracker provides state-of-the-art performance considering multiple objects tracking (MOT) and KITTIbenchmarks.
format Online
Article
Text
id pubmed-5424760
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-54247602017-05-12 Multiple Objects Fusion Tracker Using a Matching Network for Adaptively Represented Instance Pairs Oh, Sang-Il Kang, Hang-Bong Sensors (Basel) Article Multiple-object tracking is affected by various sources of distortion, such as occlusion, illumination variations and motion changes. Overcoming these distortions by tracking on RGB frames, such as shifting, has limitations because of material distortions caused by RGB frames. To overcome these distortions, we propose a multiple-object fusion tracker (MOFT), which uses a combination of 3D point clouds and corresponding RGB frames. The MOFT uses a matching function initialized on large-scale external sequences to determine which candidates in the current frame match with the target object in the previous frame. After conducting tracking on a few frames, the initialized matching function is fine-tuned according to the appearance models of target objects. The fine-tuning process of the matching function is constructed as a structured form with diverse matching function branches. In general multiple object tracking situations, scale variations for a scene occur depending on the distance between the target objects and the sensors. If the target objects in various scales are equally represented with the same strategy, information losses will occur for any representation of the target objects. In this paper, the output map of the convolutional layer obtained from a pre-trained convolutional neural network is used to adaptively represent instances without information loss. In addition, MOFT fuses the tracking results obtained from each modality at the decision level to compensate the tracking failures of each modality using basic belief assignment, rather than fusing modalities by selectively using the features of each modality. Experimental results indicate that the proposed tracker provides state-of-the-art performance considering multiple objects tracking (MOT) and KITTIbenchmarks. MDPI 2017-04-18 /pmc/articles/PMC5424760/ /pubmed/28420194 http://dx.doi.org/10.3390/s17040883 Text en © 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Oh, Sang-Il
Kang, Hang-Bong
Multiple Objects Fusion Tracker Using a Matching Network for Adaptively Represented Instance Pairs
title Multiple Objects Fusion Tracker Using a Matching Network for Adaptively Represented Instance Pairs
title_full Multiple Objects Fusion Tracker Using a Matching Network for Adaptively Represented Instance Pairs
title_fullStr Multiple Objects Fusion Tracker Using a Matching Network for Adaptively Represented Instance Pairs
title_full_unstemmed Multiple Objects Fusion Tracker Using a Matching Network for Adaptively Represented Instance Pairs
title_short Multiple Objects Fusion Tracker Using a Matching Network for Adaptively Represented Instance Pairs
title_sort multiple objects fusion tracker using a matching network for adaptively represented instance pairs
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5424760/
https://www.ncbi.nlm.nih.gov/pubmed/28420194
http://dx.doi.org/10.3390/s17040883
work_keys_str_mv AT ohsangil multipleobjectsfusiontrackerusingamatchingnetworkforadaptivelyrepresentedinstancepairs
AT kanghangbong multipleobjectsfusiontrackerusingamatchingnetworkforadaptivelyrepresentedinstancepairs