Cargando…

Iterative Multiple Bounding-Box Refinements for Visual Tracking

Single-object visual tracking aims at locating a target in each video frame by predicting the bounding box of the object. Recent approaches have adopted iterative procedures to gradually refine the bounding box and locate the target in the image. In such approaches, the deep model takes as input the...

Descripción completa

Detalles Bibliográficos
Autores principales:	Cruciata, Giorgio, Lo Presti, Liliana, La Cascia, Marco
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8955588/ https://www.ncbi.nlm.nih.gov/pubmed/35324616 http://dx.doi.org/10.3390/jimaging8030061

_version_	1784676372558381056
author	Cruciata, Giorgio Lo Presti, Liliana La Cascia, Marco
author_facet	Cruciata, Giorgio Lo Presti, Liliana La Cascia, Marco
author_sort	Cruciata, Giorgio
collection	PubMed
description	Single-object visual tracking aims at locating a target in each video frame by predicting the bounding box of the object. Recent approaches have adopted iterative procedures to gradually refine the bounding box and locate the target in the image. In such approaches, the deep model takes as input the image patch corresponding to the currently estimated target bounding box, and provides as output the probability associated with each of the possible bounding box refinements, generally defined as a discrete set of linear transformations of the bounding box center and size. At each iteration, only one transformation is applied, and supervised training of the model may introduce an inherent ambiguity by giving importance priority to some transformations over the others. This paper proposes a novel formulation of the problem of selecting the bounding box refinement. It introduces the concept of non-conflicting transformations and allows applying multiple refinements to the target bounding box at each iteration without introducing ambiguities during learning of the model parameters. Empirical results demonstrate that the proposed approach improves the iterative single refinement in terms of accuracy and precision of the tracking results.
format	Online Article Text
id	pubmed-8955588
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-89555882022-03-26 Iterative Multiple Bounding-Box Refinements for Visual Tracking Cruciata, Giorgio Lo Presti, Liliana La Cascia, Marco J Imaging Article Single-object visual tracking aims at locating a target in each video frame by predicting the bounding box of the object. Recent approaches have adopted iterative procedures to gradually refine the bounding box and locate the target in the image. In such approaches, the deep model takes as input the image patch corresponding to the currently estimated target bounding box, and provides as output the probability associated with each of the possible bounding box refinements, generally defined as a discrete set of linear transformations of the bounding box center and size. At each iteration, only one transformation is applied, and supervised training of the model may introduce an inherent ambiguity by giving importance priority to some transformations over the others. This paper proposes a novel formulation of the problem of selecting the bounding box refinement. It introduces the concept of non-conflicting transformations and allows applying multiple refinements to the target bounding box at each iteration without introducing ambiguities during learning of the model parameters. Empirical results demonstrate that the proposed approach improves the iterative single refinement in terms of accuracy and precision of the tracking results. MDPI 2022-03-03 /pmc/articles/PMC8955588/ /pubmed/35324616 http://dx.doi.org/10.3390/jimaging8030061 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Cruciata, Giorgio Lo Presti, Liliana La Cascia, Marco Iterative Multiple Bounding-Box Refinements for Visual Tracking
title	Iterative Multiple Bounding-Box Refinements for Visual Tracking
title_full	Iterative Multiple Bounding-Box Refinements for Visual Tracking
title_fullStr	Iterative Multiple Bounding-Box Refinements for Visual Tracking
title_full_unstemmed	Iterative Multiple Bounding-Box Refinements for Visual Tracking
title_short	Iterative Multiple Bounding-Box Refinements for Visual Tracking
title_sort	iterative multiple bounding-box refinements for visual tracking
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8955588/ https://www.ncbi.nlm.nih.gov/pubmed/35324616 http://dx.doi.org/10.3390/jimaging8030061
work_keys_str_mv	AT cruciatagiorgio iterativemultipleboundingboxrefinementsforvisualtracking AT loprestililiana iterativemultipleboundingboxrefinementsforvisualtracking AT lacasciamarco iterativemultipleboundingboxrefinementsforvisualtracking

Iterative Multiple Bounding-Box Refinements for Visual Tracking

Ejemplares similares