Cargando…

Visual Object Tracking in First Person Vision

The understanding of human-object interactions is fundamental in First Person Vision (FPV). Visual tracking algorithms which follow the objects manipulated by the camera wearer can provide useful information to effectively model such interactions. In the last years, the computer vision community has...

Descripción completa

Detalles Bibliográficos
Autores principales: Dunnhofer, Matteo, Furnari, Antonino, Farinella, Giovanni Maria, Micheloni, Christian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9816211/
https://www.ncbi.nlm.nih.gov/pubmed/36624862
http://dx.doi.org/10.1007/s11263-022-01694-6
_version_ 1784864481802715136
author Dunnhofer, Matteo
Furnari, Antonino
Farinella, Giovanni Maria
Micheloni, Christian
author_facet Dunnhofer, Matteo
Furnari, Antonino
Farinella, Giovanni Maria
Micheloni, Christian
author_sort Dunnhofer, Matteo
collection PubMed
description The understanding of human-object interactions is fundamental in First Person Vision (FPV). Visual tracking algorithms which follow the objects manipulated by the camera wearer can provide useful information to effectively model such interactions. In the last years, the computer vision community has significantly improved the performance of tracking algorithms for a large variety of target objects and scenarios. Despite a few previous attempts to exploit trackers in the FPV domain, a methodical analysis of the performance of state-of-the-art trackers is still missing. This research gap raises the question of whether current solutions can be used “off-the-shelf” or more domain-specific investigations should be carried out. This paper aims to provide answers to such questions. We present the first systematic investigation of single object tracking in FPV. Our study extensively analyses the performance of 42 algorithms including generic object trackers and baseline FPV-specific trackers. The analysis is carried out by focusing on different aspects of the FPV setting, introducing new performance measures, and in relation to FPV-specific tasks. The study is made possible through the introduction of TREK-150, a novel benchmark dataset composed of 150 densely annotated video sequences. Our results show that object tracking in FPV poses new challenges to current visual trackers. We highlight the factors causing such behavior and point out possible research directions. Despite their difficulties, we prove that trackers bring benefits to FPV downstream tasks requiring short-term object tracking. We expect that generic object tracking will gain popularity in FPV as new and FPV-specific methodologies are investigated. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s11263-022-01694-6.
format Online
Article
Text
id pubmed-9816211
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-98162112023-01-07 Visual Object Tracking in First Person Vision Dunnhofer, Matteo Furnari, Antonino Farinella, Giovanni Maria Micheloni, Christian Int J Comput Vis Article The understanding of human-object interactions is fundamental in First Person Vision (FPV). Visual tracking algorithms which follow the objects manipulated by the camera wearer can provide useful information to effectively model such interactions. In the last years, the computer vision community has significantly improved the performance of tracking algorithms for a large variety of target objects and scenarios. Despite a few previous attempts to exploit trackers in the FPV domain, a methodical analysis of the performance of state-of-the-art trackers is still missing. This research gap raises the question of whether current solutions can be used “off-the-shelf” or more domain-specific investigations should be carried out. This paper aims to provide answers to such questions. We present the first systematic investigation of single object tracking in FPV. Our study extensively analyses the performance of 42 algorithms including generic object trackers and baseline FPV-specific trackers. The analysis is carried out by focusing on different aspects of the FPV setting, introducing new performance measures, and in relation to FPV-specific tasks. The study is made possible through the introduction of TREK-150, a novel benchmark dataset composed of 150 densely annotated video sequences. Our results show that object tracking in FPV poses new challenges to current visual trackers. We highlight the factors causing such behavior and point out possible research directions. Despite their difficulties, we prove that trackers bring benefits to FPV downstream tasks requiring short-term object tracking. We expect that generic object tracking will gain popularity in FPV as new and FPV-specific methodologies are investigated. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s11263-022-01694-6. Springer US 2022-10-18 2023 /pmc/articles/PMC9816211/ /pubmed/36624862 http://dx.doi.org/10.1007/s11263-022-01694-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Dunnhofer, Matteo
Furnari, Antonino
Farinella, Giovanni Maria
Micheloni, Christian
Visual Object Tracking in First Person Vision
title Visual Object Tracking in First Person Vision
title_full Visual Object Tracking in First Person Vision
title_fullStr Visual Object Tracking in First Person Vision
title_full_unstemmed Visual Object Tracking in First Person Vision
title_short Visual Object Tracking in First Person Vision
title_sort visual object tracking in first person vision
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9816211/
https://www.ncbi.nlm.nih.gov/pubmed/36624862
http://dx.doi.org/10.1007/s11263-022-01694-6
work_keys_str_mv AT dunnhofermatteo visualobjecttrackinginfirstpersonvision
AT furnariantonino visualobjecttrackinginfirstpersonvision
AT farinellagiovannimaria visualobjecttrackinginfirstpersonvision
AT michelonichristian visualobjecttrackinginfirstpersonvision