Cargando…

Coarse-to-Fine Adaptive People Detection for Video Sequences by Maximizing Mutual Information †

Applying people detectors to unseen data is challenging since patterns distributions, such as viewpoints, motion, poses, backgrounds, occlusions and people sizes, may significantly differ from the ones of the training dataset. In this paper, we propose a coarse-to-fine framework to adapt frame by fr...

Descripción completa

Detalles Bibliográficos
Autores principales: García-Martín, Álvaro, SanMiguel, Juan C., Martínez, José M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6339205/
https://www.ncbi.nlm.nih.gov/pubmed/30577455
http://dx.doi.org/10.3390/s19010004
_version_ 1783388584954298368
author García-Martín, Álvaro
SanMiguel, Juan C.
Martínez, José M.
author_facet García-Martín, Álvaro
SanMiguel, Juan C.
Martínez, José M.
author_sort García-Martín, Álvaro
collection PubMed
description Applying people detectors to unseen data is challenging since patterns distributions, such as viewpoints, motion, poses, backgrounds, occlusions and people sizes, may significantly differ from the ones of the training dataset. In this paper, we propose a coarse-to-fine framework to adapt frame by frame people detectors during runtime classification, without requiring any additional manually labeled ground truth apart from the offline training of the detection model. Such adaptation make use of multiple detectors mutual information, i.e., similarities and dissimilarities of detectors estimated and agreed by pair-wise correlating their outputs. Globally, the proposed adaptation discriminates between relevant instants in a video sequence, i.e., identifies the representative frames for an adaptation of the system. Locally, the proposed adaptation identifies the best configuration (i.e., detection threshold) of each detector under analysis, maximizing the mutual information to obtain the detection threshold of each detector. The proposed coarse-to-fine approach does not require training the detectors for each new scenario and uses standard people detector outputs, i.e., bounding boxes. The experimental results demonstrate that the proposed approach outperforms state-of-the-art detectors whose optimal threshold configurations are previously determined and fixed from offline training data.
format Online
Article
Text
id pubmed-6339205
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-63392052019-01-23 Coarse-to-Fine Adaptive People Detection for Video Sequences by Maximizing Mutual Information † García-Martín, Álvaro SanMiguel, Juan C. Martínez, José M. Sensors (Basel) Article Applying people detectors to unseen data is challenging since patterns distributions, such as viewpoints, motion, poses, backgrounds, occlusions and people sizes, may significantly differ from the ones of the training dataset. In this paper, we propose a coarse-to-fine framework to adapt frame by frame people detectors during runtime classification, without requiring any additional manually labeled ground truth apart from the offline training of the detection model. Such adaptation make use of multiple detectors mutual information, i.e., similarities and dissimilarities of detectors estimated and agreed by pair-wise correlating their outputs. Globally, the proposed adaptation discriminates between relevant instants in a video sequence, i.e., identifies the representative frames for an adaptation of the system. Locally, the proposed adaptation identifies the best configuration (i.e., detection threshold) of each detector under analysis, maximizing the mutual information to obtain the detection threshold of each detector. The proposed coarse-to-fine approach does not require training the detectors for each new scenario and uses standard people detector outputs, i.e., bounding boxes. The experimental results demonstrate that the proposed approach outperforms state-of-the-art detectors whose optimal threshold configurations are previously determined and fixed from offline training data. MDPI 2018-12-20 /pmc/articles/PMC6339205/ /pubmed/30577455 http://dx.doi.org/10.3390/s19010004 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
García-Martín, Álvaro
SanMiguel, Juan C.
Martínez, José M.
Coarse-to-Fine Adaptive People Detection for Video Sequences by Maximizing Mutual Information †
title Coarse-to-Fine Adaptive People Detection for Video Sequences by Maximizing Mutual Information †
title_full Coarse-to-Fine Adaptive People Detection for Video Sequences by Maximizing Mutual Information †
title_fullStr Coarse-to-Fine Adaptive People Detection for Video Sequences by Maximizing Mutual Information †
title_full_unstemmed Coarse-to-Fine Adaptive People Detection for Video Sequences by Maximizing Mutual Information †
title_short Coarse-to-Fine Adaptive People Detection for Video Sequences by Maximizing Mutual Information †
title_sort coarse-to-fine adaptive people detection for video sequences by maximizing mutual information †
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6339205/
https://www.ncbi.nlm.nih.gov/pubmed/30577455
http://dx.doi.org/10.3390/s19010004
work_keys_str_mv AT garciamartinalvaro coarsetofineadaptivepeopledetectionforvideosequencesbymaximizingmutualinformation
AT sanmigueljuanc coarsetofineadaptivepeopledetectionforvideosequencesbymaximizingmutualinformation
AT martinezjosem coarsetofineadaptivepeopledetectionforvideosequencesbymaximizingmutualinformation