Cargando…

Unsupervised Action Proposals Using Support Vector Classifiers for Online Video Processing

In this work, we introduce an intelligent video sensor for the problem of Action Proposals (AP). AP consists of localizing temporal segments in untrimmed videos that are likely to contain actions. Solving this problem can accelerate several video action understanding tasks, such as detection, retrie...

Descripción completa

Detalles Bibliográficos
Autores principales:	Baptista Ríos, Marcos, López-Sastre, Roberto Javier, Acevedo-Rodríguez, Francisco Javier, Martín-Martín, Pilar, Maldonado-Bascón, Saturnino
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7285365/ https://www.ncbi.nlm.nih.gov/pubmed/32456050 http://dx.doi.org/10.3390/s20102953

_version_	1783544683354390528
author	Baptista Ríos, Marcos López-Sastre, Roberto Javier Acevedo-Rodríguez, Francisco Javier Martín-Martín, Pilar Maldonado-Bascón, Saturnino
author_facet	Baptista Ríos, Marcos López-Sastre, Roberto Javier Acevedo-Rodríguez, Francisco Javier Martín-Martín, Pilar Maldonado-Bascón, Saturnino
author_sort	Baptista Ríos, Marcos
collection	PubMed
description	In this work, we introduce an intelligent video sensor for the problem of Action Proposals (AP). AP consists of localizing temporal segments in untrimmed videos that are likely to contain actions. Solving this problem can accelerate several video action understanding tasks, such as detection, retrieval, or indexing. All previous AP approaches are supervised and offline, i.e., they need both the temporal annotations of the datasets during training and access to the whole video to effectively cast the proposals. We propose here a new approach which, unlike the rest of the state-of-the-art models, is unsupervised. This implies that we do not allow it to see any labeled data during learning nor to work with any pre-trained feature on the used dataset. Moreover, our approach also operates in an online manner, which can be beneficial for many real-world applications where the video has to be processed as soon as it arrives at the sensor, e.g., robotics or video monitoring. The core of our method is based on a Support Vector Classifier (SVC) module which produces candidate segments for AP by distinguishing between sets of contiguous video frames. We further propose a mechanism to refine and filter those candidate segments. This filter optimizes a learning-to-rank formulation over the dynamics of the segments. An extensive experimental evaluation is conducted on Thumos’14 and ActivityNet datasets, and, to the best of our knowledge, this work supposes the first unsupervised approach on these main AP benchmarks. Finally, we also provide a thorough comparison to the current state-of-the-art supervised AP approaches. We achieve 41% and 59% of the performance of the best-supervised model on ActivityNet and Thumos’14, respectively, confirming our unsupervised solution as a correct option to tackle the AP problem. The code to reproduce all our results will be publicly released upon acceptance of the paper.
format	Online Article Text
id	pubmed-7285365
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-72853652020-06-15 Unsupervised Action Proposals Using Support Vector Classifiers for Online Video Processing Baptista Ríos, Marcos López-Sastre, Roberto Javier Acevedo-Rodríguez, Francisco Javier Martín-Martín, Pilar Maldonado-Bascón, Saturnino Sensors (Basel) Article In this work, we introduce an intelligent video sensor for the problem of Action Proposals (AP). AP consists of localizing temporal segments in untrimmed videos that are likely to contain actions. Solving this problem can accelerate several video action understanding tasks, such as detection, retrieval, or indexing. All previous AP approaches are supervised and offline, i.e., they need both the temporal annotations of the datasets during training and access to the whole video to effectively cast the proposals. We propose here a new approach which, unlike the rest of the state-of-the-art models, is unsupervised. This implies that we do not allow it to see any labeled data during learning nor to work with any pre-trained feature on the used dataset. Moreover, our approach also operates in an online manner, which can be beneficial for many real-world applications where the video has to be processed as soon as it arrives at the sensor, e.g., robotics or video monitoring. The core of our method is based on a Support Vector Classifier (SVC) module which produces candidate segments for AP by distinguishing between sets of contiguous video frames. We further propose a mechanism to refine and filter those candidate segments. This filter optimizes a learning-to-rank formulation over the dynamics of the segments. An extensive experimental evaluation is conducted on Thumos’14 and ActivityNet datasets, and, to the best of our knowledge, this work supposes the first unsupervised approach on these main AP benchmarks. Finally, we also provide a thorough comparison to the current state-of-the-art supervised AP approaches. We achieve 41% and 59% of the performance of the best-supervised model on ActivityNet and Thumos’14, respectively, confirming our unsupervised solution as a correct option to tackle the AP problem. The code to reproduce all our results will be publicly released upon acceptance of the paper. MDPI 2020-05-22 /pmc/articles/PMC7285365/ /pubmed/32456050 http://dx.doi.org/10.3390/s20102953 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Baptista Ríos, Marcos López-Sastre, Roberto Javier Acevedo-Rodríguez, Francisco Javier Martín-Martín, Pilar Maldonado-Bascón, Saturnino Unsupervised Action Proposals Using Support Vector Classifiers for Online Video Processing
title	Unsupervised Action Proposals Using Support Vector Classifiers for Online Video Processing
title_full	Unsupervised Action Proposals Using Support Vector Classifiers for Online Video Processing
title_fullStr	Unsupervised Action Proposals Using Support Vector Classifiers for Online Video Processing
title_full_unstemmed	Unsupervised Action Proposals Using Support Vector Classifiers for Online Video Processing
title_short	Unsupervised Action Proposals Using Support Vector Classifiers for Online Video Processing
title_sort	unsupervised action proposals using support vector classifiers for online video processing
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7285365/ https://www.ncbi.nlm.nih.gov/pubmed/32456050 http://dx.doi.org/10.3390/s20102953
work_keys_str_mv	AT baptistariosmarcos unsupervisedactionproposalsusingsupportvectorclassifiersforonlinevideoprocessing AT lopezsastrerobertojavier unsupervisedactionproposalsusingsupportvectorclassifiersforonlinevideoprocessing AT acevedorodriguezfranciscojavier unsupervisedactionproposalsusingsupportvectorclassifiersforonlinevideoprocessing AT martinmartinpilar unsupervisedactionproposalsusingsupportvectorclassifiersforonlinevideoprocessing AT maldonadobasconsaturnino unsupervisedactionproposalsusingsupportvectorclassifiersforonlinevideoprocessing

Unsupervised Action Proposals Using Support Vector Classifiers for Online Video Processing

Ejemplares similares