Cargando…

Hollywood 3D: What are the Best 3D Features for Action Recognition?

Action recognition “in the wild” is extremely challenging, particularly when complex 3D actions are projected down to the image plane, losing a great deal of information. The recent growth of 3D data in broadcast content and commercial depth sensors, makes it possible to overcome this. However, ther...

Descripción completa

Detalles Bibliográficos
Autores principales: Hadfield, Simon, Lebeda, Karel, Bowden, Richard
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7175691/
https://www.ncbi.nlm.nih.gov/pubmed/32355409
http://dx.doi.org/10.1007/s11263-016-0917-2
_version_ 1783524882250727424
author Hadfield, Simon
Lebeda, Karel
Bowden, Richard
author_facet Hadfield, Simon
Lebeda, Karel
Bowden, Richard
author_sort Hadfield, Simon
collection PubMed
description Action recognition “in the wild” is extremely challenging, particularly when complex 3D actions are projected down to the image plane, losing a great deal of information. The recent growth of 3D data in broadcast content and commercial depth sensors, makes it possible to overcome this. However, there is little work examining the best way to exploit this new modality. In this paper we introduce the Hollywood 3D benchmark, which is the first dataset containing “in the wild” action footage including 3D data. This dataset consists of 650 stereo video clips across 14 action classes, taken from Hollywood movies. We provide stereo calibrations and depth reconstructions for each clip. We also provide an action recognition pipeline, and propose a number of specialised depth-aware techniques including five interest point detectors and three feature descriptors. Extensive tests allow evaluation of different appearance and depth encoding schemes. Our novel techniques exploiting this depth allow us to reach performance levels more than triple those of the best baseline algorithm using only appearance information. The benchmark data, code and calibrations are all made available to the community.
format Online
Article
Text
id pubmed-7175691
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-71756912020-04-28 Hollywood 3D: What are the Best 3D Features for Action Recognition? Hadfield, Simon Lebeda, Karel Bowden, Richard Int J Comput Vis Article Action recognition “in the wild” is extremely challenging, particularly when complex 3D actions are projected down to the image plane, losing a great deal of information. The recent growth of 3D data in broadcast content and commercial depth sensors, makes it possible to overcome this. However, there is little work examining the best way to exploit this new modality. In this paper we introduce the Hollywood 3D benchmark, which is the first dataset containing “in the wild” action footage including 3D data. This dataset consists of 650 stereo video clips across 14 action classes, taken from Hollywood movies. We provide stereo calibrations and depth reconstructions for each clip. We also provide an action recognition pipeline, and propose a number of specialised depth-aware techniques including five interest point detectors and three feature descriptors. Extensive tests allow evaluation of different appearance and depth encoding schemes. Our novel techniques exploiting this depth allow us to reach performance levels more than triple those of the best baseline algorithm using only appearance information. The benchmark data, code and calibrations are all made available to the community. Springer US 2016-06-21 2017 /pmc/articles/PMC7175691/ /pubmed/32355409 http://dx.doi.org/10.1007/s11263-016-0917-2 Text en © The Author(s) 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Article
Hadfield, Simon
Lebeda, Karel
Bowden, Richard
Hollywood 3D: What are the Best 3D Features for Action Recognition?
title Hollywood 3D: What are the Best 3D Features for Action Recognition?
title_full Hollywood 3D: What are the Best 3D Features for Action Recognition?
title_fullStr Hollywood 3D: What are the Best 3D Features for Action Recognition?
title_full_unstemmed Hollywood 3D: What are the Best 3D Features for Action Recognition?
title_short Hollywood 3D: What are the Best 3D Features for Action Recognition?
title_sort hollywood 3d: what are the best 3d features for action recognition?
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7175691/
https://www.ncbi.nlm.nih.gov/pubmed/32355409
http://dx.doi.org/10.1007/s11263-016-0917-2
work_keys_str_mv AT hadfieldsimon hollywood3dwhatarethebest3dfeaturesforactionrecognition
AT lebedakarel hollywood3dwhatarethebest3dfeaturesforactionrecognition
AT bowdenrichard hollywood3dwhatarethebest3dfeaturesforactionrecognition