Cargando…

Spatio-Temporal Scale Coded Bag-of-Words

The Bag-of-Words (BoW) framework has been widely used in action recognition tasks due to its compact and efficient feature representation. Various modifications have been made to this framework to increase its classification power. This often results in an increased complexity and reduced efficiency...

Descripción completa

Detalles Bibliográficos
Autores principales: Govender, Divina, Tapamo, Jules-Raymond
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7664889/
https://www.ncbi.nlm.nih.gov/pubmed/33182276
http://dx.doi.org/10.3390/s20216380
_version_ 1783609912771739648
author Govender, Divina
Tapamo, Jules-Raymond
author_facet Govender, Divina
Tapamo, Jules-Raymond
author_sort Govender, Divina
collection PubMed
description The Bag-of-Words (BoW) framework has been widely used in action recognition tasks due to its compact and efficient feature representation. Various modifications have been made to this framework to increase its classification power. This often results in an increased complexity and reduced efficiency. Inspired by the success of image-based scale coded BoW representations, we propose a spatio-temporal scale coded BoW (SC-BoW) for video-based recognition. This involves encoding extracted multi-scale information into BoW representations by partitioning spatio-temporal features into sub-groups based on the spatial scale from which they were extracted. We evaluate SC-BoW in two experimental setups. We first present a general pipeline to perform real-time action recognition with SC-BoW. Secondly, we apply SC-BoW onto the popular Dense Trajectory feature set. Results showed SC-BoW representations to successfully improve performance by 2–7% with low added computational cost. Notably, SC-BoW on Dense Trajectories outperformed more complex deep learning approaches. Thus, scale coding is a low-cost and low-level encoding scheme that increases classification power of the standard BoW without compromising efficiency.
format Online
Article
Text
id pubmed-7664889
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-76648892020-11-14 Spatio-Temporal Scale Coded Bag-of-Words Govender, Divina Tapamo, Jules-Raymond Sensors (Basel) Article The Bag-of-Words (BoW) framework has been widely used in action recognition tasks due to its compact and efficient feature representation. Various modifications have been made to this framework to increase its classification power. This often results in an increased complexity and reduced efficiency. Inspired by the success of image-based scale coded BoW representations, we propose a spatio-temporal scale coded BoW (SC-BoW) for video-based recognition. This involves encoding extracted multi-scale information into BoW representations by partitioning spatio-temporal features into sub-groups based on the spatial scale from which they were extracted. We evaluate SC-BoW in two experimental setups. We first present a general pipeline to perform real-time action recognition with SC-BoW. Secondly, we apply SC-BoW onto the popular Dense Trajectory feature set. Results showed SC-BoW representations to successfully improve performance by 2–7% with low added computational cost. Notably, SC-BoW on Dense Trajectories outperformed more complex deep learning approaches. Thus, scale coding is a low-cost and low-level encoding scheme that increases classification power of the standard BoW without compromising efficiency. MDPI 2020-11-09 /pmc/articles/PMC7664889/ /pubmed/33182276 http://dx.doi.org/10.3390/s20216380 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Govender, Divina
Tapamo, Jules-Raymond
Spatio-Temporal Scale Coded Bag-of-Words
title Spatio-Temporal Scale Coded Bag-of-Words
title_full Spatio-Temporal Scale Coded Bag-of-Words
title_fullStr Spatio-Temporal Scale Coded Bag-of-Words
title_full_unstemmed Spatio-Temporal Scale Coded Bag-of-Words
title_short Spatio-Temporal Scale Coded Bag-of-Words
title_sort spatio-temporal scale coded bag-of-words
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7664889/
https://www.ncbi.nlm.nih.gov/pubmed/33182276
http://dx.doi.org/10.3390/s20216380
work_keys_str_mv AT govenderdivina spatiotemporalscalecodedbagofwords
AT tapamojulesraymond spatiotemporalscalecodedbagofwords