Cargando…

Improved Action Recognition with Separable Spatio-Temporal Attention Using Alternative Skeletal and Video Pre-Processing

The potential benefits of recognising activities of daily living from video for active and assisted living have yet to be fully untapped. These technologies can be used for behaviour understanding, and lifelogging for caregivers and end users alike. The recent publication of realistic datasets for t...

Descripción completa

Detalles Bibliográficos
Autores principales: Climent-Pérez, Pau, Florez-Revuelta, Francisco
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7867344/
https://www.ncbi.nlm.nih.gov/pubmed/33540809
http://dx.doi.org/10.3390/s21031005
_version_ 1783648281364004864
author Climent-Pérez, Pau
Florez-Revuelta, Francisco
author_facet Climent-Pérez, Pau
Florez-Revuelta, Francisco
author_sort Climent-Pérez, Pau
collection PubMed
description The potential benefits of recognising activities of daily living from video for active and assisted living have yet to be fully untapped. These technologies can be used for behaviour understanding, and lifelogging for caregivers and end users alike. The recent publication of realistic datasets for this purpose, such as the Toyota Smarthomes dataset, calls for pushing forward the efforts to improve action recognition. Using the separable spatio-temporal attention network proposed in the literature, this paper introduces a view-invariant normalisation of skeletal pose data and full activity crops for RGB data, which improve the baseline results by 9.5% (on the cross-subject experiments), outperforming state-of-the-art techniques in this field when using the original unmodified skeletal data in dataset. Our code and data are available online.
format Online
Article
Text
id pubmed-7867344
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-78673442021-02-07 Improved Action Recognition with Separable Spatio-Temporal Attention Using Alternative Skeletal and Video Pre-Processing Climent-Pérez, Pau Florez-Revuelta, Francisco Sensors (Basel) Article The potential benefits of recognising activities of daily living from video for active and assisted living have yet to be fully untapped. These technologies can be used for behaviour understanding, and lifelogging for caregivers and end users alike. The recent publication of realistic datasets for this purpose, such as the Toyota Smarthomes dataset, calls for pushing forward the efforts to improve action recognition. Using the separable spatio-temporal attention network proposed in the literature, this paper introduces a view-invariant normalisation of skeletal pose data and full activity crops for RGB data, which improve the baseline results by 9.5% (on the cross-subject experiments), outperforming state-of-the-art techniques in this field when using the original unmodified skeletal data in dataset. Our code and data are available online. MDPI 2021-02-02 /pmc/articles/PMC7867344/ /pubmed/33540809 http://dx.doi.org/10.3390/s21031005 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Climent-Pérez, Pau
Florez-Revuelta, Francisco
Improved Action Recognition with Separable Spatio-Temporal Attention Using Alternative Skeletal and Video Pre-Processing
title Improved Action Recognition with Separable Spatio-Temporal Attention Using Alternative Skeletal and Video Pre-Processing
title_full Improved Action Recognition with Separable Spatio-Temporal Attention Using Alternative Skeletal and Video Pre-Processing
title_fullStr Improved Action Recognition with Separable Spatio-Temporal Attention Using Alternative Skeletal and Video Pre-Processing
title_full_unstemmed Improved Action Recognition with Separable Spatio-Temporal Attention Using Alternative Skeletal and Video Pre-Processing
title_short Improved Action Recognition with Separable Spatio-Temporal Attention Using Alternative Skeletal and Video Pre-Processing
title_sort improved action recognition with separable spatio-temporal attention using alternative skeletal and video pre-processing
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7867344/
https://www.ncbi.nlm.nih.gov/pubmed/33540809
http://dx.doi.org/10.3390/s21031005
work_keys_str_mv AT climentperezpau improvedactionrecognitionwithseparablespatiotemporalattentionusingalternativeskeletalandvideopreprocessing
AT florezrevueltafrancisco improvedactionrecognitionwithseparablespatiotemporalattentionusingalternativeskeletalandvideopreprocessing