Cargando…
Improved Action Recognition with Separable Spatio-Temporal Attention Using Alternative Skeletal and Video Pre-Processing
The potential benefits of recognising activities of daily living from video for active and assisted living have yet to be fully untapped. These technologies can be used for behaviour understanding, and lifelogging for caregivers and end users alike. The recent publication of realistic datasets for t...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7867344/ https://www.ncbi.nlm.nih.gov/pubmed/33540809 http://dx.doi.org/10.3390/s21031005 |
_version_ | 1783648281364004864 |
---|---|
author | Climent-Pérez, Pau Florez-Revuelta, Francisco |
author_facet | Climent-Pérez, Pau Florez-Revuelta, Francisco |
author_sort | Climent-Pérez, Pau |
collection | PubMed |
description | The potential benefits of recognising activities of daily living from video for active and assisted living have yet to be fully untapped. These technologies can be used for behaviour understanding, and lifelogging for caregivers and end users alike. The recent publication of realistic datasets for this purpose, such as the Toyota Smarthomes dataset, calls for pushing forward the efforts to improve action recognition. Using the separable spatio-temporal attention network proposed in the literature, this paper introduces a view-invariant normalisation of skeletal pose data and full activity crops for RGB data, which improve the baseline results by 9.5% (on the cross-subject experiments), outperforming state-of-the-art techniques in this field when using the original unmodified skeletal data in dataset. Our code and data are available online. |
format | Online Article Text |
id | pubmed-7867344 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-78673442021-02-07 Improved Action Recognition with Separable Spatio-Temporal Attention Using Alternative Skeletal and Video Pre-Processing Climent-Pérez, Pau Florez-Revuelta, Francisco Sensors (Basel) Article The potential benefits of recognising activities of daily living from video for active and assisted living have yet to be fully untapped. These technologies can be used for behaviour understanding, and lifelogging for caregivers and end users alike. The recent publication of realistic datasets for this purpose, such as the Toyota Smarthomes dataset, calls for pushing forward the efforts to improve action recognition. Using the separable spatio-temporal attention network proposed in the literature, this paper introduces a view-invariant normalisation of skeletal pose data and full activity crops for RGB data, which improve the baseline results by 9.5% (on the cross-subject experiments), outperforming state-of-the-art techniques in this field when using the original unmodified skeletal data in dataset. Our code and data are available online. MDPI 2021-02-02 /pmc/articles/PMC7867344/ /pubmed/33540809 http://dx.doi.org/10.3390/s21031005 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Climent-Pérez, Pau Florez-Revuelta, Francisco Improved Action Recognition with Separable Spatio-Temporal Attention Using Alternative Skeletal and Video Pre-Processing |
title | Improved Action Recognition with Separable Spatio-Temporal Attention Using Alternative Skeletal and Video Pre-Processing |
title_full | Improved Action Recognition with Separable Spatio-Temporal Attention Using Alternative Skeletal and Video Pre-Processing |
title_fullStr | Improved Action Recognition with Separable Spatio-Temporal Attention Using Alternative Skeletal and Video Pre-Processing |
title_full_unstemmed | Improved Action Recognition with Separable Spatio-Temporal Attention Using Alternative Skeletal and Video Pre-Processing |
title_short | Improved Action Recognition with Separable Spatio-Temporal Attention Using Alternative Skeletal and Video Pre-Processing |
title_sort | improved action recognition with separable spatio-temporal attention using alternative skeletal and video pre-processing |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7867344/ https://www.ncbi.nlm.nih.gov/pubmed/33540809 http://dx.doi.org/10.3390/s21031005 |
work_keys_str_mv | AT climentperezpau improvedactionrecognitionwithseparablespatiotemporalattentionusingalternativeskeletalandvideopreprocessing AT florezrevueltafrancisco improvedactionrecognitionwithseparablespatiotemporalattentionusingalternativeskeletalandvideopreprocessing |