Cargando…

Comparison between Recurrent Networks and Temporal Convolutional Networks Approaches for Skeleton-Based Action Recognition

Action recognition plays an important role in various applications such as video monitoring, automatic video indexing, crowd analysis, human-machine interaction, smart homes and personal assistive robotics. In this paper, we propose improvements to some methods for human action recognition from vide...

Descripción completa

Detalles Bibliográficos
Autores principales: Nan, Mihai, Trăscău, Mihai, Florea, Adina Magda, Iacob, Cezar Cătălin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8001872/
https://www.ncbi.nlm.nih.gov/pubmed/33803929
http://dx.doi.org/10.3390/s21062051
_version_ 1783671331608330240
author Nan, Mihai
Trăscău, Mihai
Florea, Adina Magda
Iacob, Cezar Cătălin
author_facet Nan, Mihai
Trăscău, Mihai
Florea, Adina Magda
Iacob, Cezar Cătălin
author_sort Nan, Mihai
collection PubMed
description Action recognition plays an important role in various applications such as video monitoring, automatic video indexing, crowd analysis, human-machine interaction, smart homes and personal assistive robotics. In this paper, we propose improvements to some methods for human action recognition from videos that work with data represented in the form of skeleton poses. These methods are based on the most widely used techniques for this problem—Graph Convolutional Networks (GCNs), Temporal Convolutional Networks (TCNs) and Recurrent Neural Networks (RNNs). Initially, the paper explores and compares different ways to extract the most relevant spatial and temporal characteristics for a sequence of frames describing an action. Based on this comparative analysis, we show how a TCN type unit can be extended to work even on the characteristics extracted from the spatial domain. To validate our approach, we test it against a benchmark often used for human action recognition problems and we show that our solution obtains comparable results to the state-of-the-art, but with a significant increase in the inference speed.
format Online
Article
Text
id pubmed-8001872
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-80018722021-03-28 Comparison between Recurrent Networks and Temporal Convolutional Networks Approaches for Skeleton-Based Action Recognition Nan, Mihai Trăscău, Mihai Florea, Adina Magda Iacob, Cezar Cătălin Sensors (Basel) Article Action recognition plays an important role in various applications such as video monitoring, automatic video indexing, crowd analysis, human-machine interaction, smart homes and personal assistive robotics. In this paper, we propose improvements to some methods for human action recognition from videos that work with data represented in the form of skeleton poses. These methods are based on the most widely used techniques for this problem—Graph Convolutional Networks (GCNs), Temporal Convolutional Networks (TCNs) and Recurrent Neural Networks (RNNs). Initially, the paper explores and compares different ways to extract the most relevant spatial and temporal characteristics for a sequence of frames describing an action. Based on this comparative analysis, we show how a TCN type unit can be extended to work even on the characteristics extracted from the spatial domain. To validate our approach, we test it against a benchmark often used for human action recognition problems and we show that our solution obtains comparable results to the state-of-the-art, but with a significant increase in the inference speed. MDPI 2021-03-15 /pmc/articles/PMC8001872/ /pubmed/33803929 http://dx.doi.org/10.3390/s21062051 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Nan, Mihai
Trăscău, Mihai
Florea, Adina Magda
Iacob, Cezar Cătălin
Comparison between Recurrent Networks and Temporal Convolutional Networks Approaches for Skeleton-Based Action Recognition
title Comparison between Recurrent Networks and Temporal Convolutional Networks Approaches for Skeleton-Based Action Recognition
title_full Comparison between Recurrent Networks and Temporal Convolutional Networks Approaches for Skeleton-Based Action Recognition
title_fullStr Comparison between Recurrent Networks and Temporal Convolutional Networks Approaches for Skeleton-Based Action Recognition
title_full_unstemmed Comparison between Recurrent Networks and Temporal Convolutional Networks Approaches for Skeleton-Based Action Recognition
title_short Comparison between Recurrent Networks and Temporal Convolutional Networks Approaches for Skeleton-Based Action Recognition
title_sort comparison between recurrent networks and temporal convolutional networks approaches for skeleton-based action recognition
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8001872/
https://www.ncbi.nlm.nih.gov/pubmed/33803929
http://dx.doi.org/10.3390/s21062051
work_keys_str_mv AT nanmihai comparisonbetweenrecurrentnetworksandtemporalconvolutionalnetworksapproachesforskeletonbasedactionrecognition
AT trascaumihai comparisonbetweenrecurrentnetworksandtemporalconvolutionalnetworksapproachesforskeletonbasedactionrecognition
AT floreaadinamagda comparisonbetweenrecurrentnetworksandtemporalconvolutionalnetworksapproachesforskeletonbasedactionrecognition
AT iacobcezarcatalin comparisonbetweenrecurrentnetworksandtemporalconvolutionalnetworksapproachesforskeletonbasedactionrecognition