Cargando…
Comparison between Recurrent Networks and Temporal Convolutional Networks Approaches for Skeleton-Based Action Recognition
Action recognition plays an important role in various applications such as video monitoring, automatic video indexing, crowd analysis, human-machine interaction, smart homes and personal assistive robotics. In this paper, we propose improvements to some methods for human action recognition from vide...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8001872/ https://www.ncbi.nlm.nih.gov/pubmed/33803929 http://dx.doi.org/10.3390/s21062051 |
_version_ | 1783671331608330240 |
---|---|
author | Nan, Mihai Trăscău, Mihai Florea, Adina Magda Iacob, Cezar Cătălin |
author_facet | Nan, Mihai Trăscău, Mihai Florea, Adina Magda Iacob, Cezar Cătălin |
author_sort | Nan, Mihai |
collection | PubMed |
description | Action recognition plays an important role in various applications such as video monitoring, automatic video indexing, crowd analysis, human-machine interaction, smart homes and personal assistive robotics. In this paper, we propose improvements to some methods for human action recognition from videos that work with data represented in the form of skeleton poses. These methods are based on the most widely used techniques for this problem—Graph Convolutional Networks (GCNs), Temporal Convolutional Networks (TCNs) and Recurrent Neural Networks (RNNs). Initially, the paper explores and compares different ways to extract the most relevant spatial and temporal characteristics for a sequence of frames describing an action. Based on this comparative analysis, we show how a TCN type unit can be extended to work even on the characteristics extracted from the spatial domain. To validate our approach, we test it against a benchmark often used for human action recognition problems and we show that our solution obtains comparable results to the state-of-the-art, but with a significant increase in the inference speed. |
format | Online Article Text |
id | pubmed-8001872 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-80018722021-03-28 Comparison between Recurrent Networks and Temporal Convolutional Networks Approaches for Skeleton-Based Action Recognition Nan, Mihai Trăscău, Mihai Florea, Adina Magda Iacob, Cezar Cătălin Sensors (Basel) Article Action recognition plays an important role in various applications such as video monitoring, automatic video indexing, crowd analysis, human-machine interaction, smart homes and personal assistive robotics. In this paper, we propose improvements to some methods for human action recognition from videos that work with data represented in the form of skeleton poses. These methods are based on the most widely used techniques for this problem—Graph Convolutional Networks (GCNs), Temporal Convolutional Networks (TCNs) and Recurrent Neural Networks (RNNs). Initially, the paper explores and compares different ways to extract the most relevant spatial and temporal characteristics for a sequence of frames describing an action. Based on this comparative analysis, we show how a TCN type unit can be extended to work even on the characteristics extracted from the spatial domain. To validate our approach, we test it against a benchmark often used for human action recognition problems and we show that our solution obtains comparable results to the state-of-the-art, but with a significant increase in the inference speed. MDPI 2021-03-15 /pmc/articles/PMC8001872/ /pubmed/33803929 http://dx.doi.org/10.3390/s21062051 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Nan, Mihai Trăscău, Mihai Florea, Adina Magda Iacob, Cezar Cătălin Comparison between Recurrent Networks and Temporal Convolutional Networks Approaches for Skeleton-Based Action Recognition |
title | Comparison between Recurrent Networks and Temporal Convolutional Networks Approaches for Skeleton-Based Action Recognition |
title_full | Comparison between Recurrent Networks and Temporal Convolutional Networks Approaches for Skeleton-Based Action Recognition |
title_fullStr | Comparison between Recurrent Networks and Temporal Convolutional Networks Approaches for Skeleton-Based Action Recognition |
title_full_unstemmed | Comparison between Recurrent Networks and Temporal Convolutional Networks Approaches for Skeleton-Based Action Recognition |
title_short | Comparison between Recurrent Networks and Temporal Convolutional Networks Approaches for Skeleton-Based Action Recognition |
title_sort | comparison between recurrent networks and temporal convolutional networks approaches for skeleton-based action recognition |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8001872/ https://www.ncbi.nlm.nih.gov/pubmed/33803929 http://dx.doi.org/10.3390/s21062051 |
work_keys_str_mv | AT nanmihai comparisonbetweenrecurrentnetworksandtemporalconvolutionalnetworksapproachesforskeletonbasedactionrecognition AT trascaumihai comparisonbetweenrecurrentnetworksandtemporalconvolutionalnetworksapproachesforskeletonbasedactionrecognition AT floreaadinamagda comparisonbetweenrecurrentnetworksandtemporalconvolutionalnetworksapproachesforskeletonbasedactionrecognition AT iacobcezarcatalin comparisonbetweenrecurrentnetworksandtemporalconvolutionalnetworksapproachesforskeletonbasedactionrecognition |