Cargando…

Enhanced Spatial and Extended Temporal Graph Convolutional Network for Skeleton-Based Action Recognition

In the skeleton-based human action recognition domain, the spatial-temporal graph convolution networks (ST-GCNs) have made great progress recently. However, they use only one fixed temporal convolution kernel, which is not enough to extract the temporal cues comprehensively. Moreover, simply connect...

Descripción completa

Detalles Bibliográficos
Autores principales:	Li, Fanjia, Li, Juanjuan, Zhu, Aichun, Xu, Yonggang, Yin, Hongsheng, Hua, Gang
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7571203/ https://www.ncbi.nlm.nih.gov/pubmed/32942579 http://dx.doi.org/10.3390/s20185260

_version_	1783597123245178880
author	Li, Fanjia Li, Juanjuan Zhu, Aichun Xu, Yonggang Yin, Hongsheng Hua, Gang
author_facet	Li, Fanjia Li, Juanjuan Zhu, Aichun Xu, Yonggang Yin, Hongsheng Hua, Gang
author_sort	Li, Fanjia
collection	PubMed
description	In the skeleton-based human action recognition domain, the spatial-temporal graph convolution networks (ST-GCNs) have made great progress recently. However, they use only one fixed temporal convolution kernel, which is not enough to extract the temporal cues comprehensively. Moreover, simply connecting the spatial graph convolution layer (GCL) and the temporal GCL in series is not the optimal solution. To this end, we propose a novel enhanced spatial and extended temporal graph convolutional network (EE-GCN) in this paper. Three convolution kernels with different sizes are chosen to extract the discriminative temporal features from shorter to longer terms. The corresponding GCLs are then concatenated by a powerful yet efficient one-shot aggregation (OSA) + effective squeeze-excitation (eSE) structure. The OSA module aggregates the features from each layer once to the output, and the eSE module explores the interdependency between the channels of the output. Besides, we propose a new connection paradigm to enhance the spatial features, which expand the serial connection to a combination of serial and parallel connections by adding a spatial GCL in parallel with the temporal GCLs. The proposed method is evaluated on three large scale datasets, and the experimental results show that the performance of our method exceeds previous state-of-the-art methods.
format	Online Article Text
id	pubmed-7571203
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-75712032020-10-28 Enhanced Spatial and Extended Temporal Graph Convolutional Network for Skeleton-Based Action Recognition Li, Fanjia Li, Juanjuan Zhu, Aichun Xu, Yonggang Yin, Hongsheng Hua, Gang Sensors (Basel) Article In the skeleton-based human action recognition domain, the spatial-temporal graph convolution networks (ST-GCNs) have made great progress recently. However, they use only one fixed temporal convolution kernel, which is not enough to extract the temporal cues comprehensively. Moreover, simply connecting the spatial graph convolution layer (GCL) and the temporal GCL in series is not the optimal solution. To this end, we propose a novel enhanced spatial and extended temporal graph convolutional network (EE-GCN) in this paper. Three convolution kernels with different sizes are chosen to extract the discriminative temporal features from shorter to longer terms. The corresponding GCLs are then concatenated by a powerful yet efficient one-shot aggregation (OSA) + effective squeeze-excitation (eSE) structure. The OSA module aggregates the features from each layer once to the output, and the eSE module explores the interdependency between the channels of the output. Besides, we propose a new connection paradigm to enhance the spatial features, which expand the serial connection to a combination of serial and parallel connections by adding a spatial GCL in parallel with the temporal GCLs. The proposed method is evaluated on three large scale datasets, and the experimental results show that the performance of our method exceeds previous state-of-the-art methods. MDPI 2020-09-15 /pmc/articles/PMC7571203/ /pubmed/32942579 http://dx.doi.org/10.3390/s20185260 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Li, Fanjia Li, Juanjuan Zhu, Aichun Xu, Yonggang Yin, Hongsheng Hua, Gang Enhanced Spatial and Extended Temporal Graph Convolutional Network for Skeleton-Based Action Recognition
title	Enhanced Spatial and Extended Temporal Graph Convolutional Network for Skeleton-Based Action Recognition
title_full	Enhanced Spatial and Extended Temporal Graph Convolutional Network for Skeleton-Based Action Recognition
title_fullStr	Enhanced Spatial and Extended Temporal Graph Convolutional Network for Skeleton-Based Action Recognition
title_full_unstemmed	Enhanced Spatial and Extended Temporal Graph Convolutional Network for Skeleton-Based Action Recognition
title_short	Enhanced Spatial and Extended Temporal Graph Convolutional Network for Skeleton-Based Action Recognition
title_sort	enhanced spatial and extended temporal graph convolutional network for skeleton-based action recognition
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7571203/ https://www.ncbi.nlm.nih.gov/pubmed/32942579 http://dx.doi.org/10.3390/s20185260
work_keys_str_mv	AT lifanjia enhancedspatialandextendedtemporalgraphconvolutionalnetworkforskeletonbasedactionrecognition AT lijuanjuan enhancedspatialandextendedtemporalgraphconvolutionalnetworkforskeletonbasedactionrecognition AT zhuaichun enhancedspatialandextendedtemporalgraphconvolutionalnetworkforskeletonbasedactionrecognition AT xuyonggang enhancedspatialandextendedtemporalgraphconvolutionalnetworkforskeletonbasedactionrecognition AT yinhongsheng enhancedspatialandextendedtemporalgraphconvolutionalnetworkforskeletonbasedactionrecognition AT huagang enhancedspatialandextendedtemporalgraphconvolutionalnetworkforskeletonbasedactionrecognition

Enhanced Spatial and Extended Temporal Graph Convolutional Network for Skeleton-Based Action Recognition

Ejemplares similares