Cargando…

Skeleton-Based Action Recognition Based on Distance Vector and Multihigh View Adaptive Networks

Skeleton-based human action recognition has attracted much attention in the field of computer vision. Most of the previous studies are based on fixed skeleton graphs so that only the local physical dependencies among joints can be captured, resulting in the omission of implicit joint correlations. I...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhang, Min, Yang, Haijie, Li, Pengfei, Jiang, Ming
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Hindawi 2021
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8390161/ https://www.ncbi.nlm.nih.gov/pubmed/34456989 http://dx.doi.org/10.1155/2021/1507770

_version_	1783743034221920256
author	Zhang, Min Yang, Haijie Li, Pengfei Jiang, Ming
author_facet	Zhang, Min Yang, Haijie Li, Pengfei Jiang, Ming
author_sort	Zhang, Min
collection	PubMed
description	Skeleton-based human action recognition has attracted much attention in the field of computer vision. Most of the previous studies are based on fixed skeleton graphs so that only the local physical dependencies among joints can be captured, resulting in the omission of implicit joint correlations. In addition, under different views, the content of the same action is very different. In some views, keypoints will be blocked, which will cause recognition errors. In this paper, an action recognition method based on distance vector and multihigh view adaptive network (DV-MHNet) is proposed to address this challenging task. Among the mentioned techniques, the multihigh (MH) view adaptive networks are constructed to automatically determine the best observation view at different heights, obtain complete keypoints information of the current frame image, and enhance the robustness and generalization of the model to recognize actions at different heights. Then, the distance vector (DV) mechanism is introduced on this basis to establish the relative distance and relative orientation between different keypoints in the same frame and the same keypoints in different frame to obtain the global potential relationship of each keypoint, and finally by constructing the spatial temporal graph convolutional network to take into account the information in space and time, the characteristics of the action are learned. This paper has done the ablation study with traditional spatial temporal graph convolutional networks and with or without multihigh view adaptive networks, which reasonably proves the effectiveness of the model. The model is evaluated on two widely used action recognition benchmarks (NTU-RGB + D and PKU-MMD). Our method achieves better performance on both datasets.
format	Online Article Text
id	pubmed-8390161
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Hindawi
record_format	MEDLINE/PubMed
spelling	pubmed-83901612021-08-27 Skeleton-Based Action Recognition Based on Distance Vector and Multihigh View Adaptive Networks Zhang, Min Yang, Haijie Li, Pengfei Jiang, Ming Comput Intell Neurosci Research Article Skeleton-based human action recognition has attracted much attention in the field of computer vision. Most of the previous studies are based on fixed skeleton graphs so that only the local physical dependencies among joints can be captured, resulting in the omission of implicit joint correlations. In addition, under different views, the content of the same action is very different. In some views, keypoints will be blocked, which will cause recognition errors. In this paper, an action recognition method based on distance vector and multihigh view adaptive network (DV-MHNet) is proposed to address this challenging task. Among the mentioned techniques, the multihigh (MH) view adaptive networks are constructed to automatically determine the best observation view at different heights, obtain complete keypoints information of the current frame image, and enhance the robustness and generalization of the model to recognize actions at different heights. Then, the distance vector (DV) mechanism is introduced on this basis to establish the relative distance and relative orientation between different keypoints in the same frame and the same keypoints in different frame to obtain the global potential relationship of each keypoint, and finally by constructing the spatial temporal graph convolutional network to take into account the information in space and time, the characteristics of the action are learned. This paper has done the ablation study with traditional spatial temporal graph convolutional networks and with or without multihigh view adaptive networks, which reasonably proves the effectiveness of the model. The model is evaluated on two widely used action recognition benchmarks (NTU-RGB + D and PKU-MMD). Our method achieves better performance on both datasets. Hindawi 2021-08-18 /pmc/articles/PMC8390161/ /pubmed/34456989 http://dx.doi.org/10.1155/2021/1507770 Text en Copyright © 2021 Min Zhang et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Zhang, Min Yang, Haijie Li, Pengfei Jiang, Ming Skeleton-Based Action Recognition Based on Distance Vector and Multihigh View Adaptive Networks
title	Skeleton-Based Action Recognition Based on Distance Vector and Multihigh View Adaptive Networks
title_full	Skeleton-Based Action Recognition Based on Distance Vector and Multihigh View Adaptive Networks
title_fullStr	Skeleton-Based Action Recognition Based on Distance Vector and Multihigh View Adaptive Networks
title_full_unstemmed	Skeleton-Based Action Recognition Based on Distance Vector and Multihigh View Adaptive Networks
title_short	Skeleton-Based Action Recognition Based on Distance Vector and Multihigh View Adaptive Networks
title_sort	skeleton-based action recognition based on distance vector and multihigh view adaptive networks
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8390161/ https://www.ncbi.nlm.nih.gov/pubmed/34456989 http://dx.doi.org/10.1155/2021/1507770
work_keys_str_mv	AT zhangmin skeletonbasedactionrecognitionbasedondistancevectorandmultihighviewadaptivenetworks AT yanghaijie skeletonbasedactionrecognitionbasedondistancevectorandmultihighviewadaptivenetworks AT lipengfei skeletonbasedactionrecognitionbasedondistancevectorandmultihighviewadaptivenetworks AT jiangming skeletonbasedactionrecognitionbasedondistancevectorandmultihighviewadaptivenetworks

Skeleton-Based Action Recognition Based on Distance Vector and Multihigh View Adaptive Networks

Ejemplares similares