Cargando…

Skeleton-Based Action Recognition Based on Distance Vector and Multihigh View Adaptive Networks

Skeleton-based human action recognition has attracted much attention in the field of computer vision. Most of the previous studies are based on fixed skeleton graphs so that only the local physical dependencies among joints can be captured, resulting in the omission of implicit joint correlations. I...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Min, Yang, Haijie, Li, Pengfei, Jiang, Ming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8390161/
https://www.ncbi.nlm.nih.gov/pubmed/34456989
http://dx.doi.org/10.1155/2021/1507770
_version_ 1783743034221920256
author Zhang, Min
Yang, Haijie
Li, Pengfei
Jiang, Ming
author_facet Zhang, Min
Yang, Haijie
Li, Pengfei
Jiang, Ming
author_sort Zhang, Min
collection PubMed
description Skeleton-based human action recognition has attracted much attention in the field of computer vision. Most of the previous studies are based on fixed skeleton graphs so that only the local physical dependencies among joints can be captured, resulting in the omission of implicit joint correlations. In addition, under different views, the content of the same action is very different. In some views, keypoints will be blocked, which will cause recognition errors. In this paper, an action recognition method based on distance vector and multihigh view adaptive network (DV-MHNet) is proposed to address this challenging task. Among the mentioned techniques, the multihigh (MH) view adaptive networks are constructed to automatically determine the best observation view at different heights, obtain complete keypoints information of the current frame image, and enhance the robustness and generalization of the model to recognize actions at different heights. Then, the distance vector (DV) mechanism is introduced on this basis to establish the relative distance and relative orientation between different keypoints in the same frame and the same keypoints in different frame to obtain the global potential relationship of each keypoint, and finally by constructing the spatial temporal graph convolutional network to take into account the information in space and time, the characteristics of the action are learned. This paper has done the ablation study with traditional spatial temporal graph convolutional networks and with or without multihigh view adaptive networks, which reasonably proves the effectiveness of the model. The model is evaluated on two widely used action recognition benchmarks (NTU-RGB + D and PKU-MMD). Our method achieves better performance on both datasets.
format Online
Article
Text
id pubmed-8390161
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-83901612021-08-27 Skeleton-Based Action Recognition Based on Distance Vector and Multihigh View Adaptive Networks Zhang, Min Yang, Haijie Li, Pengfei Jiang, Ming Comput Intell Neurosci Research Article Skeleton-based human action recognition has attracted much attention in the field of computer vision. Most of the previous studies are based on fixed skeleton graphs so that only the local physical dependencies among joints can be captured, resulting in the omission of implicit joint correlations. In addition, under different views, the content of the same action is very different. In some views, keypoints will be blocked, which will cause recognition errors. In this paper, an action recognition method based on distance vector and multihigh view adaptive network (DV-MHNet) is proposed to address this challenging task. Among the mentioned techniques, the multihigh (MH) view adaptive networks are constructed to automatically determine the best observation view at different heights, obtain complete keypoints information of the current frame image, and enhance the robustness and generalization of the model to recognize actions at different heights. Then, the distance vector (DV) mechanism is introduced on this basis to establish the relative distance and relative orientation between different keypoints in the same frame and the same keypoints in different frame to obtain the global potential relationship of each keypoint, and finally by constructing the spatial temporal graph convolutional network to take into account the information in space and time, the characteristics of the action are learned. This paper has done the ablation study with traditional spatial temporal graph convolutional networks and with or without multihigh view adaptive networks, which reasonably proves the effectiveness of the model. The model is evaluated on two widely used action recognition benchmarks (NTU-RGB + D and PKU-MMD). Our method achieves better performance on both datasets. Hindawi 2021-08-18 /pmc/articles/PMC8390161/ /pubmed/34456989 http://dx.doi.org/10.1155/2021/1507770 Text en Copyright © 2021 Min Zhang et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhang, Min
Yang, Haijie
Li, Pengfei
Jiang, Ming
Skeleton-Based Action Recognition Based on Distance Vector and Multihigh View Adaptive Networks
title Skeleton-Based Action Recognition Based on Distance Vector and Multihigh View Adaptive Networks
title_full Skeleton-Based Action Recognition Based on Distance Vector and Multihigh View Adaptive Networks
title_fullStr Skeleton-Based Action Recognition Based on Distance Vector and Multihigh View Adaptive Networks
title_full_unstemmed Skeleton-Based Action Recognition Based on Distance Vector and Multihigh View Adaptive Networks
title_short Skeleton-Based Action Recognition Based on Distance Vector and Multihigh View Adaptive Networks
title_sort skeleton-based action recognition based on distance vector and multihigh view adaptive networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8390161/
https://www.ncbi.nlm.nih.gov/pubmed/34456989
http://dx.doi.org/10.1155/2021/1507770
work_keys_str_mv AT zhangmin skeletonbasedactionrecognitionbasedondistancevectorandmultihighviewadaptivenetworks
AT yanghaijie skeletonbasedactionrecognitionbasedondistancevectorandmultihighviewadaptivenetworks
AT lipengfei skeletonbasedactionrecognitionbasedondistancevectorandmultihighviewadaptivenetworks
AT jiangming skeletonbasedactionrecognitionbasedondistancevectorandmultihighviewadaptivenetworks