Cargando…
Skeleton-Based Action Recognition Based on Distance Vector and Multihigh View Adaptive Networks
Skeleton-based human action recognition has attracted much attention in the field of computer vision. Most of the previous studies are based on fixed skeleton graphs so that only the local physical dependencies among joints can be captured, resulting in the omission of implicit joint correlations. I...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8390161/ https://www.ncbi.nlm.nih.gov/pubmed/34456989 http://dx.doi.org/10.1155/2021/1507770 |
_version_ | 1783743034221920256 |
---|---|
author | Zhang, Min Yang, Haijie Li, Pengfei Jiang, Ming |
author_facet | Zhang, Min Yang, Haijie Li, Pengfei Jiang, Ming |
author_sort | Zhang, Min |
collection | PubMed |
description | Skeleton-based human action recognition has attracted much attention in the field of computer vision. Most of the previous studies are based on fixed skeleton graphs so that only the local physical dependencies among joints can be captured, resulting in the omission of implicit joint correlations. In addition, under different views, the content of the same action is very different. In some views, keypoints will be blocked, which will cause recognition errors. In this paper, an action recognition method based on distance vector and multihigh view adaptive network (DV-MHNet) is proposed to address this challenging task. Among the mentioned techniques, the multihigh (MH) view adaptive networks are constructed to automatically determine the best observation view at different heights, obtain complete keypoints information of the current frame image, and enhance the robustness and generalization of the model to recognize actions at different heights. Then, the distance vector (DV) mechanism is introduced on this basis to establish the relative distance and relative orientation between different keypoints in the same frame and the same keypoints in different frame to obtain the global potential relationship of each keypoint, and finally by constructing the spatial temporal graph convolutional network to take into account the information in space and time, the characteristics of the action are learned. This paper has done the ablation study with traditional spatial temporal graph convolutional networks and with or without multihigh view adaptive networks, which reasonably proves the effectiveness of the model. The model is evaluated on two widely used action recognition benchmarks (NTU-RGB + D and PKU-MMD). Our method achieves better performance on both datasets. |
format | Online Article Text |
id | pubmed-8390161 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-83901612021-08-27 Skeleton-Based Action Recognition Based on Distance Vector and Multihigh View Adaptive Networks Zhang, Min Yang, Haijie Li, Pengfei Jiang, Ming Comput Intell Neurosci Research Article Skeleton-based human action recognition has attracted much attention in the field of computer vision. Most of the previous studies are based on fixed skeleton graphs so that only the local physical dependencies among joints can be captured, resulting in the omission of implicit joint correlations. In addition, under different views, the content of the same action is very different. In some views, keypoints will be blocked, which will cause recognition errors. In this paper, an action recognition method based on distance vector and multihigh view adaptive network (DV-MHNet) is proposed to address this challenging task. Among the mentioned techniques, the multihigh (MH) view adaptive networks are constructed to automatically determine the best observation view at different heights, obtain complete keypoints information of the current frame image, and enhance the robustness and generalization of the model to recognize actions at different heights. Then, the distance vector (DV) mechanism is introduced on this basis to establish the relative distance and relative orientation between different keypoints in the same frame and the same keypoints in different frame to obtain the global potential relationship of each keypoint, and finally by constructing the spatial temporal graph convolutional network to take into account the information in space and time, the characteristics of the action are learned. This paper has done the ablation study with traditional spatial temporal graph convolutional networks and with or without multihigh view adaptive networks, which reasonably proves the effectiveness of the model. The model is evaluated on two widely used action recognition benchmarks (NTU-RGB + D and PKU-MMD). Our method achieves better performance on both datasets. Hindawi 2021-08-18 /pmc/articles/PMC8390161/ /pubmed/34456989 http://dx.doi.org/10.1155/2021/1507770 Text en Copyright © 2021 Min Zhang et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Zhang, Min Yang, Haijie Li, Pengfei Jiang, Ming Skeleton-Based Action Recognition Based on Distance Vector and Multihigh View Adaptive Networks |
title | Skeleton-Based Action Recognition Based on Distance Vector and Multihigh View Adaptive Networks |
title_full | Skeleton-Based Action Recognition Based on Distance Vector and Multihigh View Adaptive Networks |
title_fullStr | Skeleton-Based Action Recognition Based on Distance Vector and Multihigh View Adaptive Networks |
title_full_unstemmed | Skeleton-Based Action Recognition Based on Distance Vector and Multihigh View Adaptive Networks |
title_short | Skeleton-Based Action Recognition Based on Distance Vector and Multihigh View Adaptive Networks |
title_sort | skeleton-based action recognition based on distance vector and multihigh view adaptive networks |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8390161/ https://www.ncbi.nlm.nih.gov/pubmed/34456989 http://dx.doi.org/10.1155/2021/1507770 |
work_keys_str_mv | AT zhangmin skeletonbasedactionrecognitionbasedondistancevectorandmultihighviewadaptivenetworks AT yanghaijie skeletonbasedactionrecognitionbasedondistancevectorandmultihighviewadaptivenetworks AT lipengfei skeletonbasedactionrecognitionbasedondistancevectorandmultihighviewadaptivenetworks AT jiangming skeletonbasedactionrecognitionbasedondistancevectorandmultihighviewadaptivenetworks |