Cargando…

ConMLP: MLP-Based Self-Supervised Contrastive Learning for Skeleton Data Analysis and Action Recognition

Human action recognition has drawn significant attention because of its importance in computer vision-based applications. Action recognition based on skeleton sequences has rapidly advanced in the last decade. Conventional deep learning-based approaches are based on extracting skeleton sequences thr...

Descripción completa

Detalles Bibliográficos
Autores principales:	Dai, Chuan, Wei, Yajuan, Xu, Zhijie, Chen, Minsi, Liu, Ying, Fan, Jiulun
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10007586/ https://www.ncbi.nlm.nih.gov/pubmed/36904656 http://dx.doi.org/10.3390/s23052452

_version_	1784905558907682816
author	Dai, Chuan Wei, Yajuan Xu, Zhijie Chen, Minsi Liu, Ying Fan, Jiulun
author_facet	Dai, Chuan Wei, Yajuan Xu, Zhijie Chen, Minsi Liu, Ying Fan, Jiulun
author_sort	Dai, Chuan
collection	PubMed
description	Human action recognition has drawn significant attention because of its importance in computer vision-based applications. Action recognition based on skeleton sequences has rapidly advanced in the last decade. Conventional deep learning-based approaches are based on extracting skeleton sequences through convolutional operations. Most of these architectures are implemented by learning spatial and temporal features through multiple streams. These studies have enlightened the action recognition endeavor from various algorithmic angles. However, three common issues are observed: (1) The models are usually complicated; therefore, they have a correspondingly higher computational complexity. (2) For supervised learning models, the reliance on labels during training is always a drawback. (3) Implementing large models is not beneficial to real-time applications. To address the above issues, in this paper, we propose a multi-layer perceptron (MLP)-based self-supervised learning framework with a contrastive learning loss function (ConMLP). ConMLP does not require a massive computational setup; it can effectively reduce the consumption of computational resources. Compared with supervised learning frameworks, ConMLP is friendly to the huge amount of unlabeled training data. In addition, it has low requirements for system configuration and is more conducive to being embedded in real-world applications. Extensive experiments show that ConMLP achieves the top one inference result of 96.9% on the NTU RGB+D dataset. This accuracy is higher than the state-of-the-art self-supervised learning method. Meanwhile, ConMLP is also evaluated in a supervised learning manner, which has achieved comparable performance to the state of the art of recognition accuracy.
format	Online Article Text
id	pubmed-10007586
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-100075862023-03-12 ConMLP: MLP-Based Self-Supervised Contrastive Learning for Skeleton Data Analysis and Action Recognition Dai, Chuan Wei, Yajuan Xu, Zhijie Chen, Minsi Liu, Ying Fan, Jiulun Sensors (Basel) Article Human action recognition has drawn significant attention because of its importance in computer vision-based applications. Action recognition based on skeleton sequences has rapidly advanced in the last decade. Conventional deep learning-based approaches are based on extracting skeleton sequences through convolutional operations. Most of these architectures are implemented by learning spatial and temporal features through multiple streams. These studies have enlightened the action recognition endeavor from various algorithmic angles. However, three common issues are observed: (1) The models are usually complicated; therefore, they have a correspondingly higher computational complexity. (2) For supervised learning models, the reliance on labels during training is always a drawback. (3) Implementing large models is not beneficial to real-time applications. To address the above issues, in this paper, we propose a multi-layer perceptron (MLP)-based self-supervised learning framework with a contrastive learning loss function (ConMLP). ConMLP does not require a massive computational setup; it can effectively reduce the consumption of computational resources. Compared with supervised learning frameworks, ConMLP is friendly to the huge amount of unlabeled training data. In addition, it has low requirements for system configuration and is more conducive to being embedded in real-world applications. Extensive experiments show that ConMLP achieves the top one inference result of 96.9% on the NTU RGB+D dataset. This accuracy is higher than the state-of-the-art self-supervised learning method. Meanwhile, ConMLP is also evaluated in a supervised learning manner, which has achieved comparable performance to the state of the art of recognition accuracy. MDPI 2023-02-22 /pmc/articles/PMC10007586/ /pubmed/36904656 http://dx.doi.org/10.3390/s23052452 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Dai, Chuan Wei, Yajuan Xu, Zhijie Chen, Minsi Liu, Ying Fan, Jiulun ConMLP: MLP-Based Self-Supervised Contrastive Learning for Skeleton Data Analysis and Action Recognition
title	ConMLP: MLP-Based Self-Supervised Contrastive Learning for Skeleton Data Analysis and Action Recognition
title_full	ConMLP: MLP-Based Self-Supervised Contrastive Learning for Skeleton Data Analysis and Action Recognition
title_fullStr	ConMLP: MLP-Based Self-Supervised Contrastive Learning for Skeleton Data Analysis and Action Recognition
title_full_unstemmed	ConMLP: MLP-Based Self-Supervised Contrastive Learning for Skeleton Data Analysis and Action Recognition
title_short	ConMLP: MLP-Based Self-Supervised Contrastive Learning for Skeleton Data Analysis and Action Recognition
title_sort	conmlp: mlp-based self-supervised contrastive learning for skeleton data analysis and action recognition
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10007586/ https://www.ncbi.nlm.nih.gov/pubmed/36904656 http://dx.doi.org/10.3390/s23052452
work_keys_str_mv	AT daichuan conmlpmlpbasedselfsupervisedcontrastivelearningforskeletondataanalysisandactionrecognition AT weiyajuan conmlpmlpbasedselfsupervisedcontrastivelearningforskeletondataanalysisandactionrecognition AT xuzhijie conmlpmlpbasedselfsupervisedcontrastivelearningforskeletondataanalysisandactionrecognition AT chenminsi conmlpmlpbasedselfsupervisedcontrastivelearningforskeletondataanalysisandactionrecognition AT liuying conmlpmlpbasedselfsupervisedcontrastivelearningforskeletondataanalysisandactionrecognition AT fanjiulun conmlpmlpbasedselfsupervisedcontrastivelearningforskeletondataanalysisandactionrecognition

ConMLP: MLP-Based Self-Supervised Contrastive Learning for Skeleton Data Analysis and Action Recognition

Ejemplares similares