Cargando…

Using a Selective Ensemble Support Vector Machine to Fuse Multimodal Features for Human Action Recognition

The traditional human action recognition (HAR) method is based on RGB video. Recently, with the introduction of Microsoft Kinect and other consumer class depth cameras, HAR based on RGB-D (RGB-Depth) has drawn increasing attention from scholars and industry. Compared with the traditional method, the...

Descripción completa

Detalles Bibliográficos
Autores principales: Tang, Chao, Tong, Anyang, Zheng, Aihua, Peng, Hua, Li, Wei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8763533/
https://www.ncbi.nlm.nih.gov/pubmed/35047028
http://dx.doi.org/10.1155/2022/1877464
_version_ 1784633964959367168
author Tang, Chao
Tong, Anyang
Zheng, Aihua
Peng, Hua
Li, Wei
author_facet Tang, Chao
Tong, Anyang
Zheng, Aihua
Peng, Hua
Li, Wei
author_sort Tang, Chao
collection PubMed
description The traditional human action recognition (HAR) method is based on RGB video. Recently, with the introduction of Microsoft Kinect and other consumer class depth cameras, HAR based on RGB-D (RGB-Depth) has drawn increasing attention from scholars and industry. Compared with the traditional method, the HAR based on RGB-D has high accuracy and strong robustness. In this paper, using a selective ensemble support vector machine to fuse multimodal features for human action recognition is proposed. The algorithm combines the improved HOG feature-based RGB modal data, the depth motion map-based local binary pattern features (DMM-LBP), and the hybrid joint features (HJF)-based joints modal data. Concomitantly, a frame-based selective ensemble support vector machine classification model (SESVM) is proposed, which effectively integrates the selective ensemble strategy with the selection of SVM base classifiers, thus increasing the differences between the base classifiers. The experimental results have demonstrated that the proposed method is simple, fast, and efficient on public datasets in comparison with other action recognition algorithms.
format Online
Article
Text
id pubmed-8763533
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-87635332022-01-18 Using a Selective Ensemble Support Vector Machine to Fuse Multimodal Features for Human Action Recognition Tang, Chao Tong, Anyang Zheng, Aihua Peng, Hua Li, Wei Comput Intell Neurosci Research Article The traditional human action recognition (HAR) method is based on RGB video. Recently, with the introduction of Microsoft Kinect and other consumer class depth cameras, HAR based on RGB-D (RGB-Depth) has drawn increasing attention from scholars and industry. Compared with the traditional method, the HAR based on RGB-D has high accuracy and strong robustness. In this paper, using a selective ensemble support vector machine to fuse multimodal features for human action recognition is proposed. The algorithm combines the improved HOG feature-based RGB modal data, the depth motion map-based local binary pattern features (DMM-LBP), and the hybrid joint features (HJF)-based joints modal data. Concomitantly, a frame-based selective ensemble support vector machine classification model (SESVM) is proposed, which effectively integrates the selective ensemble strategy with the selection of SVM base classifiers, thus increasing the differences between the base classifiers. The experimental results have demonstrated that the proposed method is simple, fast, and efficient on public datasets in comparison with other action recognition algorithms. Hindawi 2022-01-10 /pmc/articles/PMC8763533/ /pubmed/35047028 http://dx.doi.org/10.1155/2022/1877464 Text en Copyright © 2022 Chao Tang et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Tang, Chao
Tong, Anyang
Zheng, Aihua
Peng, Hua
Li, Wei
Using a Selective Ensemble Support Vector Machine to Fuse Multimodal Features for Human Action Recognition
title Using a Selective Ensemble Support Vector Machine to Fuse Multimodal Features for Human Action Recognition
title_full Using a Selective Ensemble Support Vector Machine to Fuse Multimodal Features for Human Action Recognition
title_fullStr Using a Selective Ensemble Support Vector Machine to Fuse Multimodal Features for Human Action Recognition
title_full_unstemmed Using a Selective Ensemble Support Vector Machine to Fuse Multimodal Features for Human Action Recognition
title_short Using a Selective Ensemble Support Vector Machine to Fuse Multimodal Features for Human Action Recognition
title_sort using a selective ensemble support vector machine to fuse multimodal features for human action recognition
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8763533/
https://www.ncbi.nlm.nih.gov/pubmed/35047028
http://dx.doi.org/10.1155/2022/1877464
work_keys_str_mv AT tangchao usingaselectiveensemblesupportvectormachinetofusemultimodalfeaturesforhumanactionrecognition
AT tonganyang usingaselectiveensemblesupportvectormachinetofusemultimodalfeaturesforhumanactionrecognition
AT zhengaihua usingaselectiveensemblesupportvectormachinetofusemultimodalfeaturesforhumanactionrecognition
AT penghua usingaselectiveensemblesupportvectormachinetofusemultimodalfeaturesforhumanactionrecognition
AT liwei usingaselectiveensemblesupportvectormachinetofusemultimodalfeaturesforhumanactionrecognition