Cargando…

A Hybrid Network for Large-Scale Action Recognition from RGB and Depth Modalities

The paper presents a novel hybrid network for large-scale action recognition from multiple modalities. The network is built upon the proposed weighted dynamic images. It effectively leverages the strengths of the emerging Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) based ap...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Huogen, Song, Zhanjie, Li, Wanqing, Wang, Pichao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7308905/
https://www.ncbi.nlm.nih.gov/pubmed/32532007
http://dx.doi.org/10.3390/s20113305
_version_ 1783549100480790528
author Wang, Huogen
Song, Zhanjie
Li, Wanqing
Wang, Pichao
author_facet Wang, Huogen
Song, Zhanjie
Li, Wanqing
Wang, Pichao
author_sort Wang, Huogen
collection PubMed
description The paper presents a novel hybrid network for large-scale action recognition from multiple modalities. The network is built upon the proposed weighted dynamic images. It effectively leverages the strengths of the emerging Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) based approaches to specifically address the challenges that occur in large-scale action recognition and are not fully dealt with by the state-of-the-art methods. Specifically, the proposed hybrid network consists of a CNN based component and an RNN based component. Features extracted by the two components are fused through canonical correlation analysis and then fed to a linear Support Vector Machine (SVM) for classification. The proposed network achieved state-of-the-art results on the ChaLearn LAP IsoGD, NTU RGB+D and Multi-modal & Multi-view & Interactive ([Formula: see text]) datasets and outperformed existing methods by a large margin (over 10 percentage points in some cases).
format Online
Article
Text
id pubmed-7308905
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-73089052020-06-25 A Hybrid Network for Large-Scale Action Recognition from RGB and Depth Modalities Wang, Huogen Song, Zhanjie Li, Wanqing Wang, Pichao Sensors (Basel) Article The paper presents a novel hybrid network for large-scale action recognition from multiple modalities. The network is built upon the proposed weighted dynamic images. It effectively leverages the strengths of the emerging Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) based approaches to specifically address the challenges that occur in large-scale action recognition and are not fully dealt with by the state-of-the-art methods. Specifically, the proposed hybrid network consists of a CNN based component and an RNN based component. Features extracted by the two components are fused through canonical correlation analysis and then fed to a linear Support Vector Machine (SVM) for classification. The proposed network achieved state-of-the-art results on the ChaLearn LAP IsoGD, NTU RGB+D and Multi-modal & Multi-view & Interactive ([Formula: see text]) datasets and outperformed existing methods by a large margin (over 10 percentage points in some cases). MDPI 2020-06-10 /pmc/articles/PMC7308905/ /pubmed/32532007 http://dx.doi.org/10.3390/s20113305 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wang, Huogen
Song, Zhanjie
Li, Wanqing
Wang, Pichao
A Hybrid Network for Large-Scale Action Recognition from RGB and Depth Modalities
title A Hybrid Network for Large-Scale Action Recognition from RGB and Depth Modalities
title_full A Hybrid Network for Large-Scale Action Recognition from RGB and Depth Modalities
title_fullStr A Hybrid Network for Large-Scale Action Recognition from RGB and Depth Modalities
title_full_unstemmed A Hybrid Network for Large-Scale Action Recognition from RGB and Depth Modalities
title_short A Hybrid Network for Large-Scale Action Recognition from RGB and Depth Modalities
title_sort hybrid network for large-scale action recognition from rgb and depth modalities
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7308905/
https://www.ncbi.nlm.nih.gov/pubmed/32532007
http://dx.doi.org/10.3390/s20113305
work_keys_str_mv AT wanghuogen ahybridnetworkforlargescaleactionrecognitionfromrgbanddepthmodalities
AT songzhanjie ahybridnetworkforlargescaleactionrecognitionfromrgbanddepthmodalities
AT liwanqing ahybridnetworkforlargescaleactionrecognitionfromrgbanddepthmodalities
AT wangpichao ahybridnetworkforlargescaleactionrecognitionfromrgbanddepthmodalities
AT wanghuogen hybridnetworkforlargescaleactionrecognitionfromrgbanddepthmodalities
AT songzhanjie hybridnetworkforlargescaleactionrecognitionfromrgbanddepthmodalities
AT liwanqing hybridnetworkforlargescaleactionrecognitionfromrgbanddepthmodalities
AT wangpichao hybridnetworkforlargescaleactionrecognitionfromrgbanddepthmodalities