Cargando…
A Hybrid Network for Large-Scale Action Recognition from RGB and Depth Modalities
The paper presents a novel hybrid network for large-scale action recognition from multiple modalities. The network is built upon the proposed weighted dynamic images. It effectively leverages the strengths of the emerging Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) based ap...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7308905/ https://www.ncbi.nlm.nih.gov/pubmed/32532007 http://dx.doi.org/10.3390/s20113305 |
_version_ | 1783549100480790528 |
---|---|
author | Wang, Huogen Song, Zhanjie Li, Wanqing Wang, Pichao |
author_facet | Wang, Huogen Song, Zhanjie Li, Wanqing Wang, Pichao |
author_sort | Wang, Huogen |
collection | PubMed |
description | The paper presents a novel hybrid network for large-scale action recognition from multiple modalities. The network is built upon the proposed weighted dynamic images. It effectively leverages the strengths of the emerging Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) based approaches to specifically address the challenges that occur in large-scale action recognition and are not fully dealt with by the state-of-the-art methods. Specifically, the proposed hybrid network consists of a CNN based component and an RNN based component. Features extracted by the two components are fused through canonical correlation analysis and then fed to a linear Support Vector Machine (SVM) for classification. The proposed network achieved state-of-the-art results on the ChaLearn LAP IsoGD, NTU RGB+D and Multi-modal & Multi-view & Interactive ([Formula: see text]) datasets and outperformed existing methods by a large margin (over 10 percentage points in some cases). |
format | Online Article Text |
id | pubmed-7308905 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-73089052020-06-25 A Hybrid Network for Large-Scale Action Recognition from RGB and Depth Modalities Wang, Huogen Song, Zhanjie Li, Wanqing Wang, Pichao Sensors (Basel) Article The paper presents a novel hybrid network for large-scale action recognition from multiple modalities. The network is built upon the proposed weighted dynamic images. It effectively leverages the strengths of the emerging Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) based approaches to specifically address the challenges that occur in large-scale action recognition and are not fully dealt with by the state-of-the-art methods. Specifically, the proposed hybrid network consists of a CNN based component and an RNN based component. Features extracted by the two components are fused through canonical correlation analysis and then fed to a linear Support Vector Machine (SVM) for classification. The proposed network achieved state-of-the-art results on the ChaLearn LAP IsoGD, NTU RGB+D and Multi-modal & Multi-view & Interactive ([Formula: see text]) datasets and outperformed existing methods by a large margin (over 10 percentage points in some cases). MDPI 2020-06-10 /pmc/articles/PMC7308905/ /pubmed/32532007 http://dx.doi.org/10.3390/s20113305 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Wang, Huogen Song, Zhanjie Li, Wanqing Wang, Pichao A Hybrid Network for Large-Scale Action Recognition from RGB and Depth Modalities |
title | A Hybrid Network for Large-Scale Action Recognition from RGB and Depth Modalities |
title_full | A Hybrid Network for Large-Scale Action Recognition from RGB and Depth Modalities |
title_fullStr | A Hybrid Network for Large-Scale Action Recognition from RGB and Depth Modalities |
title_full_unstemmed | A Hybrid Network for Large-Scale Action Recognition from RGB and Depth Modalities |
title_short | A Hybrid Network for Large-Scale Action Recognition from RGB and Depth Modalities |
title_sort | hybrid network for large-scale action recognition from rgb and depth modalities |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7308905/ https://www.ncbi.nlm.nih.gov/pubmed/32532007 http://dx.doi.org/10.3390/s20113305 |
work_keys_str_mv | AT wanghuogen ahybridnetworkforlargescaleactionrecognitionfromrgbanddepthmodalities AT songzhanjie ahybridnetworkforlargescaleactionrecognitionfromrgbanddepthmodalities AT liwanqing ahybridnetworkforlargescaleactionrecognitionfromrgbanddepthmodalities AT wangpichao ahybridnetworkforlargescaleactionrecognitionfromrgbanddepthmodalities AT wanghuogen hybridnetworkforlargescaleactionrecognitionfromrgbanddepthmodalities AT songzhanjie hybridnetworkforlargescaleactionrecognitionfromrgbanddepthmodalities AT liwanqing hybridnetworkforlargescaleactionrecognitionfromrgbanddepthmodalities AT wangpichao hybridnetworkforlargescaleactionrecognitionfromrgbanddepthmodalities |