Cargando…

Energy-Guided Temporal Segmentation Network for Multimodal Human Action Recognition

To achieve the satisfactory performance of human action recognition, a central task is to address the sub-action sharing problem, especially in similar action classes. Nevertheless, most existing convolutional neural network (CNN)-based action recognition algorithms uniformly divide video into frame...

Descripción completa

Detalles Bibliográficos
Autores principales:	Liu, Qiang, Chen, Enqing, Gao, Lei, Liang, Chengwu, Liu, Hao
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7506802/ https://www.ncbi.nlm.nih.gov/pubmed/32825038 http://dx.doi.org/10.3390/s20174673

_version_	1783585097526542336
author	Liu, Qiang Chen, Enqing Gao, Lei Liang, Chengwu Liu, Hao
author_facet	Liu, Qiang Chen, Enqing Gao, Lei Liang, Chengwu Liu, Hao
author_sort	Liu, Qiang
collection	PubMed
description	To achieve the satisfactory performance of human action recognition, a central task is to address the sub-action sharing problem, especially in similar action classes. Nevertheless, most existing convolutional neural network (CNN)-based action recognition algorithms uniformly divide video into frames and then randomly select the frames as inputs, ignoring the distinct characteristics among different frames. In recent years, depth videos have been increasingly used for action recognition, but most methods merely focus on the spatial information of the different actions without utilizing temporal information. In order to address these issues, a novel energy-guided temporal segmentation method is proposed here, and a multimodal fusion strategy is employed with the proposed segmentation method to construct an energy-guided temporal segmentation network (EGTSN). Specifically, the EGTSN had two parts: energy-guided video segmentation and a multimodal fusion heterogeneous CNN. The proposed solution was evaluated on a public large-scale NTU RGB+D dataset. Comparisons with state-of-the-art methods demonstrate the effectiveness of the proposed network.
format	Online Article Text
id	pubmed-7506802
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-75068022020-09-26 Energy-Guided Temporal Segmentation Network for Multimodal Human Action Recognition Liu, Qiang Chen, Enqing Gao, Lei Liang, Chengwu Liu, Hao Sensors (Basel) Article To achieve the satisfactory performance of human action recognition, a central task is to address the sub-action sharing problem, especially in similar action classes. Nevertheless, most existing convolutional neural network (CNN)-based action recognition algorithms uniformly divide video into frames and then randomly select the frames as inputs, ignoring the distinct characteristics among different frames. In recent years, depth videos have been increasingly used for action recognition, but most methods merely focus on the spatial information of the different actions without utilizing temporal information. In order to address these issues, a novel energy-guided temporal segmentation method is proposed here, and a multimodal fusion strategy is employed with the proposed segmentation method to construct an energy-guided temporal segmentation network (EGTSN). Specifically, the EGTSN had two parts: energy-guided video segmentation and a multimodal fusion heterogeneous CNN. The proposed solution was evaluated on a public large-scale NTU RGB+D dataset. Comparisons with state-of-the-art methods demonstrate the effectiveness of the proposed network. MDPI 2020-08-19 /pmc/articles/PMC7506802/ /pubmed/32825038 http://dx.doi.org/10.3390/s20174673 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Liu, Qiang Chen, Enqing Gao, Lei Liang, Chengwu Liu, Hao Energy-Guided Temporal Segmentation Network for Multimodal Human Action Recognition
title	Energy-Guided Temporal Segmentation Network for Multimodal Human Action Recognition
title_full	Energy-Guided Temporal Segmentation Network for Multimodal Human Action Recognition
title_fullStr	Energy-Guided Temporal Segmentation Network for Multimodal Human Action Recognition
title_full_unstemmed	Energy-Guided Temporal Segmentation Network for Multimodal Human Action Recognition
title_short	Energy-Guided Temporal Segmentation Network for Multimodal Human Action Recognition
title_sort	energy-guided temporal segmentation network for multimodal human action recognition
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7506802/ https://www.ncbi.nlm.nih.gov/pubmed/32825038 http://dx.doi.org/10.3390/s20174673
work_keys_str_mv	AT liuqiang energyguidedtemporalsegmentationnetworkformultimodalhumanactionrecognition AT chenenqing energyguidedtemporalsegmentationnetworkformultimodalhumanactionrecognition AT gaolei energyguidedtemporalsegmentationnetworkformultimodalhumanactionrecognition AT liangchengwu energyguidedtemporalsegmentationnetworkformultimodalhumanactionrecognition AT liuhao energyguidedtemporalsegmentationnetworkformultimodalhumanactionrecognition

Energy-Guided Temporal Segmentation Network for Multimodal Human Action Recognition

Ejemplares similares