Cargando…

Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network

Effective extraction and representation of action information are critical in action recognition. The majority of existing methods fail to recognize actions accurately because of interference of background changes when the proportion of high-activity action areas is not reinforced and by using RGB f...

Descripción completa

Detalles Bibliográficos
Autores principales:	Xiong, Xin, Min, Weidong, Han, Qing, Wang, Qi, Zha, Cheng
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Hindawi 2022
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9208928/ https://www.ncbi.nlm.nih.gov/pubmed/35733557 http://dx.doi.org/10.1155/2022/6608448

_version_	1784729820691693568
author	Xiong, Xin Min, Weidong Han, Qing Wang, Qi Zha, Cheng
author_facet	Xiong, Xin Min, Weidong Han, Qing Wang, Qi Zha, Cheng
author_sort	Xiong, Xin
collection	PubMed
description	Effective extraction and representation of action information are critical in action recognition. The majority of existing methods fail to recognize actions accurately because of interference of background changes when the proportion of high-activity action areas is not reinforced and by using RGB flow alone or combined with optical flow. A novel recognition method using action sequences optimization and two-stream fusion network with different modalities is proposed to solve these problems. The method is based on shot segmentation and dynamic weighted sampling, and it reconstructs the video by reinforcing the proportion of high-activity action areas, eliminating redundant intervals, and extracting long-range temporal information. A two-stream 3D dilated neural network that integrates features of RGB and human skeleton information is also proposed. The human skeleton information strengthens the deep representation of humans for robust processing, alleviating the interference of background changes, and the dilated CNN enlarges the receptive field of feature extraction. Compared with existing approaches, the proposed method achieves superior or comparable classification accuracies on benchmark datasets UCF101 and HMDB51.
format	Online Article Text
id	pubmed-9208928
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Hindawi
record_format	MEDLINE/PubMed
spelling	pubmed-92089282022-06-21 Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network Xiong, Xin Min, Weidong Han, Qing Wang, Qi Zha, Cheng Comput Intell Neurosci Research Article Effective extraction and representation of action information are critical in action recognition. The majority of existing methods fail to recognize actions accurately because of interference of background changes when the proportion of high-activity action areas is not reinforced and by using RGB flow alone or combined with optical flow. A novel recognition method using action sequences optimization and two-stream fusion network with different modalities is proposed to solve these problems. The method is based on shot segmentation and dynamic weighted sampling, and it reconstructs the video by reinforcing the proportion of high-activity action areas, eliminating redundant intervals, and extracting long-range temporal information. A two-stream 3D dilated neural network that integrates features of RGB and human skeleton information is also proposed. The human skeleton information strengthens the deep representation of humans for robust processing, alleviating the interference of background changes, and the dilated CNN enlarges the receptive field of feature extraction. Compared with existing approaches, the proposed method achieves superior or comparable classification accuracies on benchmark datasets UCF101 and HMDB51. Hindawi 2022-06-13 /pmc/articles/PMC9208928/ /pubmed/35733557 http://dx.doi.org/10.1155/2022/6608448 Text en Copyright © 2022 Xin Xiong et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Xiong, Xin Min, Weidong Han, Qing Wang, Qi Zha, Cheng Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network
title	Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network
title_full	Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network
title_fullStr	Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network
title_full_unstemmed	Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network
title_short	Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network
title_sort	action recognition using action sequences optimization and two-stream 3d dilated neural network
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9208928/ https://www.ncbi.nlm.nih.gov/pubmed/35733557 http://dx.doi.org/10.1155/2022/6608448
work_keys_str_mv	AT xiongxin actionrecognitionusingactionsequencesoptimizationandtwostream3ddilatedneuralnetwork AT minweidong actionrecognitionusingactionsequencesoptimizationandtwostream3ddilatedneuralnetwork AT hanqing actionrecognitionusingactionsequencesoptimizationandtwostream3ddilatedneuralnetwork AT wangqi actionrecognitionusingactionsequencesoptimizationandtwostream3ddilatedneuralnetwork AT zhacheng actionrecognitionusingactionsequencesoptimizationandtwostream3ddilatedneuralnetwork

Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network

Ejemplares similares