Cargando…

Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network

Effective extraction and representation of action information are critical in action recognition. The majority of existing methods fail to recognize actions accurately because of interference of background changes when the proportion of high-activity action areas is not reinforced and by using RGB f...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiong, Xin, Min, Weidong, Han, Qing, Wang, Qi, Zha, Cheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9208928/
https://www.ncbi.nlm.nih.gov/pubmed/35733557
http://dx.doi.org/10.1155/2022/6608448
_version_ 1784729820691693568
author Xiong, Xin
Min, Weidong
Han, Qing
Wang, Qi
Zha, Cheng
author_facet Xiong, Xin
Min, Weidong
Han, Qing
Wang, Qi
Zha, Cheng
author_sort Xiong, Xin
collection PubMed
description Effective extraction and representation of action information are critical in action recognition. The majority of existing methods fail to recognize actions accurately because of interference of background changes when the proportion of high-activity action areas is not reinforced and by using RGB flow alone or combined with optical flow. A novel recognition method using action sequences optimization and two-stream fusion network with different modalities is proposed to solve these problems. The method is based on shot segmentation and dynamic weighted sampling, and it reconstructs the video by reinforcing the proportion of high-activity action areas, eliminating redundant intervals, and extracting long-range temporal information. A two-stream 3D dilated neural network that integrates features of RGB and human skeleton information is also proposed. The human skeleton information strengthens the deep representation of humans for robust processing, alleviating the interference of background changes, and the dilated CNN enlarges the receptive field of feature extraction. Compared with existing approaches, the proposed method achieves superior or comparable classification accuracies on benchmark datasets UCF101 and HMDB51.
format Online
Article
Text
id pubmed-9208928
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-92089282022-06-21 Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network Xiong, Xin Min, Weidong Han, Qing Wang, Qi Zha, Cheng Comput Intell Neurosci Research Article Effective extraction and representation of action information are critical in action recognition. The majority of existing methods fail to recognize actions accurately because of interference of background changes when the proportion of high-activity action areas is not reinforced and by using RGB flow alone or combined with optical flow. A novel recognition method using action sequences optimization and two-stream fusion network with different modalities is proposed to solve these problems. The method is based on shot segmentation and dynamic weighted sampling, and it reconstructs the video by reinforcing the proportion of high-activity action areas, eliminating redundant intervals, and extracting long-range temporal information. A two-stream 3D dilated neural network that integrates features of RGB and human skeleton information is also proposed. The human skeleton information strengthens the deep representation of humans for robust processing, alleviating the interference of background changes, and the dilated CNN enlarges the receptive field of feature extraction. Compared with existing approaches, the proposed method achieves superior or comparable classification accuracies on benchmark datasets UCF101 and HMDB51. Hindawi 2022-06-13 /pmc/articles/PMC9208928/ /pubmed/35733557 http://dx.doi.org/10.1155/2022/6608448 Text en Copyright © 2022 Xin Xiong et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Xiong, Xin
Min, Weidong
Han, Qing
Wang, Qi
Zha, Cheng
Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network
title Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network
title_full Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network
title_fullStr Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network
title_full_unstemmed Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network
title_short Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network
title_sort action recognition using action sequences optimization and two-stream 3d dilated neural network
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9208928/
https://www.ncbi.nlm.nih.gov/pubmed/35733557
http://dx.doi.org/10.1155/2022/6608448
work_keys_str_mv AT xiongxin actionrecognitionusingactionsequencesoptimizationandtwostream3ddilatedneuralnetwork
AT minweidong actionrecognitionusingactionsequencesoptimizationandtwostream3ddilatedneuralnetwork
AT hanqing actionrecognitionusingactionsequencesoptimizationandtwostream3ddilatedneuralnetwork
AT wangqi actionrecognitionusingactionsequencesoptimizationandtwostream3ddilatedneuralnetwork
AT zhacheng actionrecognitionusingactionsequencesoptimizationandtwostream3ddilatedneuralnetwork