Cargando…
Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network
Effective extraction and representation of action information are critical in action recognition. The majority of existing methods fail to recognize actions accurately because of interference of background changes when the proportion of high-activity action areas is not reinforced and by using RGB f...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9208928/ https://www.ncbi.nlm.nih.gov/pubmed/35733557 http://dx.doi.org/10.1155/2022/6608448 |
_version_ | 1784729820691693568 |
---|---|
author | Xiong, Xin Min, Weidong Han, Qing Wang, Qi Zha, Cheng |
author_facet | Xiong, Xin Min, Weidong Han, Qing Wang, Qi Zha, Cheng |
author_sort | Xiong, Xin |
collection | PubMed |
description | Effective extraction and representation of action information are critical in action recognition. The majority of existing methods fail to recognize actions accurately because of interference of background changes when the proportion of high-activity action areas is not reinforced and by using RGB flow alone or combined with optical flow. A novel recognition method using action sequences optimization and two-stream fusion network with different modalities is proposed to solve these problems. The method is based on shot segmentation and dynamic weighted sampling, and it reconstructs the video by reinforcing the proportion of high-activity action areas, eliminating redundant intervals, and extracting long-range temporal information. A two-stream 3D dilated neural network that integrates features of RGB and human skeleton information is also proposed. The human skeleton information strengthens the deep representation of humans for robust processing, alleviating the interference of background changes, and the dilated CNN enlarges the receptive field of feature extraction. Compared with existing approaches, the proposed method achieves superior or comparable classification accuracies on benchmark datasets UCF101 and HMDB51. |
format | Online Article Text |
id | pubmed-9208928 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-92089282022-06-21 Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network Xiong, Xin Min, Weidong Han, Qing Wang, Qi Zha, Cheng Comput Intell Neurosci Research Article Effective extraction and representation of action information are critical in action recognition. The majority of existing methods fail to recognize actions accurately because of interference of background changes when the proportion of high-activity action areas is not reinforced and by using RGB flow alone or combined with optical flow. A novel recognition method using action sequences optimization and two-stream fusion network with different modalities is proposed to solve these problems. The method is based on shot segmentation and dynamic weighted sampling, and it reconstructs the video by reinforcing the proportion of high-activity action areas, eliminating redundant intervals, and extracting long-range temporal information. A two-stream 3D dilated neural network that integrates features of RGB and human skeleton information is also proposed. The human skeleton information strengthens the deep representation of humans for robust processing, alleviating the interference of background changes, and the dilated CNN enlarges the receptive field of feature extraction. Compared with existing approaches, the proposed method achieves superior or comparable classification accuracies on benchmark datasets UCF101 and HMDB51. Hindawi 2022-06-13 /pmc/articles/PMC9208928/ /pubmed/35733557 http://dx.doi.org/10.1155/2022/6608448 Text en Copyright © 2022 Xin Xiong et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Xiong, Xin Min, Weidong Han, Qing Wang, Qi Zha, Cheng Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network |
title | Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network |
title_full | Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network |
title_fullStr | Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network |
title_full_unstemmed | Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network |
title_short | Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network |
title_sort | action recognition using action sequences optimization and two-stream 3d dilated neural network |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9208928/ https://www.ncbi.nlm.nih.gov/pubmed/35733557 http://dx.doi.org/10.1155/2022/6608448 |
work_keys_str_mv | AT xiongxin actionrecognitionusingactionsequencesoptimizationandtwostream3ddilatedneuralnetwork AT minweidong actionrecognitionusingactionsequencesoptimizationandtwostream3ddilatedneuralnetwork AT hanqing actionrecognitionusingactionsequencesoptimizationandtwostream3ddilatedneuralnetwork AT wangqi actionrecognitionusingactionsequencesoptimizationandtwostream3ddilatedneuralnetwork AT zhacheng actionrecognitionusingactionsequencesoptimizationandtwostream3ddilatedneuralnetwork |