Cargando…

ASNet: Auto-Augmented Siamese Neural Network for Action Recognition

Human action recognition methods in videos based on deep convolutional neural networks usually use random cropping or its variants for data augmentation. However, this traditional data augmentation approach may generate many non-informative samples (video patches covering only a small part of the fo...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhang, Yujia, Po, Lai-Man, Xiong, Jingjing, REHMAN, Yasar Abbas Ur, Cheung, Kwok-Wai
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8309510/ https://www.ncbi.nlm.nih.gov/pubmed/34300460 http://dx.doi.org/10.3390/s21144720

_version_	1783728538531135488
author	Zhang, Yujia Po, Lai-Man Xiong, Jingjing REHMAN, Yasar Abbas Ur Cheung, Kwok-Wai
author_facet	Zhang, Yujia Po, Lai-Man Xiong, Jingjing REHMAN, Yasar Abbas Ur Cheung, Kwok-Wai
author_sort	Zhang, Yujia
collection	PubMed
description	Human action recognition methods in videos based on deep convolutional neural networks usually use random cropping or its variants for data augmentation. However, this traditional data augmentation approach may generate many non-informative samples (video patches covering only a small part of the foreground or only the background) that are not related to a specific action. These samples can be regarded as noisy samples with incorrect labels, which reduces the overall action recognition performance. In this paper, we attempt to mitigate the impact of noisy samples by proposing an Auto-augmented Siamese Neural Network (ASNet). In this framework, we propose backpropagating salient patches and randomly cropped samples in the same iteration to perform gradient compensation to alleviate the adverse gradient effects of non-informative samples. Salient patches refer to the samples containing critical information for human action recognition. The generation of salient patches is formulated as a Markov decision process, and a reinforcement learning agent called SPA (Salient Patch Agent) is introduced to extract patches in a weakly supervised manner without extra labels. Extensive experiments were conducted on two well-known datasets UCF-101 and HMDB-51 to verify the effectiveness of the proposed SPA and ASNet.
format	Online Article Text
id	pubmed-8309510
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-83095102021-07-25 ASNet: Auto-Augmented Siamese Neural Network for Action Recognition Zhang, Yujia Po, Lai-Man Xiong, Jingjing REHMAN, Yasar Abbas Ur Cheung, Kwok-Wai Sensors (Basel) Article Human action recognition methods in videos based on deep convolutional neural networks usually use random cropping or its variants for data augmentation. However, this traditional data augmentation approach may generate many non-informative samples (video patches covering only a small part of the foreground or only the background) that are not related to a specific action. These samples can be regarded as noisy samples with incorrect labels, which reduces the overall action recognition performance. In this paper, we attempt to mitigate the impact of noisy samples by proposing an Auto-augmented Siamese Neural Network (ASNet). In this framework, we propose backpropagating salient patches and randomly cropped samples in the same iteration to perform gradient compensation to alleviate the adverse gradient effects of non-informative samples. Salient patches refer to the samples containing critical information for human action recognition. The generation of salient patches is formulated as a Markov decision process, and a reinforcement learning agent called SPA (Salient Patch Agent) is introduced to extract patches in a weakly supervised manner without extra labels. Extensive experiments were conducted on two well-known datasets UCF-101 and HMDB-51 to verify the effectiveness of the proposed SPA and ASNet. MDPI 2021-07-10 /pmc/articles/PMC8309510/ /pubmed/34300460 http://dx.doi.org/10.3390/s21144720 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Zhang, Yujia Po, Lai-Man Xiong, Jingjing REHMAN, Yasar Abbas Ur Cheung, Kwok-Wai ASNet: Auto-Augmented Siamese Neural Network for Action Recognition
title	ASNet: Auto-Augmented Siamese Neural Network for Action Recognition
title_full	ASNet: Auto-Augmented Siamese Neural Network for Action Recognition
title_fullStr	ASNet: Auto-Augmented Siamese Neural Network for Action Recognition
title_full_unstemmed	ASNet: Auto-Augmented Siamese Neural Network for Action Recognition
title_short	ASNet: Auto-Augmented Siamese Neural Network for Action Recognition
title_sort	asnet: auto-augmented siamese neural network for action recognition
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8309510/ https://www.ncbi.nlm.nih.gov/pubmed/34300460 http://dx.doi.org/10.3390/s21144720
work_keys_str_mv	AT zhangyujia asnetautoaugmentedsiameseneuralnetworkforactionrecognition AT polaiman asnetautoaugmentedsiameseneuralnetworkforactionrecognition AT xiongjingjing asnetautoaugmentedsiameseneuralnetworkforactionrecognition AT rehmanyasarabbasur asnetautoaugmentedsiameseneuralnetworkforactionrecognition AT cheungkwokwai asnetautoaugmentedsiameseneuralnetworkforactionrecognition

ASNet: Auto-Augmented Siamese Neural Network for Action Recognition

Ejemplares similares