Cargando…

Action Recognition by an Attention-Aware Temporal Weighted Convolutional Neural Network

Research in human action recognition has accelerated significantly since the introduction of powerful machine learning tools such as Convolutional Neural Networks (CNNs). However, effective and efficient methods for incorporation of temporal information into CNNs are still being actively explored in...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Le, Zang, Jinliang, Zhang, Qilin, Niu, Zhenxing, Hua, Gang, Zheng, Nanning
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6069475/
https://www.ncbi.nlm.nih.gov/pubmed/29933555
http://dx.doi.org/10.3390/s18071979
_version_ 1783343504921985024
author Wang, Le
Zang, Jinliang
Zhang, Qilin
Niu, Zhenxing
Hua, Gang
Zheng, Nanning
author_facet Wang, Le
Zang, Jinliang
Zhang, Qilin
Niu, Zhenxing
Hua, Gang
Zheng, Nanning
author_sort Wang, Le
collection PubMed
description Research in human action recognition has accelerated significantly since the introduction of powerful machine learning tools such as Convolutional Neural Networks (CNNs). However, effective and efficient methods for incorporation of temporal information into CNNs are still being actively explored in the recent literature. Motivated by the popular recurrent attention models in the research area of natural language processing, we propose the Attention-aware Temporal Weighted CNN (ATW CNN) for action recognition in videos, which embeds a visual attention model into a temporal weighted multi-stream CNN. This attention model is simply implemented as temporal weighting yet it effectively boosts the recognition performance of video representations. Besides, each stream in the proposed ATW CNN framework is capable of end-to-end training, with both network parameters and temporal weights optimized by stochastic gradient descent (SGD) with back-propagation. Our experimental results on the UCF-101 and HMDB-51 datasets showed that the proposed attention mechanism contributes substantially to the performance gains with the more discriminative snippets by focusing on more relevant video segments.
format Online
Article
Text
id pubmed-6069475
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-60694752018-08-07 Action Recognition by an Attention-Aware Temporal Weighted Convolutional Neural Network Wang, Le Zang, Jinliang Zhang, Qilin Niu, Zhenxing Hua, Gang Zheng, Nanning Sensors (Basel) Article Research in human action recognition has accelerated significantly since the introduction of powerful machine learning tools such as Convolutional Neural Networks (CNNs). However, effective and efficient methods for incorporation of temporal information into CNNs are still being actively explored in the recent literature. Motivated by the popular recurrent attention models in the research area of natural language processing, we propose the Attention-aware Temporal Weighted CNN (ATW CNN) for action recognition in videos, which embeds a visual attention model into a temporal weighted multi-stream CNN. This attention model is simply implemented as temporal weighting yet it effectively boosts the recognition performance of video representations. Besides, each stream in the proposed ATW CNN framework is capable of end-to-end training, with both network parameters and temporal weights optimized by stochastic gradient descent (SGD) with back-propagation. Our experimental results on the UCF-101 and HMDB-51 datasets showed that the proposed attention mechanism contributes substantially to the performance gains with the more discriminative snippets by focusing on more relevant video segments. MDPI 2018-06-21 /pmc/articles/PMC6069475/ /pubmed/29933555 http://dx.doi.org/10.3390/s18071979 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wang, Le
Zang, Jinliang
Zhang, Qilin
Niu, Zhenxing
Hua, Gang
Zheng, Nanning
Action Recognition by an Attention-Aware Temporal Weighted Convolutional Neural Network
title Action Recognition by an Attention-Aware Temporal Weighted Convolutional Neural Network
title_full Action Recognition by an Attention-Aware Temporal Weighted Convolutional Neural Network
title_fullStr Action Recognition by an Attention-Aware Temporal Weighted Convolutional Neural Network
title_full_unstemmed Action Recognition by an Attention-Aware Temporal Weighted Convolutional Neural Network
title_short Action Recognition by an Attention-Aware Temporal Weighted Convolutional Neural Network
title_sort action recognition by an attention-aware temporal weighted convolutional neural network
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6069475/
https://www.ncbi.nlm.nih.gov/pubmed/29933555
http://dx.doi.org/10.3390/s18071979
work_keys_str_mv AT wangle actionrecognitionbyanattentionawaretemporalweightedconvolutionalneuralnetwork
AT zangjinliang actionrecognitionbyanattentionawaretemporalweightedconvolutionalneuralnetwork
AT zhangqilin actionrecognitionbyanattentionawaretemporalweightedconvolutionalneuralnetwork
AT niuzhenxing actionrecognitionbyanattentionawaretemporalweightedconvolutionalneuralnetwork
AT huagang actionrecognitionbyanattentionawaretemporalweightedconvolutionalneuralnetwork
AT zhengnanning actionrecognitionbyanattentionawaretemporalweightedconvolutionalneuralnetwork