Cargando…

Deep Learning of Fuzzy Weighted Multi-Resolution Depth Motion Maps with Spatial Feature Fusion for Action Recognition

Human action recognition (HAR) is an important yet challenging task. This paper presents a novel method. First, fuzzy weight functions are used in computations of depth motion maps (DMMs). Multiple length motion information is also used. These features are referred to as fuzzy weighted multi-resolut...

Descripción completa

Detalles Bibliográficos
Autores principales:	Al-Faris, Mahmoud, Chiverton, John, Yang, Yanyan, Ndzi, David
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2019
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8321166/ https://www.ncbi.nlm.nih.gov/pubmed/34460648 http://dx.doi.org/10.3390/jimaging5100082

_version_	1783730786363506688
author	Al-Faris, Mahmoud Chiverton, John Yang, Yanyan Ndzi, David
author_facet	Al-Faris, Mahmoud Chiverton, John Yang, Yanyan Ndzi, David
author_sort	Al-Faris, Mahmoud
collection	PubMed
description	Human action recognition (HAR) is an important yet challenging task. This paper presents a novel method. First, fuzzy weight functions are used in computations of depth motion maps (DMMs). Multiple length motion information is also used. These features are referred to as fuzzy weighted multi-resolution DMMs (FWMDMMs). This formulation allows for various aspects of individual actions to be emphasized. It also helps to characterise the importance of the temporal dimension. This is important to help overcome, e.g., variations in time over which a single type of action might be performed. A deep convolutional neural network (CNN) motion model is created and trained to extract discriminative and compact features. Transfer learning is also used to extract spatial information from RGB and depth data using the AlexNet network. Different late fusion techniques are then investigated to fuse the deep motion model with the spatial network. The result is a spatial temporal HAR model. The developed approach is capable of recognising both human action and human–object interaction. Three public domain datasets are used to evaluate the proposed solution. The experimental results demonstrate the robustness of this approach compared with state-of-the art algorithms.
format	Online Article Text
id	pubmed-8321166
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-83211662021-08-26 Deep Learning of Fuzzy Weighted Multi-Resolution Depth Motion Maps with Spatial Feature Fusion for Action Recognition Al-Faris, Mahmoud Chiverton, John Yang, Yanyan Ndzi, David J Imaging Article Human action recognition (HAR) is an important yet challenging task. This paper presents a novel method. First, fuzzy weight functions are used in computations of depth motion maps (DMMs). Multiple length motion information is also used. These features are referred to as fuzzy weighted multi-resolution DMMs (FWMDMMs). This formulation allows for various aspects of individual actions to be emphasized. It also helps to characterise the importance of the temporal dimension. This is important to help overcome, e.g., variations in time over which a single type of action might be performed. A deep convolutional neural network (CNN) motion model is created and trained to extract discriminative and compact features. Transfer learning is also used to extract spatial information from RGB and depth data using the AlexNet network. Different late fusion techniques are then investigated to fuse the deep motion model with the spatial network. The result is a spatial temporal HAR model. The developed approach is capable of recognising both human action and human–object interaction. Three public domain datasets are used to evaluate the proposed solution. The experimental results demonstrate the robustness of this approach compared with state-of-the art algorithms. MDPI 2019-10-21 /pmc/articles/PMC8321166/ /pubmed/34460648 http://dx.doi.org/10.3390/jimaging5100082 Text en © 2019 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ).
spellingShingle	Article Al-Faris, Mahmoud Chiverton, John Yang, Yanyan Ndzi, David Deep Learning of Fuzzy Weighted Multi-Resolution Depth Motion Maps with Spatial Feature Fusion for Action Recognition
title	Deep Learning of Fuzzy Weighted Multi-Resolution Depth Motion Maps with Spatial Feature Fusion for Action Recognition
title_full	Deep Learning of Fuzzy Weighted Multi-Resolution Depth Motion Maps with Spatial Feature Fusion for Action Recognition
title_fullStr	Deep Learning of Fuzzy Weighted Multi-Resolution Depth Motion Maps with Spatial Feature Fusion for Action Recognition
title_full_unstemmed	Deep Learning of Fuzzy Weighted Multi-Resolution Depth Motion Maps with Spatial Feature Fusion for Action Recognition
title_short	Deep Learning of Fuzzy Weighted Multi-Resolution Depth Motion Maps with Spatial Feature Fusion for Action Recognition
title_sort	deep learning of fuzzy weighted multi-resolution depth motion maps with spatial feature fusion for action recognition
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8321166/ https://www.ncbi.nlm.nih.gov/pubmed/34460648 http://dx.doi.org/10.3390/jimaging5100082
work_keys_str_mv	AT alfarismahmoud deeplearningoffuzzyweightedmultiresolutiondepthmotionmapswithspatialfeaturefusionforactionrecognition AT chivertonjohn deeplearningoffuzzyweightedmultiresolutiondepthmotionmapswithspatialfeaturefusionforactionrecognition AT yangyanyan deeplearningoffuzzyweightedmultiresolutiondepthmotionmapswithspatialfeaturefusionforactionrecognition AT ndzidavid deeplearningoffuzzyweightedmultiresolutiondepthmotionmapswithspatialfeaturefusionforactionrecognition

Deep Learning of Fuzzy Weighted Multi-Resolution Depth Motion Maps with Spatial Feature Fusion for Action Recognition

Ejemplares similares