Cargando…

Gait Recognition and Understanding Based on Hierarchical Temporal Memory Using 3D Gait Semantic Folding

Gait recognition and understanding systems have shown a wide-ranging application prospect. However, their use of unstructured data from image and video has affected their performance, e.g., they are easily influenced by multi-views, occlusion, clothes, and object carrying conditions. This paper addr...

Descripción completa

Detalles Bibliográficos
Autores principales: Luo, Jian, Tjahjadi, Tardi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7146167/
https://www.ncbi.nlm.nih.gov/pubmed/32188067
http://dx.doi.org/10.3390/s20061646
_version_ 1783520137909895168
author Luo, Jian
Tjahjadi, Tardi
author_facet Luo, Jian
Tjahjadi, Tardi
author_sort Luo, Jian
collection PubMed
description Gait recognition and understanding systems have shown a wide-ranging application prospect. However, their use of unstructured data from image and video has affected their performance, e.g., they are easily influenced by multi-views, occlusion, clothes, and object carrying conditions. This paper addresses these problems using a realistic 3-dimensional (3D) human structural data and sequential pattern learning framework with top-down attention modulating mechanism based on Hierarchical Temporal Memory (HTM). First, an accurate 2-dimensional (2D) to 3D human body pose and shape semantic parameters estimation method is proposed, which exploits the advantages of an instance-level body parsing model and a virtual dressing method. Second, by using gait semantic folding, the estimated body parameters are encoded using a sparse 2D matrix to construct the structural gait semantic image. In order to achieve time-based gait recognition, an HTM Network is constructed to obtain the sequence-level gait sparse distribution representations (SL-GSDRs). A top-down attention mechanism is introduced to deal with various conditions including multi-views by refining the SL-GSDRs, according to prior knowledge. The proposed gait learning model not only aids gait recognition tasks to overcome the difficulties in real application scenarios but also provides the structured gait semantic images for visual cognition. Experimental analyses on CMU MoBo, CASIA B, TUM-IITKGP, and KY4D datasets show a significant performance gain in terms of accuracy and robustness.
format Online
Article
Text
id pubmed-7146167
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-71461672020-04-15 Gait Recognition and Understanding Based on Hierarchical Temporal Memory Using 3D Gait Semantic Folding Luo, Jian Tjahjadi, Tardi Sensors (Basel) Article Gait recognition and understanding systems have shown a wide-ranging application prospect. However, their use of unstructured data from image and video has affected their performance, e.g., they are easily influenced by multi-views, occlusion, clothes, and object carrying conditions. This paper addresses these problems using a realistic 3-dimensional (3D) human structural data and sequential pattern learning framework with top-down attention modulating mechanism based on Hierarchical Temporal Memory (HTM). First, an accurate 2-dimensional (2D) to 3D human body pose and shape semantic parameters estimation method is proposed, which exploits the advantages of an instance-level body parsing model and a virtual dressing method. Second, by using gait semantic folding, the estimated body parameters are encoded using a sparse 2D matrix to construct the structural gait semantic image. In order to achieve time-based gait recognition, an HTM Network is constructed to obtain the sequence-level gait sparse distribution representations (SL-GSDRs). A top-down attention mechanism is introduced to deal with various conditions including multi-views by refining the SL-GSDRs, according to prior knowledge. The proposed gait learning model not only aids gait recognition tasks to overcome the difficulties in real application scenarios but also provides the structured gait semantic images for visual cognition. Experimental analyses on CMU MoBo, CASIA B, TUM-IITKGP, and KY4D datasets show a significant performance gain in terms of accuracy and robustness. MDPI 2020-03-16 /pmc/articles/PMC7146167/ /pubmed/32188067 http://dx.doi.org/10.3390/s20061646 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Luo, Jian
Tjahjadi, Tardi
Gait Recognition and Understanding Based on Hierarchical Temporal Memory Using 3D Gait Semantic Folding
title Gait Recognition and Understanding Based on Hierarchical Temporal Memory Using 3D Gait Semantic Folding
title_full Gait Recognition and Understanding Based on Hierarchical Temporal Memory Using 3D Gait Semantic Folding
title_fullStr Gait Recognition and Understanding Based on Hierarchical Temporal Memory Using 3D Gait Semantic Folding
title_full_unstemmed Gait Recognition and Understanding Based on Hierarchical Temporal Memory Using 3D Gait Semantic Folding
title_short Gait Recognition and Understanding Based on Hierarchical Temporal Memory Using 3D Gait Semantic Folding
title_sort gait recognition and understanding based on hierarchical temporal memory using 3d gait semantic folding
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7146167/
https://www.ncbi.nlm.nih.gov/pubmed/32188067
http://dx.doi.org/10.3390/s20061646
work_keys_str_mv AT luojian gaitrecognitionandunderstandingbasedonhierarchicaltemporalmemoryusing3dgaitsemanticfolding
AT tjahjaditardi gaitrecognitionandunderstandingbasedonhierarchicaltemporalmemoryusing3dgaitsemanticfolding