Cargando…
Gait Recognition and Understanding Based on Hierarchical Temporal Memory Using 3D Gait Semantic Folding
Gait recognition and understanding systems have shown a wide-ranging application prospect. However, their use of unstructured data from image and video has affected their performance, e.g., they are easily influenced by multi-views, occlusion, clothes, and object carrying conditions. This paper addr...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7146167/ https://www.ncbi.nlm.nih.gov/pubmed/32188067 http://dx.doi.org/10.3390/s20061646 |
_version_ | 1783520137909895168 |
---|---|
author | Luo, Jian Tjahjadi, Tardi |
author_facet | Luo, Jian Tjahjadi, Tardi |
author_sort | Luo, Jian |
collection | PubMed |
description | Gait recognition and understanding systems have shown a wide-ranging application prospect. However, their use of unstructured data from image and video has affected their performance, e.g., they are easily influenced by multi-views, occlusion, clothes, and object carrying conditions. This paper addresses these problems using a realistic 3-dimensional (3D) human structural data and sequential pattern learning framework with top-down attention modulating mechanism based on Hierarchical Temporal Memory (HTM). First, an accurate 2-dimensional (2D) to 3D human body pose and shape semantic parameters estimation method is proposed, which exploits the advantages of an instance-level body parsing model and a virtual dressing method. Second, by using gait semantic folding, the estimated body parameters are encoded using a sparse 2D matrix to construct the structural gait semantic image. In order to achieve time-based gait recognition, an HTM Network is constructed to obtain the sequence-level gait sparse distribution representations (SL-GSDRs). A top-down attention mechanism is introduced to deal with various conditions including multi-views by refining the SL-GSDRs, according to prior knowledge. The proposed gait learning model not only aids gait recognition tasks to overcome the difficulties in real application scenarios but also provides the structured gait semantic images for visual cognition. Experimental analyses on CMU MoBo, CASIA B, TUM-IITKGP, and KY4D datasets show a significant performance gain in terms of accuracy and robustness. |
format | Online Article Text |
id | pubmed-7146167 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-71461672020-04-15 Gait Recognition and Understanding Based on Hierarchical Temporal Memory Using 3D Gait Semantic Folding Luo, Jian Tjahjadi, Tardi Sensors (Basel) Article Gait recognition and understanding systems have shown a wide-ranging application prospect. However, their use of unstructured data from image and video has affected their performance, e.g., they are easily influenced by multi-views, occlusion, clothes, and object carrying conditions. This paper addresses these problems using a realistic 3-dimensional (3D) human structural data and sequential pattern learning framework with top-down attention modulating mechanism based on Hierarchical Temporal Memory (HTM). First, an accurate 2-dimensional (2D) to 3D human body pose and shape semantic parameters estimation method is proposed, which exploits the advantages of an instance-level body parsing model and a virtual dressing method. Second, by using gait semantic folding, the estimated body parameters are encoded using a sparse 2D matrix to construct the structural gait semantic image. In order to achieve time-based gait recognition, an HTM Network is constructed to obtain the sequence-level gait sparse distribution representations (SL-GSDRs). A top-down attention mechanism is introduced to deal with various conditions including multi-views by refining the SL-GSDRs, according to prior knowledge. The proposed gait learning model not only aids gait recognition tasks to overcome the difficulties in real application scenarios but also provides the structured gait semantic images for visual cognition. Experimental analyses on CMU MoBo, CASIA B, TUM-IITKGP, and KY4D datasets show a significant performance gain in terms of accuracy and robustness. MDPI 2020-03-16 /pmc/articles/PMC7146167/ /pubmed/32188067 http://dx.doi.org/10.3390/s20061646 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Luo, Jian Tjahjadi, Tardi Gait Recognition and Understanding Based on Hierarchical Temporal Memory Using 3D Gait Semantic Folding |
title | Gait Recognition and Understanding Based on Hierarchical Temporal Memory Using 3D Gait Semantic Folding |
title_full | Gait Recognition and Understanding Based on Hierarchical Temporal Memory Using 3D Gait Semantic Folding |
title_fullStr | Gait Recognition and Understanding Based on Hierarchical Temporal Memory Using 3D Gait Semantic Folding |
title_full_unstemmed | Gait Recognition and Understanding Based on Hierarchical Temporal Memory Using 3D Gait Semantic Folding |
title_short | Gait Recognition and Understanding Based on Hierarchical Temporal Memory Using 3D Gait Semantic Folding |
title_sort | gait recognition and understanding based on hierarchical temporal memory using 3d gait semantic folding |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7146167/ https://www.ncbi.nlm.nih.gov/pubmed/32188067 http://dx.doi.org/10.3390/s20061646 |
work_keys_str_mv | AT luojian gaitrecognitionandunderstandingbasedonhierarchicaltemporalmemoryusing3dgaitsemanticfolding AT tjahjaditardi gaitrecognitionandunderstandingbasedonhierarchicaltemporalmemoryusing3dgaitsemanticfolding |