Cargando…

Gait-CNN-ViT: Multi-Model Gait Recognition with Convolutional Neural Networks and Vision Transformer

Gait recognition, the task of identifying an individual based on their unique walking style, can be difficult because walking styles can be influenced by external factors such as clothing, viewing angle, and carrying conditions. To address these challenges, this paper proposes a multi-model gait rec...

Descripción completa

Detalles Bibliográficos
Autores principales: Mogan, Jashila Nair, Lee, Chin Poo, Lim, Kian Ming, Ali, Mohammed, Alqahtani, Ali
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10143319/
https://www.ncbi.nlm.nih.gov/pubmed/37112147
http://dx.doi.org/10.3390/s23083809
_version_ 1785033823346491392
author Mogan, Jashila Nair
Lee, Chin Poo
Lim, Kian Ming
Ali, Mohammed
Alqahtani, Ali
author_facet Mogan, Jashila Nair
Lee, Chin Poo
Lim, Kian Ming
Ali, Mohammed
Alqahtani, Ali
author_sort Mogan, Jashila Nair
collection PubMed
description Gait recognition, the task of identifying an individual based on their unique walking style, can be difficult because walking styles can be influenced by external factors such as clothing, viewing angle, and carrying conditions. To address these challenges, this paper proposes a multi-model gait recognition system that integrates Convolutional Neural Networks (CNNs) and Vision Transformer. The first step in the process is to obtain a gait energy image, which is achieved by applying an averaging technique to a gait cycle. The gait energy image is then fed into three different models, DenseNet-201, VGG-16, and a Vision Transformer. These models are pre-trained and fine-tuned to encode the salient gait features that are specific to an individual’s walking style. Each model provides prediction scores for the classes based on the encoded features, and these scores are then summed and averaged to produce the final class label. The performance of this multi-model gait recognition system was evaluated on three datasets, CASIA-B, OU-ISIR dataset D, and OU-ISIR Large Population dataset. The experimental results showed substantial improvement compared to existing methods on all three datasets. The integration of CNNs and ViT allows the system to learn both the pre-defined and distinct features, providing a robust solution for gait recognition even under the influence of covariates.
format Online
Article
Text
id pubmed-10143319
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-101433192023-04-29 Gait-CNN-ViT: Multi-Model Gait Recognition with Convolutional Neural Networks and Vision Transformer Mogan, Jashila Nair Lee, Chin Poo Lim, Kian Ming Ali, Mohammed Alqahtani, Ali Sensors (Basel) Article Gait recognition, the task of identifying an individual based on their unique walking style, can be difficult because walking styles can be influenced by external factors such as clothing, viewing angle, and carrying conditions. To address these challenges, this paper proposes a multi-model gait recognition system that integrates Convolutional Neural Networks (CNNs) and Vision Transformer. The first step in the process is to obtain a gait energy image, which is achieved by applying an averaging technique to a gait cycle. The gait energy image is then fed into three different models, DenseNet-201, VGG-16, and a Vision Transformer. These models are pre-trained and fine-tuned to encode the salient gait features that are specific to an individual’s walking style. Each model provides prediction scores for the classes based on the encoded features, and these scores are then summed and averaged to produce the final class label. The performance of this multi-model gait recognition system was evaluated on three datasets, CASIA-B, OU-ISIR dataset D, and OU-ISIR Large Population dataset. The experimental results showed substantial improvement compared to existing methods on all three datasets. The integration of CNNs and ViT allows the system to learn both the pre-defined and distinct features, providing a robust solution for gait recognition even under the influence of covariates. MDPI 2023-04-07 /pmc/articles/PMC10143319/ /pubmed/37112147 http://dx.doi.org/10.3390/s23083809 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Mogan, Jashila Nair
Lee, Chin Poo
Lim, Kian Ming
Ali, Mohammed
Alqahtani, Ali
Gait-CNN-ViT: Multi-Model Gait Recognition with Convolutional Neural Networks and Vision Transformer
title Gait-CNN-ViT: Multi-Model Gait Recognition with Convolutional Neural Networks and Vision Transformer
title_full Gait-CNN-ViT: Multi-Model Gait Recognition with Convolutional Neural Networks and Vision Transformer
title_fullStr Gait-CNN-ViT: Multi-Model Gait Recognition with Convolutional Neural Networks and Vision Transformer
title_full_unstemmed Gait-CNN-ViT: Multi-Model Gait Recognition with Convolutional Neural Networks and Vision Transformer
title_short Gait-CNN-ViT: Multi-Model Gait Recognition with Convolutional Neural Networks and Vision Transformer
title_sort gait-cnn-vit: multi-model gait recognition with convolutional neural networks and vision transformer
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10143319/
https://www.ncbi.nlm.nih.gov/pubmed/37112147
http://dx.doi.org/10.3390/s23083809
work_keys_str_mv AT moganjashilanair gaitcnnvitmultimodelgaitrecognitionwithconvolutionalneuralnetworksandvisiontransformer
AT leechinpoo gaitcnnvitmultimodelgaitrecognitionwithconvolutionalneuralnetworksandvisiontransformer
AT limkianming gaitcnnvitmultimodelgaitrecognitionwithconvolutionalneuralnetworksandvisiontransformer
AT alimohammed gaitcnnvitmultimodelgaitrecognitionwithconvolutionalneuralnetworksandvisiontransformer
AT alqahtaniali gaitcnnvitmultimodelgaitrecognitionwithconvolutionalneuralnetworksandvisiontransformer