Cargando…

A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition

Human action recognition has been actively explored over the past two decades to further advancements in video analytics domain. Numerous research studies have been conducted to investigate the complex sequential patterns of human actions in video streams. In this paper, we propose a knowledge disti...

Descripción completa

Detalles Bibliográficos
Autores principales: Ullah, Hayat, Munir, Arslan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10142350/
https://www.ncbi.nlm.nih.gov/pubmed/37103233
http://dx.doi.org/10.3390/jimaging9040082
_version_ 1785033591912136704
author Ullah, Hayat
Munir, Arslan
author_facet Ullah, Hayat
Munir, Arslan
author_sort Ullah, Hayat
collection PubMed
description Human action recognition has been actively explored over the past two decades to further advancements in video analytics domain. Numerous research studies have been conducted to investigate the complex sequential patterns of human actions in video streams. In this paper, we propose a knowledge distillation framework, which distills spatio-temporal knowledge from a large teacher model to a lightweight student model using an offline knowledge distillation technique. The proposed offline knowledge distillation framework takes two models: a large pre-trained 3DCNN (three-dimensional convolutional neural network) teacher model and a lightweight 3DCNN student model (i.e., the teacher model is pre-trained on the same dataset on which the student model is to be trained on). During offline knowledge distillation training, the distillation algorithm trains only the student model to help enable the student model to achieve the same level of prediction accuracy as the teacher model. To evaluate the performance of the proposed method, we conduct extensive experiments on four benchmark human action datasets. The obtained quantitative results verify the efficiency and robustness of the proposed method over the state-of-the-art human action recognition methods by obtaining up to 35% improvement in accuracy over existing methods. Furthermore, we evaluate the inference time of the proposed method and compare the obtained results with the inference time of the state-of-the-art methods. Experimental results reveal that the proposed method attains an improvement of up to 50× in terms of frames per seconds (FPS) over the state-of-the-art methods. The short inference time and high accuracy make our proposed framework suitable for human activity recognition in real-time applications.
format Online
Article
Text
id pubmed-10142350
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-101423502023-04-29 A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition Ullah, Hayat Munir, Arslan J Imaging Article Human action recognition has been actively explored over the past two decades to further advancements in video analytics domain. Numerous research studies have been conducted to investigate the complex sequential patterns of human actions in video streams. In this paper, we propose a knowledge distillation framework, which distills spatio-temporal knowledge from a large teacher model to a lightweight student model using an offline knowledge distillation technique. The proposed offline knowledge distillation framework takes two models: a large pre-trained 3DCNN (three-dimensional convolutional neural network) teacher model and a lightweight 3DCNN student model (i.e., the teacher model is pre-trained on the same dataset on which the student model is to be trained on). During offline knowledge distillation training, the distillation algorithm trains only the student model to help enable the student model to achieve the same level of prediction accuracy as the teacher model. To evaluate the performance of the proposed method, we conduct extensive experiments on four benchmark human action datasets. The obtained quantitative results verify the efficiency and robustness of the proposed method over the state-of-the-art human action recognition methods by obtaining up to 35% improvement in accuracy over existing methods. Furthermore, we evaluate the inference time of the proposed method and compare the obtained results with the inference time of the state-of-the-art methods. Experimental results reveal that the proposed method attains an improvement of up to 50× in terms of frames per seconds (FPS) over the state-of-the-art methods. The short inference time and high accuracy make our proposed framework suitable for human activity recognition in real-time applications. MDPI 2023-04-14 /pmc/articles/PMC10142350/ /pubmed/37103233 http://dx.doi.org/10.3390/jimaging9040082 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Ullah, Hayat
Munir, Arslan
A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition
title A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition
title_full A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition
title_fullStr A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition
title_full_unstemmed A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition
title_short A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition
title_sort 3dcnn-based knowledge distillation framework for human activity recognition
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10142350/
https://www.ncbi.nlm.nih.gov/pubmed/37103233
http://dx.doi.org/10.3390/jimaging9040082
work_keys_str_mv AT ullahhayat a3dcnnbasedknowledgedistillationframeworkforhumanactivityrecognition
AT munirarslan a3dcnnbasedknowledgedistillationframeworkforhumanactivityrecognition
AT ullahhayat 3dcnnbasedknowledgedistillationframeworkforhumanactivityrecognition
AT munirarslan 3dcnnbasedknowledgedistillationframeworkforhumanactivityrecognition