Cargando…
A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition
Human action recognition has been actively explored over the past two decades to further advancements in video analytics domain. Numerous research studies have been conducted to investigate the complex sequential patterns of human actions in video streams. In this paper, we propose a knowledge disti...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10142350/ https://www.ncbi.nlm.nih.gov/pubmed/37103233 http://dx.doi.org/10.3390/jimaging9040082 |
_version_ | 1785033591912136704 |
---|---|
author | Ullah, Hayat Munir, Arslan |
author_facet | Ullah, Hayat Munir, Arslan |
author_sort | Ullah, Hayat |
collection | PubMed |
description | Human action recognition has been actively explored over the past two decades to further advancements in video analytics domain. Numerous research studies have been conducted to investigate the complex sequential patterns of human actions in video streams. In this paper, we propose a knowledge distillation framework, which distills spatio-temporal knowledge from a large teacher model to a lightweight student model using an offline knowledge distillation technique. The proposed offline knowledge distillation framework takes two models: a large pre-trained 3DCNN (three-dimensional convolutional neural network) teacher model and a lightweight 3DCNN student model (i.e., the teacher model is pre-trained on the same dataset on which the student model is to be trained on). During offline knowledge distillation training, the distillation algorithm trains only the student model to help enable the student model to achieve the same level of prediction accuracy as the teacher model. To evaluate the performance of the proposed method, we conduct extensive experiments on four benchmark human action datasets. The obtained quantitative results verify the efficiency and robustness of the proposed method over the state-of-the-art human action recognition methods by obtaining up to 35% improvement in accuracy over existing methods. Furthermore, we evaluate the inference time of the proposed method and compare the obtained results with the inference time of the state-of-the-art methods. Experimental results reveal that the proposed method attains an improvement of up to 50× in terms of frames per seconds (FPS) over the state-of-the-art methods. The short inference time and high accuracy make our proposed framework suitable for human activity recognition in real-time applications. |
format | Online Article Text |
id | pubmed-10142350 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-101423502023-04-29 A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition Ullah, Hayat Munir, Arslan J Imaging Article Human action recognition has been actively explored over the past two decades to further advancements in video analytics domain. Numerous research studies have been conducted to investigate the complex sequential patterns of human actions in video streams. In this paper, we propose a knowledge distillation framework, which distills spatio-temporal knowledge from a large teacher model to a lightweight student model using an offline knowledge distillation technique. The proposed offline knowledge distillation framework takes two models: a large pre-trained 3DCNN (three-dimensional convolutional neural network) teacher model and a lightweight 3DCNN student model (i.e., the teacher model is pre-trained on the same dataset on which the student model is to be trained on). During offline knowledge distillation training, the distillation algorithm trains only the student model to help enable the student model to achieve the same level of prediction accuracy as the teacher model. To evaluate the performance of the proposed method, we conduct extensive experiments on four benchmark human action datasets. The obtained quantitative results verify the efficiency and robustness of the proposed method over the state-of-the-art human action recognition methods by obtaining up to 35% improvement in accuracy over existing methods. Furthermore, we evaluate the inference time of the proposed method and compare the obtained results with the inference time of the state-of-the-art methods. Experimental results reveal that the proposed method attains an improvement of up to 50× in terms of frames per seconds (FPS) over the state-of-the-art methods. The short inference time and high accuracy make our proposed framework suitable for human activity recognition in real-time applications. MDPI 2023-04-14 /pmc/articles/PMC10142350/ /pubmed/37103233 http://dx.doi.org/10.3390/jimaging9040082 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Ullah, Hayat Munir, Arslan A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition |
title | A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition |
title_full | A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition |
title_fullStr | A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition |
title_full_unstemmed | A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition |
title_short | A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition |
title_sort | 3dcnn-based knowledge distillation framework for human activity recognition |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10142350/ https://www.ncbi.nlm.nih.gov/pubmed/37103233 http://dx.doi.org/10.3390/jimaging9040082 |
work_keys_str_mv | AT ullahhayat a3dcnnbasedknowledgedistillationframeworkforhumanactivityrecognition AT munirarslan a3dcnnbasedknowledgedistillationframeworkforhumanactivityrecognition AT ullahhayat 3dcnnbasedknowledgedistillationframeworkforhumanactivityrecognition AT munirarslan 3dcnnbasedknowledgedistillationframeworkforhumanactivityrecognition |