Cargando…

A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition

Human action recognition has been actively explored over the past two decades to further advancements in video analytics domain. Numerous research studies have been conducted to investigate the complex sequential patterns of human actions in video streams. In this paper, we propose a knowledge disti...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ullah, Hayat, Munir, Arslan
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10142350/ https://www.ncbi.nlm.nih.gov/pubmed/37103233 http://dx.doi.org/10.3390/jimaging9040082

_version_	1785033591912136704
author	Ullah, Hayat Munir, Arslan
author_facet	Ullah, Hayat Munir, Arslan
author_sort	Ullah, Hayat
collection	PubMed
description	Human action recognition has been actively explored over the past two decades to further advancements in video analytics domain. Numerous research studies have been conducted to investigate the complex sequential patterns of human actions in video streams. In this paper, we propose a knowledge distillation framework, which distills spatio-temporal knowledge from a large teacher model to a lightweight student model using an offline knowledge distillation technique. The proposed offline knowledge distillation framework takes two models: a large pre-trained 3DCNN (three-dimensional convolutional neural network) teacher model and a lightweight 3DCNN student model (i.e., the teacher model is pre-trained on the same dataset on which the student model is to be trained on). During offline knowledge distillation training, the distillation algorithm trains only the student model to help enable the student model to achieve the same level of prediction accuracy as the teacher model. To evaluate the performance of the proposed method, we conduct extensive experiments on four benchmark human action datasets. The obtained quantitative results verify the efficiency and robustness of the proposed method over the state-of-the-art human action recognition methods by obtaining up to 35% improvement in accuracy over existing methods. Furthermore, we evaluate the inference time of the proposed method and compare the obtained results with the inference time of the state-of-the-art methods. Experimental results reveal that the proposed method attains an improvement of up to 50× in terms of frames per seconds (FPS) over the state-of-the-art methods. The short inference time and high accuracy make our proposed framework suitable for human activity recognition in real-time applications.
format	Online Article Text
id	pubmed-10142350
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-101423502023-04-29 A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition Ullah, Hayat Munir, Arslan J Imaging Article Human action recognition has been actively explored over the past two decades to further advancements in video analytics domain. Numerous research studies have been conducted to investigate the complex sequential patterns of human actions in video streams. In this paper, we propose a knowledge distillation framework, which distills spatio-temporal knowledge from a large teacher model to a lightweight student model using an offline knowledge distillation technique. The proposed offline knowledge distillation framework takes two models: a large pre-trained 3DCNN (three-dimensional convolutional neural network) teacher model and a lightweight 3DCNN student model (i.e., the teacher model is pre-trained on the same dataset on which the student model is to be trained on). During offline knowledge distillation training, the distillation algorithm trains only the student model to help enable the student model to achieve the same level of prediction accuracy as the teacher model. To evaluate the performance of the proposed method, we conduct extensive experiments on four benchmark human action datasets. The obtained quantitative results verify the efficiency and robustness of the proposed method over the state-of-the-art human action recognition methods by obtaining up to 35% improvement in accuracy over existing methods. Furthermore, we evaluate the inference time of the proposed method and compare the obtained results with the inference time of the state-of-the-art methods. Experimental results reveal that the proposed method attains an improvement of up to 50× in terms of frames per seconds (FPS) over the state-of-the-art methods. The short inference time and high accuracy make our proposed framework suitable for human activity recognition in real-time applications. MDPI 2023-04-14 /pmc/articles/PMC10142350/ /pubmed/37103233 http://dx.doi.org/10.3390/jimaging9040082 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Ullah, Hayat Munir, Arslan A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition
title	A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition
title_full	A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition
title_fullStr	A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition
title_full_unstemmed	A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition
title_short	A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition
title_sort	3dcnn-based knowledge distillation framework for human activity recognition
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10142350/ https://www.ncbi.nlm.nih.gov/pubmed/37103233 http://dx.doi.org/10.3390/jimaging9040082
work_keys_str_mv	AT ullahhayat a3dcnnbasedknowledgedistillationframeworkforhumanactivityrecognition AT munirarslan a3dcnnbasedknowledgedistillationframeworkforhumanactivityrecognition AT ullahhayat 3dcnnbasedknowledgedistillationframeworkforhumanactivityrecognition AT munirarslan 3dcnnbasedknowledgedistillationframeworkforhumanactivityrecognition

A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition

Ejemplares similares