Cargando…

Class-Incremental Learning on Video-Based Action Recognition by Distillation of Various Knowledge

Recognition of activities in the video is an important field in computer vision. Many successful works have been done on activity recognition and they achieved acceptable results in recent years. However, their training is completely static, meaning that all classes are taught to the system in one t...

Descripción completa

Detalles Bibliográficos
Autores principales: Maraghi, Vali Ollah, Faez, Karim
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8970923/
https://www.ncbi.nlm.nih.gov/pubmed/35371208
http://dx.doi.org/10.1155/2022/4879942
_version_ 1784679539574571008
author Maraghi, Vali Ollah
Faez, Karim
author_facet Maraghi, Vali Ollah
Faez, Karim
author_sort Maraghi, Vali Ollah
collection PubMed
description Recognition of activities in the video is an important field in computer vision. Many successful works have been done on activity recognition and they achieved acceptable results in recent years. However, their training is completely static, meaning that all classes are taught to the system in one training step. The system is only able to recognize the equivalent classes. The main disadvantage of this type of training is that if new classes need to be taught to the system, the system must be retrained from scratch and all classes retaught to the system. This specification has many challenges, such as storing and retaining data and respending training costs. We propose an approach for training the action recognition system in video data which can teach new classes to the system without the need for previous data. We will provide an incremental learning algorithm for class recognition tasks in video data. Two different approaches are combined to prevent catastrophic forgetting in the proposed algorithm. In the proposed incremental learning algorithm, two approaches are introduced and used to maintain network information in combination. These two approaches are network sharing and network knowledge distillation. We introduce a neural network architecture for action recognition to understand and represent the video data. We propose the distillation of network knowledge at the classification and feature level, which can be divided into spatial and temporal parts at the feature level. We also suggest initializing new classifiers using previous classifiers. The proposed algorithm is evaluated on the USCF101, HMDB51, and Kinetics-400 datasets. We will consider various factors such as the amount of distillation knowledge, the number of new classes and the incremental learnings stages, and their impact on the final recognition system. Finally, we will show that the proposed algorithm can teach new classes to the recognition system without forgetting the previous classes and does not need the previous data or exemplar data.
format Online
Article
Text
id pubmed-8970923
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-89709232022-04-01 Class-Incremental Learning on Video-Based Action Recognition by Distillation of Various Knowledge Maraghi, Vali Ollah Faez, Karim Comput Intell Neurosci Research Article Recognition of activities in the video is an important field in computer vision. Many successful works have been done on activity recognition and they achieved acceptable results in recent years. However, their training is completely static, meaning that all classes are taught to the system in one training step. The system is only able to recognize the equivalent classes. The main disadvantage of this type of training is that if new classes need to be taught to the system, the system must be retrained from scratch and all classes retaught to the system. This specification has many challenges, such as storing and retaining data and respending training costs. We propose an approach for training the action recognition system in video data which can teach new classes to the system without the need for previous data. We will provide an incremental learning algorithm for class recognition tasks in video data. Two different approaches are combined to prevent catastrophic forgetting in the proposed algorithm. In the proposed incremental learning algorithm, two approaches are introduced and used to maintain network information in combination. These two approaches are network sharing and network knowledge distillation. We introduce a neural network architecture for action recognition to understand and represent the video data. We propose the distillation of network knowledge at the classification and feature level, which can be divided into spatial and temporal parts at the feature level. We also suggest initializing new classifiers using previous classifiers. The proposed algorithm is evaluated on the USCF101, HMDB51, and Kinetics-400 datasets. We will consider various factors such as the amount of distillation knowledge, the number of new classes and the incremental learnings stages, and their impact on the final recognition system. Finally, we will show that the proposed algorithm can teach new classes to the recognition system without forgetting the previous classes and does not need the previous data or exemplar data. Hindawi 2022-03-24 /pmc/articles/PMC8970923/ /pubmed/35371208 http://dx.doi.org/10.1155/2022/4879942 Text en Copyright © 2022 Vali Ollah Maraghi and Karim Faez. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Maraghi, Vali Ollah
Faez, Karim
Class-Incremental Learning on Video-Based Action Recognition by Distillation of Various Knowledge
title Class-Incremental Learning on Video-Based Action Recognition by Distillation of Various Knowledge
title_full Class-Incremental Learning on Video-Based Action Recognition by Distillation of Various Knowledge
title_fullStr Class-Incremental Learning on Video-Based Action Recognition by Distillation of Various Knowledge
title_full_unstemmed Class-Incremental Learning on Video-Based Action Recognition by Distillation of Various Knowledge
title_short Class-Incremental Learning on Video-Based Action Recognition by Distillation of Various Knowledge
title_sort class-incremental learning on video-based action recognition by distillation of various knowledge
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8970923/
https://www.ncbi.nlm.nih.gov/pubmed/35371208
http://dx.doi.org/10.1155/2022/4879942
work_keys_str_mv AT maraghivaliollah classincrementallearningonvideobasedactionrecognitionbydistillationofvariousknowledge
AT faezkarim classincrementallearningonvideobasedactionrecognitionbydistillationofvariousknowledge