Cargando…

An explainable and efficient deep learning framework for video anomaly detection

Deep learning-based video anomaly detection methods have drawn significant attention in the past few years due to their superior performance. However, almost all the leading methods for video anomaly detection rely on large-scale training datasets with long training times. As a result, many real-wor...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Chongke, Shao, Sicong, Tunc, Cihan, Satam, Pratik, Hariri, Salim
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8609273/
https://www.ncbi.nlm.nih.gov/pubmed/34840519
http://dx.doi.org/10.1007/s10586-021-03439-5
_version_ 1784602896134832128
author Wu, Chongke
Shao, Sicong
Tunc, Cihan
Satam, Pratik
Hariri, Salim
author_facet Wu, Chongke
Shao, Sicong
Tunc, Cihan
Satam, Pratik
Hariri, Salim
author_sort Wu, Chongke
collection PubMed
description Deep learning-based video anomaly detection methods have drawn significant attention in the past few years due to their superior performance. However, almost all the leading methods for video anomaly detection rely on large-scale training datasets with long training times. As a result, many real-world video analysis tasks are still not applicable for fast deployment. On the other hand, the leading methods cannot provide interpretability due to the uninterpretable feature representations hiding the decision-making process when anomaly detection models are considered as a black box. However, the interpretability for anomaly detection is crucial since the corresponding response to the anomalies in the video is determined by their severity and nature. To tackle these problems, this paper proposes an efficient deep learning framework for video anomaly detection and provides explanations. The proposed framework uses pre-trained deep models to extract high-level concept and context features for training denoising autoencoder (DAE), requiring little training time (i.e., within 10 s on UCSD Pedestrian datasets) while achieving comparable detection performance to the leading methods. Furthermore, this framework presents the first video anomaly detection use of combing autoencoder and SHapley Additive exPlanations (SHAP) for model interpretability. The framework can explain each anomaly detection result in surveillance videos. In the experiments, we evaluate the proposed framework's effectiveness and efficiency while also explaining anomalies behind the autoencoder’s prediction. On the USCD Pedestrian datasets, the DAE achieved 85.9% AUC with a training time of 5 s on the USCD Ped1 and 92.4% AUC with a training time of 2.9 s on the UCSD Ped2.
format Online
Article
Text
id pubmed-8609273
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-86092732021-11-23 An explainable and efficient deep learning framework for video anomaly detection Wu, Chongke Shao, Sicong Tunc, Cihan Satam, Pratik Hariri, Salim Cluster Comput Article Deep learning-based video anomaly detection methods have drawn significant attention in the past few years due to their superior performance. However, almost all the leading methods for video anomaly detection rely on large-scale training datasets with long training times. As a result, many real-world video analysis tasks are still not applicable for fast deployment. On the other hand, the leading methods cannot provide interpretability due to the uninterpretable feature representations hiding the decision-making process when anomaly detection models are considered as a black box. However, the interpretability for anomaly detection is crucial since the corresponding response to the anomalies in the video is determined by their severity and nature. To tackle these problems, this paper proposes an efficient deep learning framework for video anomaly detection and provides explanations. The proposed framework uses pre-trained deep models to extract high-level concept and context features for training denoising autoencoder (DAE), requiring little training time (i.e., within 10 s on UCSD Pedestrian datasets) while achieving comparable detection performance to the leading methods. Furthermore, this framework presents the first video anomaly detection use of combing autoencoder and SHapley Additive exPlanations (SHAP) for model interpretability. The framework can explain each anomaly detection result in surveillance videos. In the experiments, we evaluate the proposed framework's effectiveness and efficiency while also explaining anomalies behind the autoencoder’s prediction. On the USCD Pedestrian datasets, the DAE achieved 85.9% AUC with a training time of 5 s on the USCD Ped1 and 92.4% AUC with a training time of 2.9 s on the UCSD Ped2. Springer US 2021-11-23 2022 /pmc/articles/PMC8609273/ /pubmed/34840519 http://dx.doi.org/10.1007/s10586-021-03439-5 Text en © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Wu, Chongke
Shao, Sicong
Tunc, Cihan
Satam, Pratik
Hariri, Salim
An explainable and efficient deep learning framework for video anomaly detection
title An explainable and efficient deep learning framework for video anomaly detection
title_full An explainable and efficient deep learning framework for video anomaly detection
title_fullStr An explainable and efficient deep learning framework for video anomaly detection
title_full_unstemmed An explainable and efficient deep learning framework for video anomaly detection
title_short An explainable and efficient deep learning framework for video anomaly detection
title_sort explainable and efficient deep learning framework for video anomaly detection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8609273/
https://www.ncbi.nlm.nih.gov/pubmed/34840519
http://dx.doi.org/10.1007/s10586-021-03439-5
work_keys_str_mv AT wuchongke anexplainableandefficientdeeplearningframeworkforvideoanomalydetection
AT shaosicong anexplainableandefficientdeeplearningframeworkforvideoanomalydetection
AT tunccihan anexplainableandefficientdeeplearningframeworkforvideoanomalydetection
AT satampratik anexplainableandefficientdeeplearningframeworkforvideoanomalydetection
AT haririsalim anexplainableandefficientdeeplearningframeworkforvideoanomalydetection
AT wuchongke explainableandefficientdeeplearningframeworkforvideoanomalydetection
AT shaosicong explainableandefficientdeeplearningframeworkforvideoanomalydetection
AT tunccihan explainableandefficientdeeplearningframeworkforvideoanomalydetection
AT satampratik explainableandefficientdeeplearningframeworkforvideoanomalydetection
AT haririsalim explainableandefficientdeeplearningframeworkforvideoanomalydetection