Cargando…
An explainable and efficient deep learning framework for video anomaly detection
Deep learning-based video anomaly detection methods have drawn significant attention in the past few years due to their superior performance. However, almost all the leading methods for video anomaly detection rely on large-scale training datasets with long training times. As a result, many real-wor...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer US
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8609273/ https://www.ncbi.nlm.nih.gov/pubmed/34840519 http://dx.doi.org/10.1007/s10586-021-03439-5 |
_version_ | 1784602896134832128 |
---|---|
author | Wu, Chongke Shao, Sicong Tunc, Cihan Satam, Pratik Hariri, Salim |
author_facet | Wu, Chongke Shao, Sicong Tunc, Cihan Satam, Pratik Hariri, Salim |
author_sort | Wu, Chongke |
collection | PubMed |
description | Deep learning-based video anomaly detection methods have drawn significant attention in the past few years due to their superior performance. However, almost all the leading methods for video anomaly detection rely on large-scale training datasets with long training times. As a result, many real-world video analysis tasks are still not applicable for fast deployment. On the other hand, the leading methods cannot provide interpretability due to the uninterpretable feature representations hiding the decision-making process when anomaly detection models are considered as a black box. However, the interpretability for anomaly detection is crucial since the corresponding response to the anomalies in the video is determined by their severity and nature. To tackle these problems, this paper proposes an efficient deep learning framework for video anomaly detection and provides explanations. The proposed framework uses pre-trained deep models to extract high-level concept and context features for training denoising autoencoder (DAE), requiring little training time (i.e., within 10 s on UCSD Pedestrian datasets) while achieving comparable detection performance to the leading methods. Furthermore, this framework presents the first video anomaly detection use of combing autoencoder and SHapley Additive exPlanations (SHAP) for model interpretability. The framework can explain each anomaly detection result in surveillance videos. In the experiments, we evaluate the proposed framework's effectiveness and efficiency while also explaining anomalies behind the autoencoder’s prediction. On the USCD Pedestrian datasets, the DAE achieved 85.9% AUC with a training time of 5 s on the USCD Ped1 and 92.4% AUC with a training time of 2.9 s on the UCSD Ped2. |
format | Online Article Text |
id | pubmed-8609273 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer US |
record_format | MEDLINE/PubMed |
spelling | pubmed-86092732021-11-23 An explainable and efficient deep learning framework for video anomaly detection Wu, Chongke Shao, Sicong Tunc, Cihan Satam, Pratik Hariri, Salim Cluster Comput Article Deep learning-based video anomaly detection methods have drawn significant attention in the past few years due to their superior performance. However, almost all the leading methods for video anomaly detection rely on large-scale training datasets with long training times. As a result, many real-world video analysis tasks are still not applicable for fast deployment. On the other hand, the leading methods cannot provide interpretability due to the uninterpretable feature representations hiding the decision-making process when anomaly detection models are considered as a black box. However, the interpretability for anomaly detection is crucial since the corresponding response to the anomalies in the video is determined by their severity and nature. To tackle these problems, this paper proposes an efficient deep learning framework for video anomaly detection and provides explanations. The proposed framework uses pre-trained deep models to extract high-level concept and context features for training denoising autoencoder (DAE), requiring little training time (i.e., within 10 s on UCSD Pedestrian datasets) while achieving comparable detection performance to the leading methods. Furthermore, this framework presents the first video anomaly detection use of combing autoencoder and SHapley Additive exPlanations (SHAP) for model interpretability. The framework can explain each anomaly detection result in surveillance videos. In the experiments, we evaluate the proposed framework's effectiveness and efficiency while also explaining anomalies behind the autoencoder’s prediction. On the USCD Pedestrian datasets, the DAE achieved 85.9% AUC with a training time of 5 s on the USCD Ped1 and 92.4% AUC with a training time of 2.9 s on the UCSD Ped2. Springer US 2021-11-23 2022 /pmc/articles/PMC8609273/ /pubmed/34840519 http://dx.doi.org/10.1007/s10586-021-03439-5 Text en © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Wu, Chongke Shao, Sicong Tunc, Cihan Satam, Pratik Hariri, Salim An explainable and efficient deep learning framework for video anomaly detection |
title | An explainable and efficient deep learning framework for video anomaly detection |
title_full | An explainable and efficient deep learning framework for video anomaly detection |
title_fullStr | An explainable and efficient deep learning framework for video anomaly detection |
title_full_unstemmed | An explainable and efficient deep learning framework for video anomaly detection |
title_short | An explainable and efficient deep learning framework for video anomaly detection |
title_sort | explainable and efficient deep learning framework for video anomaly detection |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8609273/ https://www.ncbi.nlm.nih.gov/pubmed/34840519 http://dx.doi.org/10.1007/s10586-021-03439-5 |
work_keys_str_mv | AT wuchongke anexplainableandefficientdeeplearningframeworkforvideoanomalydetection AT shaosicong anexplainableandefficientdeeplearningframeworkforvideoanomalydetection AT tunccihan anexplainableandefficientdeeplearningframeworkforvideoanomalydetection AT satampratik anexplainableandefficientdeeplearningframeworkforvideoanomalydetection AT haririsalim anexplainableandefficientdeeplearningframeworkforvideoanomalydetection AT wuchongke explainableandefficientdeeplearningframeworkforvideoanomalydetection AT shaosicong explainableandefficientdeeplearningframeworkforvideoanomalydetection AT tunccihan explainableandefficientdeeplearningframeworkforvideoanomalydetection AT satampratik explainableandefficientdeeplearningframeworkforvideoanomalydetection AT haririsalim explainableandefficientdeeplearningframeworkforvideoanomalydetection |