Cargando…
Multi-Channel Generative Framework and Supervised Learning for Anomaly Detection in Surveillance Videos
Recently, most state-of-the-art anomaly detection methods are based on apparent motion and appearance reconstruction networks and use error estimation between generated and real information as detection features. These approaches achieve promising results by only using normal samples for training st...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8124646/ https://www.ncbi.nlm.nih.gov/pubmed/34063625 http://dx.doi.org/10.3390/s21093179 |
_version_ | 1783693268420132864 |
---|---|
author | Vu, Tuan-Hung Boonaert, Jacques Ambellouis, Sebastien Taleb-Ahmed, Abdelmalik |
author_facet | Vu, Tuan-Hung Boonaert, Jacques Ambellouis, Sebastien Taleb-Ahmed, Abdelmalik |
author_sort | Vu, Tuan-Hung |
collection | PubMed |
description | Recently, most state-of-the-art anomaly detection methods are based on apparent motion and appearance reconstruction networks and use error estimation between generated and real information as detection features. These approaches achieve promising results by only using normal samples for training steps. In this paper, our contributions are two-fold. On the one hand, we propose a flexible multi-channel framework to generate multi-type frame-level features. On the other hand, we study how it is possible to improve the detection performance by supervised learning. The multi-channel framework is based on four Conditional GANs (CGANs) taking various type of appearance and motion information as input and producing prediction information as output. These CGANs provide a better feature space to represent the distinction between normal and abnormal events. Then, the difference between those generative and ground-truth information is encoded by Peak Signal-to-Noise Ratio (PSNR). We propose to classify those features in a classical supervised scenario by building a small training set with some abnormal samples of the original test set of the dataset. The binary Support Vector Machine (SVM) is applied for frame-level anomaly detection. Finally, we use Mask R-CNN as detector to perform object-centric anomaly localization. Our solution is largely evaluated on Avenue, Ped1, Ped2, and ShanghaiTech datasets. Our experiment results demonstrate that PSNR features combined with supervised SVM are better than error maps computed by previous methods. We achieve state-of-the-art performance for frame-level AUC on Ped1 and ShanghaiTech. Especially, for the most challenging Shanghaitech dataset, a supervised training model outperforms up to 9% the state-of-the-art an unsupervised strategy. |
format | Online Article Text |
id | pubmed-8124646 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-81246462021-05-17 Multi-Channel Generative Framework and Supervised Learning for Anomaly Detection in Surveillance Videos Vu, Tuan-Hung Boonaert, Jacques Ambellouis, Sebastien Taleb-Ahmed, Abdelmalik Sensors (Basel) Article Recently, most state-of-the-art anomaly detection methods are based on apparent motion and appearance reconstruction networks and use error estimation between generated and real information as detection features. These approaches achieve promising results by only using normal samples for training steps. In this paper, our contributions are two-fold. On the one hand, we propose a flexible multi-channel framework to generate multi-type frame-level features. On the other hand, we study how it is possible to improve the detection performance by supervised learning. The multi-channel framework is based on four Conditional GANs (CGANs) taking various type of appearance and motion information as input and producing prediction information as output. These CGANs provide a better feature space to represent the distinction between normal and abnormal events. Then, the difference between those generative and ground-truth information is encoded by Peak Signal-to-Noise Ratio (PSNR). We propose to classify those features in a classical supervised scenario by building a small training set with some abnormal samples of the original test set of the dataset. The binary Support Vector Machine (SVM) is applied for frame-level anomaly detection. Finally, we use Mask R-CNN as detector to perform object-centric anomaly localization. Our solution is largely evaluated on Avenue, Ped1, Ped2, and ShanghaiTech datasets. Our experiment results demonstrate that PSNR features combined with supervised SVM are better than error maps computed by previous methods. We achieve state-of-the-art performance for frame-level AUC on Ped1 and ShanghaiTech. Especially, for the most challenging Shanghaitech dataset, a supervised training model outperforms up to 9% the state-of-the-art an unsupervised strategy. MDPI 2021-05-03 /pmc/articles/PMC8124646/ /pubmed/34063625 http://dx.doi.org/10.3390/s21093179 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Vu, Tuan-Hung Boonaert, Jacques Ambellouis, Sebastien Taleb-Ahmed, Abdelmalik Multi-Channel Generative Framework and Supervised Learning for Anomaly Detection in Surveillance Videos |
title | Multi-Channel Generative Framework and Supervised Learning for Anomaly Detection in Surveillance Videos |
title_full | Multi-Channel Generative Framework and Supervised Learning for Anomaly Detection in Surveillance Videos |
title_fullStr | Multi-Channel Generative Framework and Supervised Learning for Anomaly Detection in Surveillance Videos |
title_full_unstemmed | Multi-Channel Generative Framework and Supervised Learning for Anomaly Detection in Surveillance Videos |
title_short | Multi-Channel Generative Framework and Supervised Learning for Anomaly Detection in Surveillance Videos |
title_sort | multi-channel generative framework and supervised learning for anomaly detection in surveillance videos |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8124646/ https://www.ncbi.nlm.nih.gov/pubmed/34063625 http://dx.doi.org/10.3390/s21093179 |
work_keys_str_mv | AT vutuanhung multichannelgenerativeframeworkandsupervisedlearningforanomalydetectioninsurveillancevideos AT boonaertjacques multichannelgenerativeframeworkandsupervisedlearningforanomalydetectioninsurveillancevideos AT ambellouissebastien multichannelgenerativeframeworkandsupervisedlearningforanomalydetectioninsurveillancevideos AT talebahmedabdelmalik multichannelgenerativeframeworkandsupervisedlearningforanomalydetectioninsurveillancevideos |