Cargando…

Multi-Channel Generative Framework and Supervised Learning for Anomaly Detection in Surveillance Videos

Recently, most state-of-the-art anomaly detection methods are based on apparent motion and appearance reconstruction networks and use error estimation between generated and real information as detection features. These approaches achieve promising results by only using normal samples for training st...

Descripción completa

Detalles Bibliográficos
Autores principales: Vu, Tuan-Hung, Boonaert, Jacques, Ambellouis, Sebastien, Taleb-Ahmed, Abdelmalik
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8124646/
https://www.ncbi.nlm.nih.gov/pubmed/34063625
http://dx.doi.org/10.3390/s21093179
_version_ 1783693268420132864
author Vu, Tuan-Hung
Boonaert, Jacques
Ambellouis, Sebastien
Taleb-Ahmed, Abdelmalik
author_facet Vu, Tuan-Hung
Boonaert, Jacques
Ambellouis, Sebastien
Taleb-Ahmed, Abdelmalik
author_sort Vu, Tuan-Hung
collection PubMed
description Recently, most state-of-the-art anomaly detection methods are based on apparent motion and appearance reconstruction networks and use error estimation between generated and real information as detection features. These approaches achieve promising results by only using normal samples for training steps. In this paper, our contributions are two-fold. On the one hand, we propose a flexible multi-channel framework to generate multi-type frame-level features. On the other hand, we study how it is possible to improve the detection performance by supervised learning. The multi-channel framework is based on four Conditional GANs (CGANs) taking various type of appearance and motion information as input and producing prediction information as output. These CGANs provide a better feature space to represent the distinction between normal and abnormal events. Then, the difference between those generative and ground-truth information is encoded by Peak Signal-to-Noise Ratio (PSNR). We propose to classify those features in a classical supervised scenario by building a small training set with some abnormal samples of the original test set of the dataset. The binary Support Vector Machine (SVM) is applied for frame-level anomaly detection. Finally, we use Mask R-CNN as detector to perform object-centric anomaly localization. Our solution is largely evaluated on Avenue, Ped1, Ped2, and ShanghaiTech datasets. Our experiment results demonstrate that PSNR features combined with supervised SVM are better than error maps computed by previous methods. We achieve state-of-the-art performance for frame-level AUC on Ped1 and ShanghaiTech. Especially, for the most challenging Shanghaitech dataset, a supervised training model outperforms up to 9% the state-of-the-art an unsupervised strategy.
format Online
Article
Text
id pubmed-8124646
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-81246462021-05-17 Multi-Channel Generative Framework and Supervised Learning for Anomaly Detection in Surveillance Videos Vu, Tuan-Hung Boonaert, Jacques Ambellouis, Sebastien Taleb-Ahmed, Abdelmalik Sensors (Basel) Article Recently, most state-of-the-art anomaly detection methods are based on apparent motion and appearance reconstruction networks and use error estimation between generated and real information as detection features. These approaches achieve promising results by only using normal samples for training steps. In this paper, our contributions are two-fold. On the one hand, we propose a flexible multi-channel framework to generate multi-type frame-level features. On the other hand, we study how it is possible to improve the detection performance by supervised learning. The multi-channel framework is based on four Conditional GANs (CGANs) taking various type of appearance and motion information as input and producing prediction information as output. These CGANs provide a better feature space to represent the distinction between normal and abnormal events. Then, the difference between those generative and ground-truth information is encoded by Peak Signal-to-Noise Ratio (PSNR). We propose to classify those features in a classical supervised scenario by building a small training set with some abnormal samples of the original test set of the dataset. The binary Support Vector Machine (SVM) is applied for frame-level anomaly detection. Finally, we use Mask R-CNN as detector to perform object-centric anomaly localization. Our solution is largely evaluated on Avenue, Ped1, Ped2, and ShanghaiTech datasets. Our experiment results demonstrate that PSNR features combined with supervised SVM are better than error maps computed by previous methods. We achieve state-of-the-art performance for frame-level AUC on Ped1 and ShanghaiTech. Especially, for the most challenging Shanghaitech dataset, a supervised training model outperforms up to 9% the state-of-the-art an unsupervised strategy. MDPI 2021-05-03 /pmc/articles/PMC8124646/ /pubmed/34063625 http://dx.doi.org/10.3390/s21093179 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Vu, Tuan-Hung
Boonaert, Jacques
Ambellouis, Sebastien
Taleb-Ahmed, Abdelmalik
Multi-Channel Generative Framework and Supervised Learning for Anomaly Detection in Surveillance Videos
title Multi-Channel Generative Framework and Supervised Learning for Anomaly Detection in Surveillance Videos
title_full Multi-Channel Generative Framework and Supervised Learning for Anomaly Detection in Surveillance Videos
title_fullStr Multi-Channel Generative Framework and Supervised Learning for Anomaly Detection in Surveillance Videos
title_full_unstemmed Multi-Channel Generative Framework and Supervised Learning for Anomaly Detection in Surveillance Videos
title_short Multi-Channel Generative Framework and Supervised Learning for Anomaly Detection in Surveillance Videos
title_sort multi-channel generative framework and supervised learning for anomaly detection in surveillance videos
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8124646/
https://www.ncbi.nlm.nih.gov/pubmed/34063625
http://dx.doi.org/10.3390/s21093179
work_keys_str_mv AT vutuanhung multichannelgenerativeframeworkandsupervisedlearningforanomalydetectioninsurveillancevideos
AT boonaertjacques multichannelgenerativeframeworkandsupervisedlearningforanomalydetectioninsurveillancevideos
AT ambellouissebastien multichannelgenerativeframeworkandsupervisedlearningforanomalydetectioninsurveillancevideos
AT talebahmedabdelmalik multichannelgenerativeframeworkandsupervisedlearningforanomalydetectioninsurveillancevideos