Cargando…

A flexible framework for anomaly Detection via dimensionality reduction

Anomaly detection is challenging, especially for large datasets in high dimensions. Here, we explore a general anomaly detection framework based on dimensionality reduction and unsupervised clustering. DRAMA is released as a general python package that implements the general framework with a wide ra...

Descripción completa

Detalles Bibliográficos
Autores principales: Vafaei Sadr, Alireza, Bassett, Bruce A., Kunz, M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer London 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7946572/
https://www.ncbi.nlm.nih.gov/pubmed/33723477
http://dx.doi.org/10.1007/s00521-021-05839-5
_version_ 1783663080779022336
author Vafaei Sadr, Alireza
Bassett, Bruce A.
Kunz, M.
author_facet Vafaei Sadr, Alireza
Bassett, Bruce A.
Kunz, M.
author_sort Vafaei Sadr, Alireza
collection PubMed
description Anomaly detection is challenging, especially for large datasets in high dimensions. Here, we explore a general anomaly detection framework based on dimensionality reduction and unsupervised clustering. DRAMA is released as a general python package that implements the general framework with a wide range of built-in options. This approach identifies the primary prototypes in the data with anomalies detected by their large distances from the prototypes, either in the latent space or in the original, high-dimensional space. DRAMA is tested on a wide variety of simulated and real datasets, in up to 3000 dimensions, and is found to be robust and highly competitive with commonly used anomaly detection algorithms, especially in high dimensions. The flexibility of the DRAMA framework allows for significant optimization once some examples of anomalies are available, making it ideal for online anomaly detection, active learning, and highly unbalanced datasets. Besides, DRAMA naturally provides clustering of outliers for subsequent analysis.
format Online
Article
Text
id pubmed-7946572
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer London
record_format MEDLINE/PubMed
spelling pubmed-79465722021-03-11 A flexible framework for anomaly Detection via dimensionality reduction Vafaei Sadr, Alireza Bassett, Bruce A. Kunz, M. Neural Comput Appl S.I. : 2019 India Intl. Congress on Computational Intelligence Anomaly detection is challenging, especially for large datasets in high dimensions. Here, we explore a general anomaly detection framework based on dimensionality reduction and unsupervised clustering. DRAMA is released as a general python package that implements the general framework with a wide range of built-in options. This approach identifies the primary prototypes in the data with anomalies detected by their large distances from the prototypes, either in the latent space or in the original, high-dimensional space. DRAMA is tested on a wide variety of simulated and real datasets, in up to 3000 dimensions, and is found to be robust and highly competitive with commonly used anomaly detection algorithms, especially in high dimensions. The flexibility of the DRAMA framework allows for significant optimization once some examples of anomalies are available, making it ideal for online anomaly detection, active learning, and highly unbalanced datasets. Besides, DRAMA naturally provides clustering of outliers for subsequent analysis. Springer London 2021-03-11 2023 /pmc/articles/PMC7946572/ /pubmed/33723477 http://dx.doi.org/10.1007/s00521-021-05839-5 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle S.I. : 2019 India Intl. Congress on Computational Intelligence
Vafaei Sadr, Alireza
Bassett, Bruce A.
Kunz, M.
A flexible framework for anomaly Detection via dimensionality reduction
title A flexible framework for anomaly Detection via dimensionality reduction
title_full A flexible framework for anomaly Detection via dimensionality reduction
title_fullStr A flexible framework for anomaly Detection via dimensionality reduction
title_full_unstemmed A flexible framework for anomaly Detection via dimensionality reduction
title_short A flexible framework for anomaly Detection via dimensionality reduction
title_sort flexible framework for anomaly detection via dimensionality reduction
topic S.I. : 2019 India Intl. Congress on Computational Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7946572/
https://www.ncbi.nlm.nih.gov/pubmed/33723477
http://dx.doi.org/10.1007/s00521-021-05839-5
work_keys_str_mv AT vafaeisadralireza aflexibleframeworkforanomalydetectionviadimensionalityreduction
AT bassettbrucea aflexibleframeworkforanomalydetectionviadimensionalityreduction
AT kunzm aflexibleframeworkforanomalydetectionviadimensionalityreduction
AT vafaeisadralireza flexibleframeworkforanomalydetectionviadimensionalityreduction
AT bassettbrucea flexibleframeworkforanomalydetectionviadimensionalityreduction
AT kunzm flexibleframeworkforanomalydetectionviadimensionalityreduction