Cargando…
A flexible framework for anomaly Detection via dimensionality reduction
Anomaly detection is challenging, especially for large datasets in high dimensions. Here, we explore a general anomaly detection framework based on dimensionality reduction and unsupervised clustering. DRAMA is released as a general python package that implements the general framework with a wide ra...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer London
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7946572/ https://www.ncbi.nlm.nih.gov/pubmed/33723477 http://dx.doi.org/10.1007/s00521-021-05839-5 |
_version_ | 1783663080779022336 |
---|---|
author | Vafaei Sadr, Alireza Bassett, Bruce A. Kunz, M. |
author_facet | Vafaei Sadr, Alireza Bassett, Bruce A. Kunz, M. |
author_sort | Vafaei Sadr, Alireza |
collection | PubMed |
description | Anomaly detection is challenging, especially for large datasets in high dimensions. Here, we explore a general anomaly detection framework based on dimensionality reduction and unsupervised clustering. DRAMA is released as a general python package that implements the general framework with a wide range of built-in options. This approach identifies the primary prototypes in the data with anomalies detected by their large distances from the prototypes, either in the latent space or in the original, high-dimensional space. DRAMA is tested on a wide variety of simulated and real datasets, in up to 3000 dimensions, and is found to be robust and highly competitive with commonly used anomaly detection algorithms, especially in high dimensions. The flexibility of the DRAMA framework allows for significant optimization once some examples of anomalies are available, making it ideal for online anomaly detection, active learning, and highly unbalanced datasets. Besides, DRAMA naturally provides clustering of outliers for subsequent analysis. |
format | Online Article Text |
id | pubmed-7946572 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer London |
record_format | MEDLINE/PubMed |
spelling | pubmed-79465722021-03-11 A flexible framework for anomaly Detection via dimensionality reduction Vafaei Sadr, Alireza Bassett, Bruce A. Kunz, M. Neural Comput Appl S.I. : 2019 India Intl. Congress on Computational Intelligence Anomaly detection is challenging, especially for large datasets in high dimensions. Here, we explore a general anomaly detection framework based on dimensionality reduction and unsupervised clustering. DRAMA is released as a general python package that implements the general framework with a wide range of built-in options. This approach identifies the primary prototypes in the data with anomalies detected by their large distances from the prototypes, either in the latent space or in the original, high-dimensional space. DRAMA is tested on a wide variety of simulated and real datasets, in up to 3000 dimensions, and is found to be robust and highly competitive with commonly used anomaly detection algorithms, especially in high dimensions. The flexibility of the DRAMA framework allows for significant optimization once some examples of anomalies are available, making it ideal for online anomaly detection, active learning, and highly unbalanced datasets. Besides, DRAMA naturally provides clustering of outliers for subsequent analysis. Springer London 2021-03-11 2023 /pmc/articles/PMC7946572/ /pubmed/33723477 http://dx.doi.org/10.1007/s00521-021-05839-5 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | S.I. : 2019 India Intl. Congress on Computational Intelligence Vafaei Sadr, Alireza Bassett, Bruce A. Kunz, M. A flexible framework for anomaly Detection via dimensionality reduction |
title | A flexible framework for anomaly Detection via dimensionality reduction |
title_full | A flexible framework for anomaly Detection via dimensionality reduction |
title_fullStr | A flexible framework for anomaly Detection via dimensionality reduction |
title_full_unstemmed | A flexible framework for anomaly Detection via dimensionality reduction |
title_short | A flexible framework for anomaly Detection via dimensionality reduction |
title_sort | flexible framework for anomaly detection via dimensionality reduction |
topic | S.I. : 2019 India Intl. Congress on Computational Intelligence |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7946572/ https://www.ncbi.nlm.nih.gov/pubmed/33723477 http://dx.doi.org/10.1007/s00521-021-05839-5 |
work_keys_str_mv | AT vafaeisadralireza aflexibleframeworkforanomalydetectionviadimensionalityreduction AT bassettbrucea aflexibleframeworkforanomalydetectionviadimensionalityreduction AT kunzm aflexibleframeworkforanomalydetectionviadimensionalityreduction AT vafaeisadralireza flexibleframeworkforanomalydetectionviadimensionalityreduction AT bassettbrucea flexibleframeworkforanomalydetectionviadimensionalityreduction AT kunzm flexibleframeworkforanomalydetectionviadimensionalityreduction |