Cargando…
A hybrid anomaly detection method for high dimensional data
Anomaly detection of high-dimensional data is a challenge because the sparsity of the data distribution caused by high dimensionality hardly provides rich information distinguishing anomalous instances from normal instances. To address this, this article proposes an anomaly detection method combinin...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280180/ https://www.ncbi.nlm.nih.gov/pubmed/37346598 http://dx.doi.org/10.7717/peerj-cs.1199 |
_version_ | 1785060746458038272 |
---|---|
author | Zhang, Xin Wei, Pingping Wang, Qingling |
author_facet | Zhang, Xin Wei, Pingping Wang, Qingling |
author_sort | Zhang, Xin |
collection | PubMed |
description | Anomaly detection of high-dimensional data is a challenge because the sparsity of the data distribution caused by high dimensionality hardly provides rich information distinguishing anomalous instances from normal instances. To address this, this article proposes an anomaly detection method combining an autoencoder and a sparse weighted least squares-support vector machine. First, the autoencoder is used to extract those low-dimensional features of high-dimensional data, thus reducing the dimension and the complexity of the searching space. Then, in the low-dimensional feature space obtained by the autoencoder, the sparse weighted least squares-support vector machine separates anomalous and normal features. Finally, the learned class labels to be used to distinguish normal instances and abnormal instances are outputed, thus achieving anomaly detection of high-dimensional data. The experiment results on real high-dimensional datasets show that the proposed method wins over competing methods in terms of anomaly detection ability. For high-dimensional data, using deep methods can reconstruct the layered feature space, which is beneficial for gaining those advanced anomaly detection results. |
format | Online Article Text |
id | pubmed-10280180 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-102801802023-06-21 A hybrid anomaly detection method for high dimensional data Zhang, Xin Wei, Pingping Wang, Qingling PeerJ Comput Sci Algorithms and Analysis of Algorithms Anomaly detection of high-dimensional data is a challenge because the sparsity of the data distribution caused by high dimensionality hardly provides rich information distinguishing anomalous instances from normal instances. To address this, this article proposes an anomaly detection method combining an autoencoder and a sparse weighted least squares-support vector machine. First, the autoencoder is used to extract those low-dimensional features of high-dimensional data, thus reducing the dimension and the complexity of the searching space. Then, in the low-dimensional feature space obtained by the autoencoder, the sparse weighted least squares-support vector machine separates anomalous and normal features. Finally, the learned class labels to be used to distinguish normal instances and abnormal instances are outputed, thus achieving anomaly detection of high-dimensional data. The experiment results on real high-dimensional datasets show that the proposed method wins over competing methods in terms of anomaly detection ability. For high-dimensional data, using deep methods can reconstruct the layered feature space, which is beneficial for gaining those advanced anomaly detection results. PeerJ Inc. 2023-01-12 /pmc/articles/PMC10280180/ /pubmed/37346598 http://dx.doi.org/10.7717/peerj-cs.1199 Text en ©2022 Zhang et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
spellingShingle | Algorithms and Analysis of Algorithms Zhang, Xin Wei, Pingping Wang, Qingling A hybrid anomaly detection method for high dimensional data |
title | A hybrid anomaly detection method for high dimensional data |
title_full | A hybrid anomaly detection method for high dimensional data |
title_fullStr | A hybrid anomaly detection method for high dimensional data |
title_full_unstemmed | A hybrid anomaly detection method for high dimensional data |
title_short | A hybrid anomaly detection method for high dimensional data |
title_sort | hybrid anomaly detection method for high dimensional data |
topic | Algorithms and Analysis of Algorithms |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280180/ https://www.ncbi.nlm.nih.gov/pubmed/37346598 http://dx.doi.org/10.7717/peerj-cs.1199 |
work_keys_str_mv | AT zhangxin ahybridanomalydetectionmethodforhighdimensionaldata AT weipingping ahybridanomalydetectionmethodforhighdimensionaldata AT wangqingling ahybridanomalydetectionmethodforhighdimensionaldata AT zhangxin hybridanomalydetectionmethodforhighdimensionaldata AT weipingping hybridanomalydetectionmethodforhighdimensionaldata AT wangqingling hybridanomalydetectionmethodforhighdimensionaldata |