Cargando…

A hybrid anomaly detection method for high dimensional data

Anomaly detection of high-dimensional data is a challenge because the sparsity of the data distribution caused by high dimensionality hardly provides rich information distinguishing anomalous instances from normal instances. To address this, this article proposes an anomaly detection method combinin...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Xin, Wei, Pingping, Wang, Qingling
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280180/
https://www.ncbi.nlm.nih.gov/pubmed/37346598
http://dx.doi.org/10.7717/peerj-cs.1199
_version_ 1785060746458038272
author Zhang, Xin
Wei, Pingping
Wang, Qingling
author_facet Zhang, Xin
Wei, Pingping
Wang, Qingling
author_sort Zhang, Xin
collection PubMed
description Anomaly detection of high-dimensional data is a challenge because the sparsity of the data distribution caused by high dimensionality hardly provides rich information distinguishing anomalous instances from normal instances. To address this, this article proposes an anomaly detection method combining an autoencoder and a sparse weighted least squares-support vector machine. First, the autoencoder is used to extract those low-dimensional features of high-dimensional data, thus reducing the dimension and the complexity of the searching space. Then, in the low-dimensional feature space obtained by the autoencoder, the sparse weighted least squares-support vector machine separates anomalous and normal features. Finally, the learned class labels to be used to distinguish normal instances and abnormal instances are outputed, thus achieving anomaly detection of high-dimensional data. The experiment results on real high-dimensional datasets show that the proposed method wins over competing methods in terms of anomaly detection ability. For high-dimensional data, using deep methods can reconstruct the layered feature space, which is beneficial for gaining those advanced anomaly detection results.
format Online
Article
Text
id pubmed-10280180
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-102801802023-06-21 A hybrid anomaly detection method for high dimensional data Zhang, Xin Wei, Pingping Wang, Qingling PeerJ Comput Sci Algorithms and Analysis of Algorithms Anomaly detection of high-dimensional data is a challenge because the sparsity of the data distribution caused by high dimensionality hardly provides rich information distinguishing anomalous instances from normal instances. To address this, this article proposes an anomaly detection method combining an autoencoder and a sparse weighted least squares-support vector machine. First, the autoencoder is used to extract those low-dimensional features of high-dimensional data, thus reducing the dimension and the complexity of the searching space. Then, in the low-dimensional feature space obtained by the autoencoder, the sparse weighted least squares-support vector machine separates anomalous and normal features. Finally, the learned class labels to be used to distinguish normal instances and abnormal instances are outputed, thus achieving anomaly detection of high-dimensional data. The experiment results on real high-dimensional datasets show that the proposed method wins over competing methods in terms of anomaly detection ability. For high-dimensional data, using deep methods can reconstruct the layered feature space, which is beneficial for gaining those advanced anomaly detection results. PeerJ Inc. 2023-01-12 /pmc/articles/PMC10280180/ /pubmed/37346598 http://dx.doi.org/10.7717/peerj-cs.1199 Text en ©2022 Zhang et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Algorithms and Analysis of Algorithms
Zhang, Xin
Wei, Pingping
Wang, Qingling
A hybrid anomaly detection method for high dimensional data
title A hybrid anomaly detection method for high dimensional data
title_full A hybrid anomaly detection method for high dimensional data
title_fullStr A hybrid anomaly detection method for high dimensional data
title_full_unstemmed A hybrid anomaly detection method for high dimensional data
title_short A hybrid anomaly detection method for high dimensional data
title_sort hybrid anomaly detection method for high dimensional data
topic Algorithms and Analysis of Algorithms
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280180/
https://www.ncbi.nlm.nih.gov/pubmed/37346598
http://dx.doi.org/10.7717/peerj-cs.1199
work_keys_str_mv AT zhangxin ahybridanomalydetectionmethodforhighdimensionaldata
AT weipingping ahybridanomalydetectionmethodforhighdimensionaldata
AT wangqingling ahybridanomalydetectionmethodforhighdimensionaldata
AT zhangxin hybridanomalydetectionmethodforhighdimensionaldata
AT weipingping hybridanomalydetectionmethodforhighdimensionaldata
AT wangqingling hybridanomalydetectionmethodforhighdimensionaldata