Cargando…

Towards Model Generalization for Intrusion Detection: Unsupervised Machine Learning Techniques

Through the ongoing digitization of the world, the number of connected devices is continuously growing without any foreseen decline in the near future. In particular, these devices increasingly include critical systems such as power grids and medical institutions, possibly causing tremendous consequ...

Descripción completa

Detalles Bibliográficos
Autores principales: Verkerken, Miel, D’hooge, Laurens, Wauters, Tim, Volckaert, Bruno, De Turck, Filip
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8520582/
http://dx.doi.org/10.1007/s10922-021-09615-7
_version_ 1784584699064090624
author Verkerken, Miel
D’hooge, Laurens
Wauters, Tim
Volckaert, Bruno
De Turck, Filip
author_facet Verkerken, Miel
D’hooge, Laurens
Wauters, Tim
Volckaert, Bruno
De Turck, Filip
author_sort Verkerken, Miel
collection PubMed
description Through the ongoing digitization of the world, the number of connected devices is continuously growing without any foreseen decline in the near future. In particular, these devices increasingly include critical systems such as power grids and medical institutions, possibly causing tremendous consequences in the case of a successful cybersecurity attack. A network intrusion detection system (NIDS) is one of the main components to detect ongoing attacks by differentiating normal from malicious traffic. Anomaly-based NIDS, more specifically unsupervised methods previously proved promising for their ability to detect known as well as zero-day attacks without the need for a labeled dataset. Despite decades of development by researchers, anomaly-based NIDS are only rarely employed in real-world applications, most possibly due to the lack of generalization power of the proposed models. This article first evaluates four unsupervised machine learning methods on two recent datasets and then defines their generalization strength using a novel inter-dataset evaluation strategy estimating their adaptability. Results show that all models can present high classification scores on an individual dataset but fail to directly transfer those to a second unseen but related dataset. Specifically, the accuracy dropped on average 25.63% in an inter-dataset setting compared to the conventional evaluation approach. This generalization challenge can be observed and tackled in future research with the help of the proposed evaluation strategy in this paper.
format Online
Article
Text
id pubmed-8520582
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-85205822021-10-18 Towards Model Generalization for Intrusion Detection: Unsupervised Machine Learning Techniques Verkerken, Miel D’hooge, Laurens Wauters, Tim Volckaert, Bruno De Turck, Filip J Netw Syst Manage Article Through the ongoing digitization of the world, the number of connected devices is continuously growing without any foreseen decline in the near future. In particular, these devices increasingly include critical systems such as power grids and medical institutions, possibly causing tremendous consequences in the case of a successful cybersecurity attack. A network intrusion detection system (NIDS) is one of the main components to detect ongoing attacks by differentiating normal from malicious traffic. Anomaly-based NIDS, more specifically unsupervised methods previously proved promising for their ability to detect known as well as zero-day attacks without the need for a labeled dataset. Despite decades of development by researchers, anomaly-based NIDS are only rarely employed in real-world applications, most possibly due to the lack of generalization power of the proposed models. This article first evaluates four unsupervised machine learning methods on two recent datasets and then defines their generalization strength using a novel inter-dataset evaluation strategy estimating their adaptability. Results show that all models can present high classification scores on an individual dataset but fail to directly transfer those to a second unseen but related dataset. Specifically, the accuracy dropped on average 25.63% in an inter-dataset setting compared to the conventional evaluation approach. This generalization challenge can be observed and tackled in future research with the help of the proposed evaluation strategy in this paper. Springer US 2021-10-17 2022 /pmc/articles/PMC8520582/ http://dx.doi.org/10.1007/s10922-021-09615-7 Text en © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Verkerken, Miel
D’hooge, Laurens
Wauters, Tim
Volckaert, Bruno
De Turck, Filip
Towards Model Generalization for Intrusion Detection: Unsupervised Machine Learning Techniques
title Towards Model Generalization for Intrusion Detection: Unsupervised Machine Learning Techniques
title_full Towards Model Generalization for Intrusion Detection: Unsupervised Machine Learning Techniques
title_fullStr Towards Model Generalization for Intrusion Detection: Unsupervised Machine Learning Techniques
title_full_unstemmed Towards Model Generalization for Intrusion Detection: Unsupervised Machine Learning Techniques
title_short Towards Model Generalization for Intrusion Detection: Unsupervised Machine Learning Techniques
title_sort towards model generalization for intrusion detection: unsupervised machine learning techniques
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8520582/
http://dx.doi.org/10.1007/s10922-021-09615-7
work_keys_str_mv AT verkerkenmiel towardsmodelgeneralizationforintrusiondetectionunsupervisedmachinelearningtechniques
AT dhoogelaurens towardsmodelgeneralizationforintrusiondetectionunsupervisedmachinelearningtechniques
AT wauterstim towardsmodelgeneralizationforintrusiondetectionunsupervisedmachinelearningtechniques
AT volckaertbruno towardsmodelgeneralizationforintrusiondetectionunsupervisedmachinelearningtechniques
AT deturckfilip towardsmodelgeneralizationforintrusiondetectionunsupervisedmachinelearningtechniques