Cargando…
Towards Model Generalization for Intrusion Detection: Unsupervised Machine Learning Techniques
Through the ongoing digitization of the world, the number of connected devices is continuously growing without any foreseen decline in the near future. In particular, these devices increasingly include critical systems such as power grids and medical institutions, possibly causing tremendous consequ...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer US
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8520582/ http://dx.doi.org/10.1007/s10922-021-09615-7 |
_version_ | 1784584699064090624 |
---|---|
author | Verkerken, Miel D’hooge, Laurens Wauters, Tim Volckaert, Bruno De Turck, Filip |
author_facet | Verkerken, Miel D’hooge, Laurens Wauters, Tim Volckaert, Bruno De Turck, Filip |
author_sort | Verkerken, Miel |
collection | PubMed |
description | Through the ongoing digitization of the world, the number of connected devices is continuously growing without any foreseen decline in the near future. In particular, these devices increasingly include critical systems such as power grids and medical institutions, possibly causing tremendous consequences in the case of a successful cybersecurity attack. A network intrusion detection system (NIDS) is one of the main components to detect ongoing attacks by differentiating normal from malicious traffic. Anomaly-based NIDS, more specifically unsupervised methods previously proved promising for their ability to detect known as well as zero-day attacks without the need for a labeled dataset. Despite decades of development by researchers, anomaly-based NIDS are only rarely employed in real-world applications, most possibly due to the lack of generalization power of the proposed models. This article first evaluates four unsupervised machine learning methods on two recent datasets and then defines their generalization strength using a novel inter-dataset evaluation strategy estimating their adaptability. Results show that all models can present high classification scores on an individual dataset but fail to directly transfer those to a second unseen but related dataset. Specifically, the accuracy dropped on average 25.63% in an inter-dataset setting compared to the conventional evaluation approach. This generalization challenge can be observed and tackled in future research with the help of the proposed evaluation strategy in this paper. |
format | Online Article Text |
id | pubmed-8520582 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer US |
record_format | MEDLINE/PubMed |
spelling | pubmed-85205822021-10-18 Towards Model Generalization for Intrusion Detection: Unsupervised Machine Learning Techniques Verkerken, Miel D’hooge, Laurens Wauters, Tim Volckaert, Bruno De Turck, Filip J Netw Syst Manage Article Through the ongoing digitization of the world, the number of connected devices is continuously growing without any foreseen decline in the near future. In particular, these devices increasingly include critical systems such as power grids and medical institutions, possibly causing tremendous consequences in the case of a successful cybersecurity attack. A network intrusion detection system (NIDS) is one of the main components to detect ongoing attacks by differentiating normal from malicious traffic. Anomaly-based NIDS, more specifically unsupervised methods previously proved promising for their ability to detect known as well as zero-day attacks without the need for a labeled dataset. Despite decades of development by researchers, anomaly-based NIDS are only rarely employed in real-world applications, most possibly due to the lack of generalization power of the proposed models. This article first evaluates four unsupervised machine learning methods on two recent datasets and then defines their generalization strength using a novel inter-dataset evaluation strategy estimating their adaptability. Results show that all models can present high classification scores on an individual dataset but fail to directly transfer those to a second unseen but related dataset. Specifically, the accuracy dropped on average 25.63% in an inter-dataset setting compared to the conventional evaluation approach. This generalization challenge can be observed and tackled in future research with the help of the proposed evaluation strategy in this paper. Springer US 2021-10-17 2022 /pmc/articles/PMC8520582/ http://dx.doi.org/10.1007/s10922-021-09615-7 Text en © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Verkerken, Miel D’hooge, Laurens Wauters, Tim Volckaert, Bruno De Turck, Filip Towards Model Generalization for Intrusion Detection: Unsupervised Machine Learning Techniques |
title | Towards Model Generalization for Intrusion Detection: Unsupervised Machine Learning Techniques |
title_full | Towards Model Generalization for Intrusion Detection: Unsupervised Machine Learning Techniques |
title_fullStr | Towards Model Generalization for Intrusion Detection: Unsupervised Machine Learning Techniques |
title_full_unstemmed | Towards Model Generalization for Intrusion Detection: Unsupervised Machine Learning Techniques |
title_short | Towards Model Generalization for Intrusion Detection: Unsupervised Machine Learning Techniques |
title_sort | towards model generalization for intrusion detection: unsupervised machine learning techniques |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8520582/ http://dx.doi.org/10.1007/s10922-021-09615-7 |
work_keys_str_mv | AT verkerkenmiel towardsmodelgeneralizationforintrusiondetectionunsupervisedmachinelearningtechniques AT dhoogelaurens towardsmodelgeneralizationforintrusiondetectionunsupervisedmachinelearningtechniques AT wauterstim towardsmodelgeneralizationforintrusiondetectionunsupervisedmachinelearningtechniques AT volckaertbruno towardsmodelgeneralizationforintrusiondetectionunsupervisedmachinelearningtechniques AT deturckfilip towardsmodelgeneralizationforintrusiondetectionunsupervisedmachinelearningtechniques |