Cargando…

Towards Model Generalization for Intrusion Detection: Unsupervised Machine Learning Techniques

Through the ongoing digitization of the world, the number of connected devices is continuously growing without any foreseen decline in the near future. In particular, these devices increasingly include critical systems such as power grids and medical institutions, possibly causing tremendous consequ...

Descripción completa

Detalles Bibliográficos
Autores principales:	Verkerken, Miel, D’hooge, Laurens, Wauters, Tim, Volckaert, Bruno, De Turck, Filip
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer US 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8520582/ http://dx.doi.org/10.1007/s10922-021-09615-7

_version_	1784584699064090624
author	Verkerken, Miel D’hooge, Laurens Wauters, Tim Volckaert, Bruno De Turck, Filip
author_facet	Verkerken, Miel D’hooge, Laurens Wauters, Tim Volckaert, Bruno De Turck, Filip
author_sort	Verkerken, Miel
collection	PubMed
description	Through the ongoing digitization of the world, the number of connected devices is continuously growing without any foreseen decline in the near future. In particular, these devices increasingly include critical systems such as power grids and medical institutions, possibly causing tremendous consequences in the case of a successful cybersecurity attack. A network intrusion detection system (NIDS) is one of the main components to detect ongoing attacks by differentiating normal from malicious traffic. Anomaly-based NIDS, more specifically unsupervised methods previously proved promising for their ability to detect known as well as zero-day attacks without the need for a labeled dataset. Despite decades of development by researchers, anomaly-based NIDS are only rarely employed in real-world applications, most possibly due to the lack of generalization power of the proposed models. This article first evaluates four unsupervised machine learning methods on two recent datasets and then defines their generalization strength using a novel inter-dataset evaluation strategy estimating their adaptability. Results show that all models can present high classification scores on an individual dataset but fail to directly transfer those to a second unseen but related dataset. Specifically, the accuracy dropped on average 25.63% in an inter-dataset setting compared to the conventional evaluation approach. This generalization challenge can be observed and tackled in future research with the help of the proposed evaluation strategy in this paper.
format	Online Article Text
id	pubmed-8520582
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Springer US
record_format	MEDLINE/PubMed
spelling	pubmed-85205822021-10-18 Towards Model Generalization for Intrusion Detection: Unsupervised Machine Learning Techniques Verkerken, Miel D’hooge, Laurens Wauters, Tim Volckaert, Bruno De Turck, Filip J Netw Syst Manage Article Through the ongoing digitization of the world, the number of connected devices is continuously growing without any foreseen decline in the near future. In particular, these devices increasingly include critical systems such as power grids and medical institutions, possibly causing tremendous consequences in the case of a successful cybersecurity attack. A network intrusion detection system (NIDS) is one of the main components to detect ongoing attacks by differentiating normal from malicious traffic. Anomaly-based NIDS, more specifically unsupervised methods previously proved promising for their ability to detect known as well as zero-day attacks without the need for a labeled dataset. Despite decades of development by researchers, anomaly-based NIDS are only rarely employed in real-world applications, most possibly due to the lack of generalization power of the proposed models. This article first evaluates four unsupervised machine learning methods on two recent datasets and then defines their generalization strength using a novel inter-dataset evaluation strategy estimating their adaptability. Results show that all models can present high classification scores on an individual dataset but fail to directly transfer those to a second unseen but related dataset. Specifically, the accuracy dropped on average 25.63% in an inter-dataset setting compared to the conventional evaluation approach. This generalization challenge can be observed and tackled in future research with the help of the proposed evaluation strategy in this paper. Springer US 2021-10-17 2022 /pmc/articles/PMC8520582/ http://dx.doi.org/10.1007/s10922-021-09615-7 Text en © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle	Article Verkerken, Miel D’hooge, Laurens Wauters, Tim Volckaert, Bruno De Turck, Filip Towards Model Generalization for Intrusion Detection: Unsupervised Machine Learning Techniques
title	Towards Model Generalization for Intrusion Detection: Unsupervised Machine Learning Techniques
title_full	Towards Model Generalization for Intrusion Detection: Unsupervised Machine Learning Techniques
title_fullStr	Towards Model Generalization for Intrusion Detection: Unsupervised Machine Learning Techniques
title_full_unstemmed	Towards Model Generalization for Intrusion Detection: Unsupervised Machine Learning Techniques
title_short	Towards Model Generalization for Intrusion Detection: Unsupervised Machine Learning Techniques
title_sort	towards model generalization for intrusion detection: unsupervised machine learning techniques
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8520582/ http://dx.doi.org/10.1007/s10922-021-09615-7
work_keys_str_mv	AT verkerkenmiel towardsmodelgeneralizationforintrusiondetectionunsupervisedmachinelearningtechniques AT dhoogelaurens towardsmodelgeneralizationforintrusiondetectionunsupervisedmachinelearningtechniques AT wauterstim towardsmodelgeneralizationforintrusiondetectionunsupervisedmachinelearningtechniques AT volckaertbruno towardsmodelgeneralizationforintrusiondetectionunsupervisedmachinelearningtechniques AT deturckfilip towardsmodelgeneralizationforintrusiondetectionunsupervisedmachinelearningtechniques

Towards Model Generalization for Intrusion Detection: Unsupervised Machine Learning Techniques

Ejemplares similares