Cargando…

Network intrusion detection using oversampling technique and machine learning algorithms

The expeditious growth of the World Wide Web and the rampant flow of network traffic have resulted in a continuous increase of network security threats. Cyber attackers seek to exploit vulnerabilities in network architecture to steal valuable information or disrupt computer resources. Network Intrus...

Descripción completa

Detalles Bibliográficos
Autores principales: Ahmed, Hafiza Anisa, Hameed, Anum, Bawany, Narmeen Zakaria
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8771780/
https://www.ncbi.nlm.nih.gov/pubmed/35111914
http://dx.doi.org/10.7717/peerj-cs.820
_version_ 1784635689140224000
author Ahmed, Hafiza Anisa
Hameed, Anum
Bawany, Narmeen Zakaria
author_facet Ahmed, Hafiza Anisa
Hameed, Anum
Bawany, Narmeen Zakaria
author_sort Ahmed, Hafiza Anisa
collection PubMed
description The expeditious growth of the World Wide Web and the rampant flow of network traffic have resulted in a continuous increase of network security threats. Cyber attackers seek to exploit vulnerabilities in network architecture to steal valuable information or disrupt computer resources. Network Intrusion Detection System (NIDS) is used to effectively detect various attacks, thus providing timely protection to network resources from these attacks. To implement NIDS, a stream of supervised and unsupervised machine learning approaches is applied to detect irregularities in network traffic and to address network security issues. Such NIDSs are trained using various datasets that include attack traces. However, due to the advancement in modern-day attacks, these systems are unable to detect the emerging threats. Therefore, NIDS needs to be trained and developed with a modern comprehensive dataset which contains contemporary common and attack activities. This paper presents a framework in which different machine learning classification schemes are employed to detect various types of network attack categories. Five machine learning algorithms: Random Forest, Decision Tree, Logistic Regression, K-Nearest Neighbors and Artificial Neural Networks, are used for attack detection. This study uses a dataset published by the University of New South Wales (UNSW-NB15), a relatively new dataset that contains a large amount of network traffic data with nine categories of network attacks. The results show that the classification models achieved the highest accuracy of 89.29% by applying the Random Forest algorithm. Further improvement in the accuracy of classification models is observed when Synthetic Minority Oversampling Technique (SMOTE) is applied to address the class imbalance problem. After applying the SMOTE, the Random Forest classifier showed an accuracy of 95.1% with 24 selected features from the Principal Component Analysis method.
format Online
Article
Text
id pubmed-8771780
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-87717802022-02-01 Network intrusion detection using oversampling technique and machine learning algorithms Ahmed, Hafiza Anisa Hameed, Anum Bawany, Narmeen Zakaria PeerJ Comput Sci Data Mining and Machine Learning The expeditious growth of the World Wide Web and the rampant flow of network traffic have resulted in a continuous increase of network security threats. Cyber attackers seek to exploit vulnerabilities in network architecture to steal valuable information or disrupt computer resources. Network Intrusion Detection System (NIDS) is used to effectively detect various attacks, thus providing timely protection to network resources from these attacks. To implement NIDS, a stream of supervised and unsupervised machine learning approaches is applied to detect irregularities in network traffic and to address network security issues. Such NIDSs are trained using various datasets that include attack traces. However, due to the advancement in modern-day attacks, these systems are unable to detect the emerging threats. Therefore, NIDS needs to be trained and developed with a modern comprehensive dataset which contains contemporary common and attack activities. This paper presents a framework in which different machine learning classification schemes are employed to detect various types of network attack categories. Five machine learning algorithms: Random Forest, Decision Tree, Logistic Regression, K-Nearest Neighbors and Artificial Neural Networks, are used for attack detection. This study uses a dataset published by the University of New South Wales (UNSW-NB15), a relatively new dataset that contains a large amount of network traffic data with nine categories of network attacks. The results show that the classification models achieved the highest accuracy of 89.29% by applying the Random Forest algorithm. Further improvement in the accuracy of classification models is observed when Synthetic Minority Oversampling Technique (SMOTE) is applied to address the class imbalance problem. After applying the SMOTE, the Random Forest classifier showed an accuracy of 95.1% with 24 selected features from the Principal Component Analysis method. PeerJ Inc. 2022-01-07 /pmc/articles/PMC8771780/ /pubmed/35111914 http://dx.doi.org/10.7717/peerj-cs.820 Text en © 2022 Ahmed et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Data Mining and Machine Learning
Ahmed, Hafiza Anisa
Hameed, Anum
Bawany, Narmeen Zakaria
Network intrusion detection using oversampling technique and machine learning algorithms
title Network intrusion detection using oversampling technique and machine learning algorithms
title_full Network intrusion detection using oversampling technique and machine learning algorithms
title_fullStr Network intrusion detection using oversampling technique and machine learning algorithms
title_full_unstemmed Network intrusion detection using oversampling technique and machine learning algorithms
title_short Network intrusion detection using oversampling technique and machine learning algorithms
title_sort network intrusion detection using oversampling technique and machine learning algorithms
topic Data Mining and Machine Learning
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8771780/
https://www.ncbi.nlm.nih.gov/pubmed/35111914
http://dx.doi.org/10.7717/peerj-cs.820
work_keys_str_mv AT ahmedhafizaanisa networkintrusiondetectionusingoversamplingtechniqueandmachinelearningalgorithms
AT hameedanum networkintrusiondetectionusingoversamplingtechniqueandmachinelearningalgorithms
AT bawanynarmeenzakaria networkintrusiondetectionusingoversamplingtechniqueandmachinelearningalgorithms