Cargando…
Network intrusion detection using oversampling technique and machine learning algorithms
The expeditious growth of the World Wide Web and the rampant flow of network traffic have resulted in a continuous increase of network security threats. Cyber attackers seek to exploit vulnerabilities in network architecture to steal valuable information or disrupt computer resources. Network Intrus...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8771780/ https://www.ncbi.nlm.nih.gov/pubmed/35111914 http://dx.doi.org/10.7717/peerj-cs.820 |
_version_ | 1784635689140224000 |
---|---|
author | Ahmed, Hafiza Anisa Hameed, Anum Bawany, Narmeen Zakaria |
author_facet | Ahmed, Hafiza Anisa Hameed, Anum Bawany, Narmeen Zakaria |
author_sort | Ahmed, Hafiza Anisa |
collection | PubMed |
description | The expeditious growth of the World Wide Web and the rampant flow of network traffic have resulted in a continuous increase of network security threats. Cyber attackers seek to exploit vulnerabilities in network architecture to steal valuable information or disrupt computer resources. Network Intrusion Detection System (NIDS) is used to effectively detect various attacks, thus providing timely protection to network resources from these attacks. To implement NIDS, a stream of supervised and unsupervised machine learning approaches is applied to detect irregularities in network traffic and to address network security issues. Such NIDSs are trained using various datasets that include attack traces. However, due to the advancement in modern-day attacks, these systems are unable to detect the emerging threats. Therefore, NIDS needs to be trained and developed with a modern comprehensive dataset which contains contemporary common and attack activities. This paper presents a framework in which different machine learning classification schemes are employed to detect various types of network attack categories. Five machine learning algorithms: Random Forest, Decision Tree, Logistic Regression, K-Nearest Neighbors and Artificial Neural Networks, are used for attack detection. This study uses a dataset published by the University of New South Wales (UNSW-NB15), a relatively new dataset that contains a large amount of network traffic data with nine categories of network attacks. The results show that the classification models achieved the highest accuracy of 89.29% by applying the Random Forest algorithm. Further improvement in the accuracy of classification models is observed when Synthetic Minority Oversampling Technique (SMOTE) is applied to address the class imbalance problem. After applying the SMOTE, the Random Forest classifier showed an accuracy of 95.1% with 24 selected features from the Principal Component Analysis method. |
format | Online Article Text |
id | pubmed-8771780 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-87717802022-02-01 Network intrusion detection using oversampling technique and machine learning algorithms Ahmed, Hafiza Anisa Hameed, Anum Bawany, Narmeen Zakaria PeerJ Comput Sci Data Mining and Machine Learning The expeditious growth of the World Wide Web and the rampant flow of network traffic have resulted in a continuous increase of network security threats. Cyber attackers seek to exploit vulnerabilities in network architecture to steal valuable information or disrupt computer resources. Network Intrusion Detection System (NIDS) is used to effectively detect various attacks, thus providing timely protection to network resources from these attacks. To implement NIDS, a stream of supervised and unsupervised machine learning approaches is applied to detect irregularities in network traffic and to address network security issues. Such NIDSs are trained using various datasets that include attack traces. However, due to the advancement in modern-day attacks, these systems are unable to detect the emerging threats. Therefore, NIDS needs to be trained and developed with a modern comprehensive dataset which contains contemporary common and attack activities. This paper presents a framework in which different machine learning classification schemes are employed to detect various types of network attack categories. Five machine learning algorithms: Random Forest, Decision Tree, Logistic Regression, K-Nearest Neighbors and Artificial Neural Networks, are used for attack detection. This study uses a dataset published by the University of New South Wales (UNSW-NB15), a relatively new dataset that contains a large amount of network traffic data with nine categories of network attacks. The results show that the classification models achieved the highest accuracy of 89.29% by applying the Random Forest algorithm. Further improvement in the accuracy of classification models is observed when Synthetic Minority Oversampling Technique (SMOTE) is applied to address the class imbalance problem. After applying the SMOTE, the Random Forest classifier showed an accuracy of 95.1% with 24 selected features from the Principal Component Analysis method. PeerJ Inc. 2022-01-07 /pmc/articles/PMC8771780/ /pubmed/35111914 http://dx.doi.org/10.7717/peerj-cs.820 Text en © 2022 Ahmed et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
spellingShingle | Data Mining and Machine Learning Ahmed, Hafiza Anisa Hameed, Anum Bawany, Narmeen Zakaria Network intrusion detection using oversampling technique and machine learning algorithms |
title | Network intrusion detection using oversampling technique and machine learning algorithms |
title_full | Network intrusion detection using oversampling technique and machine learning algorithms |
title_fullStr | Network intrusion detection using oversampling technique and machine learning algorithms |
title_full_unstemmed | Network intrusion detection using oversampling technique and machine learning algorithms |
title_short | Network intrusion detection using oversampling technique and machine learning algorithms |
title_sort | network intrusion detection using oversampling technique and machine learning algorithms |
topic | Data Mining and Machine Learning |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8771780/ https://www.ncbi.nlm.nih.gov/pubmed/35111914 http://dx.doi.org/10.7717/peerj-cs.820 |
work_keys_str_mv | AT ahmedhafizaanisa networkintrusiondetectionusingoversamplingtechniqueandmachinelearningalgorithms AT hameedanum networkintrusiondetectionusingoversamplingtechniqueandmachinelearningalgorithms AT bawanynarmeenzakaria networkintrusiondetectionusingoversamplingtechniqueandmachinelearningalgorithms |