Cargando…

Apache Spark and Deep Learning Models for High-Performance Network Intrusion Detection Using CSE-CIC-IDS2018

Keeping computers secure is becoming challenging as networks grow and new network-based technologies emerge. Cybercriminals' attack surface expands with the release of new internet-enabled products. As many cyberattacks affect businesses' confidentiality, availability, and integrity, netwo...

Descripción completa

Detalles Bibliográficos
Autores principales: Hagar, Abdulnaser A., Gawali, Bharti W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9439899/
https://www.ncbi.nlm.nih.gov/pubmed/36059395
http://dx.doi.org/10.1155/2022/3131153
_version_ 1784782186024534016
author Hagar, Abdulnaser A.
Gawali, Bharti W.
author_facet Hagar, Abdulnaser A.
Gawali, Bharti W.
author_sort Hagar, Abdulnaser A.
collection PubMed
description Keeping computers secure is becoming challenging as networks grow and new network-based technologies emerge. Cybercriminals' attack surface expands with the release of new internet-enabled products. As many cyberattacks affect businesses' confidentiality, availability, and integrity, network intrusion detection systems (NIDS) show an essential role. Network-based intrusion detection uses datasets like CSE-CIC-IDS2018 to train prediction models. With fourteen types of attacks included, the latest big data set for intrusion detection is available to the public. This work proposes three models, two deep learning convolutional neural networks (CNN), long short-term memory (LSTM), and Apache Spark, to improve the detection of all types of attacks. To reduce the dimensionality, random forests (RF) was employed to select the important features; it gave 19 from 84 features. The dataset is imbalanced; thus, oversampling and undersampling techniques reduce the imbalance ratio. The Apache Spark model produced the best results across all 15 classes, with accuracy as high as 100% for all classes, as seen by the experiments' findings. For the F1-score, Apache Spark showed the highest results with 1.00 for most classes. The findings of the three models showed outstanding results for multiclassification network intrusion detection.
format Online
Article
Text
id pubmed-9439899
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-94398992022-09-03 Apache Spark and Deep Learning Models for High-Performance Network Intrusion Detection Using CSE-CIC-IDS2018 Hagar, Abdulnaser A. Gawali, Bharti W. Comput Intell Neurosci Research Article Keeping computers secure is becoming challenging as networks grow and new network-based technologies emerge. Cybercriminals' attack surface expands with the release of new internet-enabled products. As many cyberattacks affect businesses' confidentiality, availability, and integrity, network intrusion detection systems (NIDS) show an essential role. Network-based intrusion detection uses datasets like CSE-CIC-IDS2018 to train prediction models. With fourteen types of attacks included, the latest big data set for intrusion detection is available to the public. This work proposes three models, two deep learning convolutional neural networks (CNN), long short-term memory (LSTM), and Apache Spark, to improve the detection of all types of attacks. To reduce the dimensionality, random forests (RF) was employed to select the important features; it gave 19 from 84 features. The dataset is imbalanced; thus, oversampling and undersampling techniques reduce the imbalance ratio. The Apache Spark model produced the best results across all 15 classes, with accuracy as high as 100% for all classes, as seen by the experiments' findings. For the F1-score, Apache Spark showed the highest results with 1.00 for most classes. The findings of the three models showed outstanding results for multiclassification network intrusion detection. Hindawi 2022-08-26 /pmc/articles/PMC9439899/ /pubmed/36059395 http://dx.doi.org/10.1155/2022/3131153 Text en Copyright © 2022 Abdulnaser A. Hagar and Bharti W. Gawali. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Hagar, Abdulnaser A.
Gawali, Bharti W.
Apache Spark and Deep Learning Models for High-Performance Network Intrusion Detection Using CSE-CIC-IDS2018
title Apache Spark and Deep Learning Models for High-Performance Network Intrusion Detection Using CSE-CIC-IDS2018
title_full Apache Spark and Deep Learning Models for High-Performance Network Intrusion Detection Using CSE-CIC-IDS2018
title_fullStr Apache Spark and Deep Learning Models for High-Performance Network Intrusion Detection Using CSE-CIC-IDS2018
title_full_unstemmed Apache Spark and Deep Learning Models for High-Performance Network Intrusion Detection Using CSE-CIC-IDS2018
title_short Apache Spark and Deep Learning Models for High-Performance Network Intrusion Detection Using CSE-CIC-IDS2018
title_sort apache spark and deep learning models for high-performance network intrusion detection using cse-cic-ids2018
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9439899/
https://www.ncbi.nlm.nih.gov/pubmed/36059395
http://dx.doi.org/10.1155/2022/3131153
work_keys_str_mv AT hagarabdulnasera apachesparkanddeeplearningmodelsforhighperformancenetworkintrusiondetectionusingcsecicids2018
AT gawalibhartiw apachesparkanddeeplearningmodelsforhighperformancenetworkintrusiondetectionusingcsecicids2018