Cargando…

Data-Driven Network Analysis for Anomaly Traffic Detection

Cybersecurity is a critical issue in today’s internet world. Classical security systems, such as firewalls based on signature detection, cannot detect today’s sophisticated zero-day attacks. Machine learning (ML) based solutions are more attractive for their capabilities of detecting anomaly traffic...

Descripción completa

Detalles Bibliográficos
Autores principales: Alam, Shumon, Alam, Yasin, Cui, Suxia, Akujuobi, Cajetan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10574999/
https://www.ncbi.nlm.nih.gov/pubmed/37837004
http://dx.doi.org/10.3390/s23198174
_version_ 1785120820840890368
author Alam, Shumon
Alam, Yasin
Cui, Suxia
Akujuobi, Cajetan
author_facet Alam, Shumon
Alam, Yasin
Cui, Suxia
Akujuobi, Cajetan
author_sort Alam, Shumon
collection PubMed
description Cybersecurity is a critical issue in today’s internet world. Classical security systems, such as firewalls based on signature detection, cannot detect today’s sophisticated zero-day attacks. Machine learning (ML) based solutions are more attractive for their capabilities of detecting anomaly traffic from benign traffic, but to develop an ML-based anomaly detection system, we need meaningful or realistic network datasets to train the detection engine. There are many public network datasets for ML applications. Still, they have limitations, such as the data creation process and the lack of diverse attack scenarios or background traffic. To create a good detection engine, we need a realistic dataset with various attack scenarios and various types of background traffic, such as HTTPs, streaming, and SMTP traffic. In this work, we have developed realistic network data or datasets considering various attack scenarios and diverse background/benign traffic. Furthermore, considering the importance of distributed denial of service (DDoS) attacks, we have compared the performance of detecting anomaly traffic of some classical supervised and our prior developed unsupervised ML algorithms based on the convolutional neural network (CNN) and pseudo auto-encoder (AE) architecture based on the created datasets. The results show that the performance of the CNN-Pseudo-AE is comparable to that of many classical supervised algorithms. Hence, the CNN-Pseudo-AE algorithm is promising in actual implementation.
format Online
Article
Text
id pubmed-10574999
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-105749992023-10-14 Data-Driven Network Analysis for Anomaly Traffic Detection Alam, Shumon Alam, Yasin Cui, Suxia Akujuobi, Cajetan Sensors (Basel) Article Cybersecurity is a critical issue in today’s internet world. Classical security systems, such as firewalls based on signature detection, cannot detect today’s sophisticated zero-day attacks. Machine learning (ML) based solutions are more attractive for their capabilities of detecting anomaly traffic from benign traffic, but to develop an ML-based anomaly detection system, we need meaningful or realistic network datasets to train the detection engine. There are many public network datasets for ML applications. Still, they have limitations, such as the data creation process and the lack of diverse attack scenarios or background traffic. To create a good detection engine, we need a realistic dataset with various attack scenarios and various types of background traffic, such as HTTPs, streaming, and SMTP traffic. In this work, we have developed realistic network data or datasets considering various attack scenarios and diverse background/benign traffic. Furthermore, considering the importance of distributed denial of service (DDoS) attacks, we have compared the performance of detecting anomaly traffic of some classical supervised and our prior developed unsupervised ML algorithms based on the convolutional neural network (CNN) and pseudo auto-encoder (AE) architecture based on the created datasets. The results show that the performance of the CNN-Pseudo-AE is comparable to that of many classical supervised algorithms. Hence, the CNN-Pseudo-AE algorithm is promising in actual implementation. MDPI 2023-09-29 /pmc/articles/PMC10574999/ /pubmed/37837004 http://dx.doi.org/10.3390/s23198174 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Alam, Shumon
Alam, Yasin
Cui, Suxia
Akujuobi, Cajetan
Data-Driven Network Analysis for Anomaly Traffic Detection
title Data-Driven Network Analysis for Anomaly Traffic Detection
title_full Data-Driven Network Analysis for Anomaly Traffic Detection
title_fullStr Data-Driven Network Analysis for Anomaly Traffic Detection
title_full_unstemmed Data-Driven Network Analysis for Anomaly Traffic Detection
title_short Data-Driven Network Analysis for Anomaly Traffic Detection
title_sort data-driven network analysis for anomaly traffic detection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10574999/
https://www.ncbi.nlm.nih.gov/pubmed/37837004
http://dx.doi.org/10.3390/s23198174
work_keys_str_mv AT alamshumon datadrivennetworkanalysisforanomalytrafficdetection
AT alamyasin datadrivennetworkanalysisforanomalytrafficdetection
AT cuisuxia datadrivennetworkanalysisforanomalytrafficdetection
AT akujuobicajetan datadrivennetworkanalysisforanomalytrafficdetection