Cargando…
Data-Driven Network Analysis for Anomaly Traffic Detection
Cybersecurity is a critical issue in today’s internet world. Classical security systems, such as firewalls based on signature detection, cannot detect today’s sophisticated zero-day attacks. Machine learning (ML) based solutions are more attractive for their capabilities of detecting anomaly traffic...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10574999/ https://www.ncbi.nlm.nih.gov/pubmed/37837004 http://dx.doi.org/10.3390/s23198174 |
_version_ | 1785120820840890368 |
---|---|
author | Alam, Shumon Alam, Yasin Cui, Suxia Akujuobi, Cajetan |
author_facet | Alam, Shumon Alam, Yasin Cui, Suxia Akujuobi, Cajetan |
author_sort | Alam, Shumon |
collection | PubMed |
description | Cybersecurity is a critical issue in today’s internet world. Classical security systems, such as firewalls based on signature detection, cannot detect today’s sophisticated zero-day attacks. Machine learning (ML) based solutions are more attractive for their capabilities of detecting anomaly traffic from benign traffic, but to develop an ML-based anomaly detection system, we need meaningful or realistic network datasets to train the detection engine. There are many public network datasets for ML applications. Still, they have limitations, such as the data creation process and the lack of diverse attack scenarios or background traffic. To create a good detection engine, we need a realistic dataset with various attack scenarios and various types of background traffic, such as HTTPs, streaming, and SMTP traffic. In this work, we have developed realistic network data or datasets considering various attack scenarios and diverse background/benign traffic. Furthermore, considering the importance of distributed denial of service (DDoS) attacks, we have compared the performance of detecting anomaly traffic of some classical supervised and our prior developed unsupervised ML algorithms based on the convolutional neural network (CNN) and pseudo auto-encoder (AE) architecture based on the created datasets. The results show that the performance of the CNN-Pseudo-AE is comparable to that of many classical supervised algorithms. Hence, the CNN-Pseudo-AE algorithm is promising in actual implementation. |
format | Online Article Text |
id | pubmed-10574999 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-105749992023-10-14 Data-Driven Network Analysis for Anomaly Traffic Detection Alam, Shumon Alam, Yasin Cui, Suxia Akujuobi, Cajetan Sensors (Basel) Article Cybersecurity is a critical issue in today’s internet world. Classical security systems, such as firewalls based on signature detection, cannot detect today’s sophisticated zero-day attacks. Machine learning (ML) based solutions are more attractive for their capabilities of detecting anomaly traffic from benign traffic, but to develop an ML-based anomaly detection system, we need meaningful or realistic network datasets to train the detection engine. There are many public network datasets for ML applications. Still, they have limitations, such as the data creation process and the lack of diverse attack scenarios or background traffic. To create a good detection engine, we need a realistic dataset with various attack scenarios and various types of background traffic, such as HTTPs, streaming, and SMTP traffic. In this work, we have developed realistic network data or datasets considering various attack scenarios and diverse background/benign traffic. Furthermore, considering the importance of distributed denial of service (DDoS) attacks, we have compared the performance of detecting anomaly traffic of some classical supervised and our prior developed unsupervised ML algorithms based on the convolutional neural network (CNN) and pseudo auto-encoder (AE) architecture based on the created datasets. The results show that the performance of the CNN-Pseudo-AE is comparable to that of many classical supervised algorithms. Hence, the CNN-Pseudo-AE algorithm is promising in actual implementation. MDPI 2023-09-29 /pmc/articles/PMC10574999/ /pubmed/37837004 http://dx.doi.org/10.3390/s23198174 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Alam, Shumon Alam, Yasin Cui, Suxia Akujuobi, Cajetan Data-Driven Network Analysis for Anomaly Traffic Detection |
title | Data-Driven Network Analysis for Anomaly Traffic Detection |
title_full | Data-Driven Network Analysis for Anomaly Traffic Detection |
title_fullStr | Data-Driven Network Analysis for Anomaly Traffic Detection |
title_full_unstemmed | Data-Driven Network Analysis for Anomaly Traffic Detection |
title_short | Data-Driven Network Analysis for Anomaly Traffic Detection |
title_sort | data-driven network analysis for anomaly traffic detection |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10574999/ https://www.ncbi.nlm.nih.gov/pubmed/37837004 http://dx.doi.org/10.3390/s23198174 |
work_keys_str_mv | AT alamshumon datadrivennetworkanalysisforanomalytrafficdetection AT alamyasin datadrivennetworkanalysisforanomalytrafficdetection AT cuisuxia datadrivennetworkanalysisforanomalytrafficdetection AT akujuobicajetan datadrivennetworkanalysisforanomalytrafficdetection |