Cargando…

Data-Driven Network Analysis for Anomaly Traffic Detection

Cybersecurity is a critical issue in today’s internet world. Classical security systems, such as firewalls based on signature detection, cannot detect today’s sophisticated zero-day attacks. Machine learning (ML) based solutions are more attractive for their capabilities of detecting anomaly traffic...

Descripción completa

Detalles Bibliográficos
Autores principales:	Alam, Shumon, Alam, Yasin, Cui, Suxia, Akujuobi, Cajetan
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10574999/ https://www.ncbi.nlm.nih.gov/pubmed/37837004 http://dx.doi.org/10.3390/s23198174

_version_	1785120820840890368
author	Alam, Shumon Alam, Yasin Cui, Suxia Akujuobi, Cajetan
author_facet	Alam, Shumon Alam, Yasin Cui, Suxia Akujuobi, Cajetan
author_sort	Alam, Shumon
collection	PubMed
description	Cybersecurity is a critical issue in today’s internet world. Classical security systems, such as firewalls based on signature detection, cannot detect today’s sophisticated zero-day attacks. Machine learning (ML) based solutions are more attractive for their capabilities of detecting anomaly traffic from benign traffic, but to develop an ML-based anomaly detection system, we need meaningful or realistic network datasets to train the detection engine. There are many public network datasets for ML applications. Still, they have limitations, such as the data creation process and the lack of diverse attack scenarios or background traffic. To create a good detection engine, we need a realistic dataset with various attack scenarios and various types of background traffic, such as HTTPs, streaming, and SMTP traffic. In this work, we have developed realistic network data or datasets considering various attack scenarios and diverse background/benign traffic. Furthermore, considering the importance of distributed denial of service (DDoS) attacks, we have compared the performance of detecting anomaly traffic of some classical supervised and our prior developed unsupervised ML algorithms based on the convolutional neural network (CNN) and pseudo auto-encoder (AE) architecture based on the created datasets. The results show that the performance of the CNN-Pseudo-AE is comparable to that of many classical supervised algorithms. Hence, the CNN-Pseudo-AE algorithm is promising in actual implementation.
format	Online Article Text
id	pubmed-10574999
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-105749992023-10-14 Data-Driven Network Analysis for Anomaly Traffic Detection Alam, Shumon Alam, Yasin Cui, Suxia Akujuobi, Cajetan Sensors (Basel) Article Cybersecurity is a critical issue in today’s internet world. Classical security systems, such as firewalls based on signature detection, cannot detect today’s sophisticated zero-day attacks. Machine learning (ML) based solutions are more attractive for their capabilities of detecting anomaly traffic from benign traffic, but to develop an ML-based anomaly detection system, we need meaningful or realistic network datasets to train the detection engine. There are many public network datasets for ML applications. Still, they have limitations, such as the data creation process and the lack of diverse attack scenarios or background traffic. To create a good detection engine, we need a realistic dataset with various attack scenarios and various types of background traffic, such as HTTPs, streaming, and SMTP traffic. In this work, we have developed realistic network data or datasets considering various attack scenarios and diverse background/benign traffic. Furthermore, considering the importance of distributed denial of service (DDoS) attacks, we have compared the performance of detecting anomaly traffic of some classical supervised and our prior developed unsupervised ML algorithms based on the convolutional neural network (CNN) and pseudo auto-encoder (AE) architecture based on the created datasets. The results show that the performance of the CNN-Pseudo-AE is comparable to that of many classical supervised algorithms. Hence, the CNN-Pseudo-AE algorithm is promising in actual implementation. MDPI 2023-09-29 /pmc/articles/PMC10574999/ /pubmed/37837004 http://dx.doi.org/10.3390/s23198174 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Alam, Shumon Alam, Yasin Cui, Suxia Akujuobi, Cajetan Data-Driven Network Analysis for Anomaly Traffic Detection
title	Data-Driven Network Analysis for Anomaly Traffic Detection
title_full	Data-Driven Network Analysis for Anomaly Traffic Detection
title_fullStr	Data-Driven Network Analysis for Anomaly Traffic Detection
title_full_unstemmed	Data-Driven Network Analysis for Anomaly Traffic Detection
title_short	Data-Driven Network Analysis for Anomaly Traffic Detection
title_sort	data-driven network analysis for anomaly traffic detection
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10574999/ https://www.ncbi.nlm.nih.gov/pubmed/37837004 http://dx.doi.org/10.3390/s23198174
work_keys_str_mv	AT alamshumon datadrivennetworkanalysisforanomalytrafficdetection AT alamyasin datadrivennetworkanalysisforanomalytrafficdetection AT cuisuxia datadrivennetworkanalysisforanomalytrafficdetection AT akujuobicajetan datadrivennetworkanalysisforanomalytrafficdetection

Data-Driven Network Analysis for Anomaly Traffic Detection

Ejemplares similares