Cargando…

DReLAB - Deep REinforcement Learning Adversarial Botnet: A benchmark dataset for adversarial attacks against botnet Intrusion Detection Systems

We present the first dataset that aims to serve as a benchmark to validate the resilience of botnet detectors against adversarial attacks. This dataset includes realistic adversarial samples that are generated by leveraging two widely used Deep Reinforcement Learning (DRL) techniques. These adversar...

Descripción completa

Detalles Bibliográficos
Autores principales:	Venturi, Andrea, Apruzzese, Giovanni, Andreolini, Mauro, Colajanni, Michele, Marchetti, Mirco
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Elsevier 2020
Materias:	Data Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7749366/ https://www.ncbi.nlm.nih.gov/pubmed/33365367 http://dx.doi.org/10.1016/j.dib.2020.106631

_version_	1783625287685111808
author	Venturi, Andrea Apruzzese, Giovanni Andreolini, Mauro Colajanni, Michele Marchetti, Mirco
author_facet	Venturi, Andrea Apruzzese, Giovanni Andreolini, Mauro Colajanni, Michele Marchetti, Mirco
author_sort	Venturi, Andrea
collection	PubMed
description	We present the first dataset that aims to serve as a benchmark to validate the resilience of botnet detectors against adversarial attacks. This dataset includes realistic adversarial samples that are generated by leveraging two widely used Deep Reinforcement Learning (DRL) techniques. These adversarial samples are proved to evade state of the art detectors based on Machine- and Deep-Learning algorithms. The initial corpus of malicious samples consists of network flows belonging to different botnet families presented in three public datasets containing real enterprise network traffic. We use these datasets to devise detectors capable of achieving state-of-the-art performance. We then train two DRL agents, based on Double Deep Q-Network and Deep Sarsa, to generate realistic adversarial samples: the goal is achieving misclassifications by performing small modifications to the initial malicious samples. These alterations involve the features that can be more realistically altered by an expert attacker, and do not compromise the underlying malicious logic of the original samples. Our dataset represents an important contribution to the cybersecurity research community as it is the first including thousands of automatically generated adversarial samples that are able to thwart state of the art classifiers with a high evasion rate. The adversarial samples are grouped by malware variant and provided in a CSV file format. Researchers can validate their defensive proposals by testing their detectors against the adversarial samples of the proposed dataset. Moreover, the analysis of these samples can pave the way to a deeper comprehension of adversarial attacks and to some sort of explainability of machine learning defensive algorithms. They can also support the definition of novel effective defensive techniques.
format	Online Article Text
id	pubmed-7749366
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	Elsevier
record_format	MEDLINE/PubMed
spelling	pubmed-77493662020-12-22 DReLAB - Deep REinforcement Learning Adversarial Botnet: A benchmark dataset for adversarial attacks against botnet Intrusion Detection Systems Venturi, Andrea Apruzzese, Giovanni Andreolini, Mauro Colajanni, Michele Marchetti, Mirco Data Brief Data Article We present the first dataset that aims to serve as a benchmark to validate the resilience of botnet detectors against adversarial attacks. This dataset includes realistic adversarial samples that are generated by leveraging two widely used Deep Reinforcement Learning (DRL) techniques. These adversarial samples are proved to evade state of the art detectors based on Machine- and Deep-Learning algorithms. The initial corpus of malicious samples consists of network flows belonging to different botnet families presented in three public datasets containing real enterprise network traffic. We use these datasets to devise detectors capable of achieving state-of-the-art performance. We then train two DRL agents, based on Double Deep Q-Network and Deep Sarsa, to generate realistic adversarial samples: the goal is achieving misclassifications by performing small modifications to the initial malicious samples. These alterations involve the features that can be more realistically altered by an expert attacker, and do not compromise the underlying malicious logic of the original samples. Our dataset represents an important contribution to the cybersecurity research community as it is the first including thousands of automatically generated adversarial samples that are able to thwart state of the art classifiers with a high evasion rate. The adversarial samples are grouped by malware variant and provided in a CSV file format. Researchers can validate their defensive proposals by testing their detectors against the adversarial samples of the proposed dataset. Moreover, the analysis of these samples can pave the way to a deeper comprehension of adversarial attacks and to some sort of explainability of machine learning defensive algorithms. They can also support the definition of novel effective defensive techniques. Elsevier 2020-12-08 /pmc/articles/PMC7749366/ /pubmed/33365367 http://dx.doi.org/10.1016/j.dib.2020.106631 Text en © 2020 The Authors. Published by Elsevier Inc. http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Data Article Venturi, Andrea Apruzzese, Giovanni Andreolini, Mauro Colajanni, Michele Marchetti, Mirco DReLAB - Deep REinforcement Learning Adversarial Botnet: A benchmark dataset for adversarial attacks against botnet Intrusion Detection Systems
title	DReLAB - Deep REinforcement Learning Adversarial Botnet: A benchmark dataset for adversarial attacks against botnet Intrusion Detection Systems
title_full	DReLAB - Deep REinforcement Learning Adversarial Botnet: A benchmark dataset for adversarial attacks against botnet Intrusion Detection Systems
title_fullStr	DReLAB - Deep REinforcement Learning Adversarial Botnet: A benchmark dataset for adversarial attacks against botnet Intrusion Detection Systems
title_full_unstemmed	DReLAB - Deep REinforcement Learning Adversarial Botnet: A benchmark dataset for adversarial attacks against botnet Intrusion Detection Systems
title_short	DReLAB - Deep REinforcement Learning Adversarial Botnet: A benchmark dataset for adversarial attacks against botnet Intrusion Detection Systems
title_sort	drelab - deep reinforcement learning adversarial botnet: a benchmark dataset for adversarial attacks against botnet intrusion detection systems
topic	Data Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7749366/ https://www.ncbi.nlm.nih.gov/pubmed/33365367 http://dx.doi.org/10.1016/j.dib.2020.106631
work_keys_str_mv	AT venturiandrea drelabdeepreinforcementlearningadversarialbotnetabenchmarkdatasetforadversarialattacksagainstbotnetintrusiondetectionsystems AT apruzzesegiovanni drelabdeepreinforcementlearningadversarialbotnetabenchmarkdatasetforadversarialattacksagainstbotnetintrusiondetectionsystems AT andreolinimauro drelabdeepreinforcementlearningadversarialbotnetabenchmarkdatasetforadversarialattacksagainstbotnetintrusiondetectionsystems AT colajannimichele drelabdeepreinforcementlearningadversarialbotnetabenchmarkdatasetforadversarialattacksagainstbotnetintrusiondetectionsystems AT marchettimirco drelabdeepreinforcementlearningadversarialbotnetabenchmarkdatasetforadversarialattacksagainstbotnetintrusiondetectionsystems

DReLAB - Deep REinforcement Learning Adversarial Botnet: A benchmark dataset for adversarial attacks against botnet Intrusion Detection Systems

Ejemplares similares