Application of Improved Asynchronous Advantage Actor Critic Reinforcement Learning Model on Anomaly Detection

Anomaly detection has traditionally been studied using mathematical and statistical methods, and it has been widely applied in many fields. Recently, reinforcement learning has achieved exceptional success in areas such as Go (AlphaGo) and video games. However, ther...

Bibliographic Details
Main Authors: Zhou, Kun; Wang, Wenyong; Hu, Teng; Deng, Kai
Format: Online Article Text
Language: English
Published: MDPI 2021
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7996251/
https://www.ncbi.nlm.nih.gov/pubmed/33668769
http://dx.doi.org/10.3390/e23030274
_version_ 1783670075269578752
author Zhou, Kun
Wang, Wenyong
Hu, Teng
Deng, Kai
author_facet Zhou, Kun
Wang, Wenyong
Hu, Teng
Deng, Kai
author_sort Zhou, Kun
collection PubMed
description Anomaly detection has traditionally been studied using mathematical and statistical methods, and it has been widely applied in many fields. Recently, reinforcement learning has achieved exceptional success in areas such as Go (AlphaGo) and video games. However, little research has applied reinforcement learning to anomaly detection. This paper therefore proposed an adaptable asynchronous advantage actor-critic reinforcement learning model for this field. Its performance was evaluated and compared with classical machine learning models and with the generative adversarial model and its variants. The basic principles of the related models were introduced first, followed by the problem definitions, modelling process and testing. The proposed model distinguished sequence and image anomalies from other anomalies by applying appropriate neural networks, an attention mechanism and a convolutional network, to the two kinds of anomalies, respectively. Finally, performance was evaluated and compared with classical models on public benchmark datasets (NSL-KDD, AWID, CICIDS-2017 and DoHBrw-2020). Experiments confirmed the effectiveness of the proposed model, with results indicating higher rewards and lower loss rates on the datasets during training and testing. Precision, recall and F1 score were higher than or at least comparable to those of state-of-the-art models. We concluded that the proposed model could outperform, or at least achieve results comparable to, existing anomaly detection models.
format Online
Article
Text
id pubmed-7996251
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-7996251 2021-03-27 Application of Improved Asynchronous Advantage Actor Critic Reinforcement Learning Model on Anomaly Detection Zhou, Kun Wang, Wenyong Hu, Teng Deng, Kai Entropy (Basel) Article MDPI 2021-02-25 /pmc/articles/PMC7996251/ /pubmed/33668769 http://dx.doi.org/10.3390/e23030274 Text en © 2021 by the authors.
https://creativecommons.org/licenses/by/4.0/ Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhou, Kun
Wang, Wenyong
Hu, Teng
Deng, Kai
Application of Improved Asynchronous Advantage Actor Critic Reinforcement Learning Model on Anomaly Detection
title Application of Improved Asynchronous Advantage Actor Critic Reinforcement Learning Model on Anomaly Detection
title_full Application of Improved Asynchronous Advantage Actor Critic Reinforcement Learning Model on Anomaly Detection
title_fullStr Application of Improved Asynchronous Advantage Actor Critic Reinforcement Learning Model on Anomaly Detection
title_full_unstemmed Application of Improved Asynchronous Advantage Actor Critic Reinforcement Learning Model on Anomaly Detection
title_short Application of Improved Asynchronous Advantage Actor Critic Reinforcement Learning Model on Anomaly Detection
title_sort application of improved asynchronous advantage actor critic reinforcement learning model on anomaly detection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7996251/
https://www.ncbi.nlm.nih.gov/pubmed/33668769
http://dx.doi.org/10.3390/e23030274
work_keys_str_mv AT zhoukun applicationofimprovedasynchronousadvantageactorcriticreinforcementlearningmodelonanomalydetection
AT wangwenyong applicationofimprovedasynchronousadvantageactorcriticreinforcementlearningmodelonanomalydetection
AT huteng applicationofimprovedasynchronousadvantageactorcriticreinforcementlearningmodelonanomalydetection
AT dengkai applicationofimprovedasynchronousadvantageactorcriticreinforcementlearningmodelonanomalydetection