Cargando…

JACLNet:Application of adaptive code length network in JavaScript malicious code detection

Currently, JavaScript malicious code detection methods are becoming more and more effective. Still, the existing methods based on deep learning are poor at detecting too long or too short JavaScript code. Based on this, this paper proposes an adaptive code length deep learning network JACLNet, compo...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Zhining, Wan, Liang, Chu, Kun, Li, Shusheng, Wei, Haodong, Tang, Lu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9749972/
https://www.ncbi.nlm.nih.gov/pubmed/36516186
http://dx.doi.org/10.1371/journal.pone.0277891
_version_ 1784850145727217664
author Zhang, Zhining
Wan, Liang
Chu, Kun
Li, Shusheng
Wei, Haodong
Tang, Lu
author_facet Zhang, Zhining
Wan, Liang
Chu, Kun
Li, Shusheng
Wei, Haodong
Tang, Lu
author_sort Zhang, Zhining
collection PubMed
description Currently, JavaScript malicious code detection methods are becoming more and more effective. Still, the existing methods based on deep learning are poor at detecting too long or too short JavaScript code. Based on this, this paper proposes an adaptive code length deep learning network JACLNet, composed of convolutional block RDCNet, BiLSTM and Transfrom, to capture the association features of the variable distance between codes. Firstly, an abstract syntax tree recombination algorithm is designed to provide rich syntax information for feature extraction. Secondly, a deep residual convolution block network (RDCNet) is designed to capture short-distance association features between codes. Finally, this paper proposes a JACLNet network for JavaScript malicious code detection. To verify that the model presented in this paper can effectively detect variable JavaScript code, we divide the datasets used in this paper into long text dataset DB_Long; short text dataset DB_Short, original dataset DB_Or and enhanced dataset DB_Re. In DB_Long, our method’s F1 − score is 98.87%, higher than that of JSContana by 2.52%. In DB_Short, our method’s F1-score is 97.32%, higher than that of JSContana by 7.79%. To verify that the abstract syntax tree recombination algorithm proposed in this paper can provide rich syntax information for subsequent models, we conduct comparative experiments on DB_Or and DB_Re. In DPCNN+BiLSTM, F1-score with abstract syntax tree recombination increased by 1.72%, and in JSContana, F1-score with abstract syntax tree recombination increased by 1.50%. F1-score with abstract syntax tree recombination in JACNet improved by 1.00% otherwise unused.
format Online
Article
Text
id pubmed-9749972
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-97499722022-12-15 JACLNet:Application of adaptive code length network in JavaScript malicious code detection Zhang, Zhining Wan, Liang Chu, Kun Li, Shusheng Wei, Haodong Tang, Lu PLoS One Research Article Currently, JavaScript malicious code detection methods are becoming more and more effective. Still, the existing methods based on deep learning are poor at detecting too long or too short JavaScript code. Based on this, this paper proposes an adaptive code length deep learning network JACLNet, composed of convolutional block RDCNet, BiLSTM and Transfrom, to capture the association features of the variable distance between codes. Firstly, an abstract syntax tree recombination algorithm is designed to provide rich syntax information for feature extraction. Secondly, a deep residual convolution block network (RDCNet) is designed to capture short-distance association features between codes. Finally, this paper proposes a JACLNet network for JavaScript malicious code detection. To verify that the model presented in this paper can effectively detect variable JavaScript code, we divide the datasets used in this paper into long text dataset DB_Long; short text dataset DB_Short, original dataset DB_Or and enhanced dataset DB_Re. In DB_Long, our method’s F1 − score is 98.87%, higher than that of JSContana by 2.52%. In DB_Short, our method’s F1-score is 97.32%, higher than that of JSContana by 7.79%. To verify that the abstract syntax tree recombination algorithm proposed in this paper can provide rich syntax information for subsequent models, we conduct comparative experiments on DB_Or and DB_Re. In DPCNN+BiLSTM, F1-score with abstract syntax tree recombination increased by 1.72%, and in JSContana, F1-score with abstract syntax tree recombination increased by 1.50%. F1-score with abstract syntax tree recombination in JACNet improved by 1.00% otherwise unused. Public Library of Science 2022-12-14 /pmc/articles/PMC9749972/ /pubmed/36516186 http://dx.doi.org/10.1371/journal.pone.0277891 Text en © 2022 Zhang et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Zhang, Zhining
Wan, Liang
Chu, Kun
Li, Shusheng
Wei, Haodong
Tang, Lu
JACLNet:Application of adaptive code length network in JavaScript malicious code detection
title JACLNet:Application of adaptive code length network in JavaScript malicious code detection
title_full JACLNet:Application of adaptive code length network in JavaScript malicious code detection
title_fullStr JACLNet:Application of adaptive code length network in JavaScript malicious code detection
title_full_unstemmed JACLNet:Application of adaptive code length network in JavaScript malicious code detection
title_short JACLNet:Application of adaptive code length network in JavaScript malicious code detection
title_sort jaclnet:application of adaptive code length network in javascript malicious code detection
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9749972/
https://www.ncbi.nlm.nih.gov/pubmed/36516186
http://dx.doi.org/10.1371/journal.pone.0277891
work_keys_str_mv AT zhangzhining jaclnetapplicationofadaptivecodelengthnetworkinjavascriptmaliciouscodedetection
AT wanliang jaclnetapplicationofadaptivecodelengthnetworkinjavascriptmaliciouscodedetection
AT chukun jaclnetapplicationofadaptivecodelengthnetworkinjavascriptmaliciouscodedetection
AT lishusheng jaclnetapplicationofadaptivecodelengthnetworkinjavascriptmaliciouscodedetection
AT weihaodong jaclnetapplicationofadaptivecodelengthnetworkinjavascriptmaliciouscodedetection
AT tanglu jaclnetapplicationofadaptivecodelengthnetworkinjavascriptmaliciouscodedetection