Cargando…
A Deep Learning Filter that Blocks Phishing Campaigns Using Intelligent English Text Recognition Methods
Most of the sophisticated attacks in the modern age of cybercrime are based, among other things, on specialized phishing campaigns. A challenge in identifying phishing campaigns is defining a classification of patterns that can be generalized and used in different areas and campaigns of a different...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9170453/ https://www.ncbi.nlm.nih.gov/pubmed/35677198 http://dx.doi.org/10.1155/2022/5036026 |
_version_ | 1784721430153265152 |
---|---|
author | Tang, Yonghui Wu, Fei |
author_facet | Tang, Yonghui Wu, Fei |
author_sort | Tang, Yonghui |
collection | PubMed |
description | Most of the sophisticated attacks in the modern age of cybercrime are based, among other things, on specialized phishing campaigns. A challenge in identifying phishing campaigns is defining a classification of patterns that can be generalized and used in different areas and campaigns of a different nature. Although efforts have been made to establish a general labeling scheme in their classification, there is still limited data labeled in such a format. The usual approaches are based on feature engineering to correctly identify phishing campaigns, exporting lexical, syntactic, and semantic features, e.g., previous phrases. In this context, the most recent approaches have taken advantage of modern neural network architectures to record hidden information at the phrase and text levels, e.g., Long Short-Term Memory (LSTM) and Convolutional Neural Networks (CNNs). However, these models lose semantic information related to the specific problem, resulting in a variation in their performance, depending on the different data sets and the corresponding standards used for labeling. In this paper, we propose to extend word embeddings with word vectors that indicate the semantic similarity of each word with each phishing campaigns template tag. These embedded keywords are calculated based on semantic subfields corresponding to each phishing campaign tag, constructed based on the automatic extraction of keywords representing these tags. Combining general word integrations with vectors is calculated based on word similarity using a set of sequential Kalman filters, which can then power any neural architecture such as LSTM or CNN to predict each phishing campaign. Our experiments use a data indicator to evaluate our approach and achieve remarkable results that reinforce the state-of-the-art. |
format | Online Article Text |
id | pubmed-9170453 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-91704532022-06-07 A Deep Learning Filter that Blocks Phishing Campaigns Using Intelligent English Text Recognition Methods Tang, Yonghui Wu, Fei Appl Bionics Biomech Research Article Most of the sophisticated attacks in the modern age of cybercrime are based, among other things, on specialized phishing campaigns. A challenge in identifying phishing campaigns is defining a classification of patterns that can be generalized and used in different areas and campaigns of a different nature. Although efforts have been made to establish a general labeling scheme in their classification, there is still limited data labeled in such a format. The usual approaches are based on feature engineering to correctly identify phishing campaigns, exporting lexical, syntactic, and semantic features, e.g., previous phrases. In this context, the most recent approaches have taken advantage of modern neural network architectures to record hidden information at the phrase and text levels, e.g., Long Short-Term Memory (LSTM) and Convolutional Neural Networks (CNNs). However, these models lose semantic information related to the specific problem, resulting in a variation in their performance, depending on the different data sets and the corresponding standards used for labeling. In this paper, we propose to extend word embeddings with word vectors that indicate the semantic similarity of each word with each phishing campaigns template tag. These embedded keywords are calculated based on semantic subfields corresponding to each phishing campaign tag, constructed based on the automatic extraction of keywords representing these tags. Combining general word integrations with vectors is calculated based on word similarity using a set of sequential Kalman filters, which can then power any neural architecture such as LSTM or CNN to predict each phishing campaign. Our experiments use a data indicator to evaluate our approach and achieve remarkable results that reinforce the state-of-the-art. Hindawi 2022-05-30 /pmc/articles/PMC9170453/ /pubmed/35677198 http://dx.doi.org/10.1155/2022/5036026 Text en Copyright © 2022 Yonghui Tang and Fei Wu. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Tang, Yonghui Wu, Fei A Deep Learning Filter that Blocks Phishing Campaigns Using Intelligent English Text Recognition Methods |
title | A Deep Learning Filter that Blocks Phishing Campaigns Using Intelligent English Text Recognition Methods |
title_full | A Deep Learning Filter that Blocks Phishing Campaigns Using Intelligent English Text Recognition Methods |
title_fullStr | A Deep Learning Filter that Blocks Phishing Campaigns Using Intelligent English Text Recognition Methods |
title_full_unstemmed | A Deep Learning Filter that Blocks Phishing Campaigns Using Intelligent English Text Recognition Methods |
title_short | A Deep Learning Filter that Blocks Phishing Campaigns Using Intelligent English Text Recognition Methods |
title_sort | deep learning filter that blocks phishing campaigns using intelligent english text recognition methods |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9170453/ https://www.ncbi.nlm.nih.gov/pubmed/35677198 http://dx.doi.org/10.1155/2022/5036026 |
work_keys_str_mv | AT tangyonghui adeeplearningfilterthatblocksphishingcampaignsusingintelligentenglishtextrecognitionmethods AT wufei adeeplearningfilterthatblocksphishingcampaignsusingintelligentenglishtextrecognitionmethods AT tangyonghui deeplearningfilterthatblocksphishingcampaignsusingintelligentenglishtextrecognitionmethods AT wufei deeplearningfilterthatblocksphishingcampaignsusingintelligentenglishtextrecognitionmethods |