Cargando…

A hybrid DNN–LSTM model for detecting phishing URLs

Phishing is an attack targeting to imitate the official websites of corporations such as banks, e-commerce, financial institutions, and governmental institutions. Phishing websites aim to access and retrieve users’ important information such as personal identification, social security number, passwo...

Descripción completa

Detalles Bibliográficos
Autores principales: Ozcan, Alper, Catal, Cagatay, Donmez, Emrah, Senturk, Behcet
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer London 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8349600/
https://www.ncbi.nlm.nih.gov/pubmed/34393380
http://dx.doi.org/10.1007/s00521-021-06401-z
_version_ 1783735596823347200
author Ozcan, Alper
Catal, Cagatay
Donmez, Emrah
Senturk, Behcet
author_facet Ozcan, Alper
Catal, Cagatay
Donmez, Emrah
Senturk, Behcet
author_sort Ozcan, Alper
collection PubMed
description Phishing is an attack targeting to imitate the official websites of corporations such as banks, e-commerce, financial institutions, and governmental institutions. Phishing websites aim to access and retrieve users’ important information such as personal identification, social security number, password, e-mail, credit card, and other account information. Several anti-phishing techniques have been developed to cope with the increasing number of phishing attacks so far. Machine learning and particularly, deep learning algorithms are nowadays the most crucial techniques used to detect and prevent phishing attacks because of their strong learning abilities on massive datasets and their state-of-the-art results in many classification problems. Previously, two types of feature extraction techniques [i.e., character embedding-based and manual natural language processing (NLP) feature extraction] were used in isolation. However, researchers did not consolidate these features and therefore, the performance was not remarkable. Unlike previous works, our study presented an approach that utilizes both feature extraction techniques. We discussed how to combine these feature extraction techniques to fully utilize from the available data. This paper proposes hybrid deep learning models based on long short-term memory and deep neural network algorithms for detecting phishing uniform resource locator and evaluates the performance of the models on phishing datasets. The proposed hybrid deep learning models utilize both character embedding and NLP features, thereby simultaneously exploiting deep connections between characters and revealing NLP-based high-level connections. Experimental results showed that the proposed models achieve superior performance than the other phishing detection models in terms of accuracy metric.
format Online
Article
Text
id pubmed-8349600
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer London
record_format MEDLINE/PubMed
spelling pubmed-83496002021-08-09 A hybrid DNN–LSTM model for detecting phishing URLs Ozcan, Alper Catal, Cagatay Donmez, Emrah Senturk, Behcet Neural Comput Appl S.I. : Machine Learning Applications for Security Phishing is an attack targeting to imitate the official websites of corporations such as banks, e-commerce, financial institutions, and governmental institutions. Phishing websites aim to access and retrieve users’ important information such as personal identification, social security number, password, e-mail, credit card, and other account information. Several anti-phishing techniques have been developed to cope with the increasing number of phishing attacks so far. Machine learning and particularly, deep learning algorithms are nowadays the most crucial techniques used to detect and prevent phishing attacks because of their strong learning abilities on massive datasets and their state-of-the-art results in many classification problems. Previously, two types of feature extraction techniques [i.e., character embedding-based and manual natural language processing (NLP) feature extraction] were used in isolation. However, researchers did not consolidate these features and therefore, the performance was not remarkable. Unlike previous works, our study presented an approach that utilizes both feature extraction techniques. We discussed how to combine these feature extraction techniques to fully utilize from the available data. This paper proposes hybrid deep learning models based on long short-term memory and deep neural network algorithms for detecting phishing uniform resource locator and evaluates the performance of the models on phishing datasets. The proposed hybrid deep learning models utilize both character embedding and NLP features, thereby simultaneously exploiting deep connections between characters and revealing NLP-based high-level connections. Experimental results showed that the proposed models achieve superior performance than the other phishing detection models in terms of accuracy metric. Springer London 2021-08-08 2023 /pmc/articles/PMC8349600/ /pubmed/34393380 http://dx.doi.org/10.1007/s00521-021-06401-z Text en © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle S.I. : Machine Learning Applications for Security
Ozcan, Alper
Catal, Cagatay
Donmez, Emrah
Senturk, Behcet
A hybrid DNN–LSTM model for detecting phishing URLs
title A hybrid DNN–LSTM model for detecting phishing URLs
title_full A hybrid DNN–LSTM model for detecting phishing URLs
title_fullStr A hybrid DNN–LSTM model for detecting phishing URLs
title_full_unstemmed A hybrid DNN–LSTM model for detecting phishing URLs
title_short A hybrid DNN–LSTM model for detecting phishing URLs
title_sort hybrid dnn–lstm model for detecting phishing urls
topic S.I. : Machine Learning Applications for Security
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8349600/
https://www.ncbi.nlm.nih.gov/pubmed/34393380
http://dx.doi.org/10.1007/s00521-021-06401-z
work_keys_str_mv AT ozcanalper ahybriddnnlstmmodelfordetectingphishingurls
AT catalcagatay ahybriddnnlstmmodelfordetectingphishingurls
AT donmezemrah ahybriddnnlstmmodelfordetectingphishingurls
AT senturkbehcet ahybriddnnlstmmodelfordetectingphishingurls
AT ozcanalper hybriddnnlstmmodelfordetectingphishingurls
AT catalcagatay hybriddnnlstmmodelfordetectingphishingurls
AT donmezemrah hybriddnnlstmmodelfordetectingphishingurls
AT senturkbehcet hybriddnnlstmmodelfordetectingphishingurls