Cargando…

A Hybrid Approach for Alluring Ads Phishing Attack Detection Using Machine Learning

Phishing attacks are evolving with more sophisticated techniques, posing significant threats. Considering the potential of machine-learning-based approaches, our research presents a similar modern approach for web phishing detection by applying powerful machine learning algorithms. An efficient laye...

Descripción completa

Detalles Bibliográficos
Autores principales: Shaukat, Muhammad Waqas, Amin, Rashid, Muslam, Muhana Magboul Ali, Alshehri, Asma Hassan, Xie, Jiang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10575062/
https://www.ncbi.nlm.nih.gov/pubmed/37836902
http://dx.doi.org/10.3390/s23198070
_version_ 1785120835864887296
author Shaukat, Muhammad Waqas
Amin, Rashid
Muslam, Muhana Magboul Ali
Alshehri, Asma Hassan
Xie, Jiang
author_facet Shaukat, Muhammad Waqas
Amin, Rashid
Muslam, Muhana Magboul Ali
Alshehri, Asma Hassan
Xie, Jiang
author_sort Shaukat, Muhammad Waqas
collection PubMed
description Phishing attacks are evolving with more sophisticated techniques, posing significant threats. Considering the potential of machine-learning-based approaches, our research presents a similar modern approach for web phishing detection by applying powerful machine learning algorithms. An efficient layered classification model is proposed to detect websites based on their URL structure, text, and image features. Previously, similar studies have used machine learning techniques for URL features with a limited dataset. In our research, we have used a large dataset of 20,000 website URLs, and 22 salient features from each URL are extracted to prepare a comprehensive dataset. Along with this, another dataset containing website text is also prepared for NLP-based text evaluation. It is seen that many phishing websites contain text as images, and to handle this, the text from images is extracted to classify it as spam or legitimate. The experimental evaluation demonstrated efficient and accurate phishing detection. Our layered classification model uses support vector machine (SVM), XGBoost, random forest, multilayer perceptron, linear regression, decision tree, naïve Bayes, and SVC algorithms. The performance evaluation revealed that the XGBoost algorithm outperformed other applied models with maximum accuracy and precision of 94% in the training phase and 91% in the testing phase. Multilayer perceptron also worked well with an accuracy of 91% in the testing phase. The accuracy results for random forest and decision tree were 91% and 90%, respectively. Logistic regression and SVM algorithms were used in the text-based classification, and the accuracy was found to be 87% and 88%, respectively. With these precision values, the models classified phishing and legitimate websites very well, based on URL, text, and image features. This research contributes to early detection of sophisticated phishing attacks, enhancing internet user security.
format Online
Article
Text
id pubmed-10575062
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-105750622023-10-14 A Hybrid Approach for Alluring Ads Phishing Attack Detection Using Machine Learning Shaukat, Muhammad Waqas Amin, Rashid Muslam, Muhana Magboul Ali Alshehri, Asma Hassan Xie, Jiang Sensors (Basel) Article Phishing attacks are evolving with more sophisticated techniques, posing significant threats. Considering the potential of machine-learning-based approaches, our research presents a similar modern approach for web phishing detection by applying powerful machine learning algorithms. An efficient layered classification model is proposed to detect websites based on their URL structure, text, and image features. Previously, similar studies have used machine learning techniques for URL features with a limited dataset. In our research, we have used a large dataset of 20,000 website URLs, and 22 salient features from each URL are extracted to prepare a comprehensive dataset. Along with this, another dataset containing website text is also prepared for NLP-based text evaluation. It is seen that many phishing websites contain text as images, and to handle this, the text from images is extracted to classify it as spam or legitimate. The experimental evaluation demonstrated efficient and accurate phishing detection. Our layered classification model uses support vector machine (SVM), XGBoost, random forest, multilayer perceptron, linear regression, decision tree, naïve Bayes, and SVC algorithms. The performance evaluation revealed that the XGBoost algorithm outperformed other applied models with maximum accuracy and precision of 94% in the training phase and 91% in the testing phase. Multilayer perceptron also worked well with an accuracy of 91% in the testing phase. The accuracy results for random forest and decision tree were 91% and 90%, respectively. Logistic regression and SVM algorithms were used in the text-based classification, and the accuracy was found to be 87% and 88%, respectively. With these precision values, the models classified phishing and legitimate websites very well, based on URL, text, and image features. This research contributes to early detection of sophisticated phishing attacks, enhancing internet user security. MDPI 2023-09-25 /pmc/articles/PMC10575062/ /pubmed/37836902 http://dx.doi.org/10.3390/s23198070 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Shaukat, Muhammad Waqas
Amin, Rashid
Muslam, Muhana Magboul Ali
Alshehri, Asma Hassan
Xie, Jiang
A Hybrid Approach for Alluring Ads Phishing Attack Detection Using Machine Learning
title A Hybrid Approach for Alluring Ads Phishing Attack Detection Using Machine Learning
title_full A Hybrid Approach for Alluring Ads Phishing Attack Detection Using Machine Learning
title_fullStr A Hybrid Approach for Alluring Ads Phishing Attack Detection Using Machine Learning
title_full_unstemmed A Hybrid Approach for Alluring Ads Phishing Attack Detection Using Machine Learning
title_short A Hybrid Approach for Alluring Ads Phishing Attack Detection Using Machine Learning
title_sort hybrid approach for alluring ads phishing attack detection using machine learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10575062/
https://www.ncbi.nlm.nih.gov/pubmed/37836902
http://dx.doi.org/10.3390/s23198070
work_keys_str_mv AT shaukatmuhammadwaqas ahybridapproachforalluringadsphishingattackdetectionusingmachinelearning
AT aminrashid ahybridapproachforalluringadsphishingattackdetectionusingmachinelearning
AT muslammuhanamagboulali ahybridapproachforalluringadsphishingattackdetectionusingmachinelearning
AT alshehriasmahassan ahybridapproachforalluringadsphishingattackdetectionusingmachinelearning
AT xiejiang ahybridapproachforalluringadsphishingattackdetectionusingmachinelearning
AT shaukatmuhammadwaqas hybridapproachforalluringadsphishingattackdetectionusingmachinelearning
AT aminrashid hybridapproachforalluringadsphishingattackdetectionusingmachinelearning
AT muslammuhanamagboulali hybridapproachforalluringadsphishingattackdetectionusingmachinelearning
AT alshehriasmahassan hybridapproachforalluringadsphishingattackdetectionusingmachinelearning
AT xiejiang hybridapproachforalluringadsphishingattackdetectionusingmachinelearning