Cargando…

Building an Effective Classifier for Phishing Web Pages Detection: A Quantum-Inspired Biomimetic Paradigm Suitable for Big Data Analytics of Cyber Attacks

To combat malicious domains, which serve as a key platform for a wide range of attacks, domain name service (DNS) data provide rich traces of Internet activities and are a powerful resource. This paper presents new research that proposes a model for finding malicious domains by passively analyzing D...

Descripción completa

Detalles Bibliográficos
Autores principales: Darwish, Saad M., Farhan, Dheyauldeen A., Elzoghabi, Adel A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10204568/
https://www.ncbi.nlm.nih.gov/pubmed/37218783
http://dx.doi.org/10.3390/biomimetics8020197
_version_ 1785045864795865088
author Darwish, Saad M.
Farhan, Dheyauldeen A.
Elzoghabi, Adel A.
author_facet Darwish, Saad M.
Farhan, Dheyauldeen A.
Elzoghabi, Adel A.
author_sort Darwish, Saad M.
collection PubMed
description To combat malicious domains, which serve as a key platform for a wide range of attacks, domain name service (DNS) data provide rich traces of Internet activities and are a powerful resource. This paper presents new research that proposes a model for finding malicious domains by passively analyzing DNS data. The proposed model builds a real-time, accurate, middleweight, and fast classifier by combining a genetic algorithm for selecting DNS data features with a two-step quantum ant colony optimization (QABC) algorithm for classification. The modified two-step QABC classifier uses K-means instead of random initialization to place food sources. In order to overcome ABCs poor exploitation abilities and its convergence speed, this paper utilizes the metaheuristic QABC algorithm for global optimization problems inspired by quantum physics concepts. The use of the Hadoop framework and a hybrid machine learning approach (K-mean and QABC) to deal with the large size of uniform resource locator (URL) data is one of the main contributions of this paper. The major point is that blacklists, heavyweight classifiers (those that use more features), and lightweight classifiers (those that use fewer features and consume the features from the browser) may all be improved with the use of the suggested machine learning method. The results showed that the suggested model could work with more than 96.6% accuracy for more than 10 million query–answer pairs.
format Online
Article
Text
id pubmed-10204568
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-102045682023-05-24 Building an Effective Classifier for Phishing Web Pages Detection: A Quantum-Inspired Biomimetic Paradigm Suitable for Big Data Analytics of Cyber Attacks Darwish, Saad M. Farhan, Dheyauldeen A. Elzoghabi, Adel A. Biomimetics (Basel) Article To combat malicious domains, which serve as a key platform for a wide range of attacks, domain name service (DNS) data provide rich traces of Internet activities and are a powerful resource. This paper presents new research that proposes a model for finding malicious domains by passively analyzing DNS data. The proposed model builds a real-time, accurate, middleweight, and fast classifier by combining a genetic algorithm for selecting DNS data features with a two-step quantum ant colony optimization (QABC) algorithm for classification. The modified two-step QABC classifier uses K-means instead of random initialization to place food sources. In order to overcome ABCs poor exploitation abilities and its convergence speed, this paper utilizes the metaheuristic QABC algorithm for global optimization problems inspired by quantum physics concepts. The use of the Hadoop framework and a hybrid machine learning approach (K-mean and QABC) to deal with the large size of uniform resource locator (URL) data is one of the main contributions of this paper. The major point is that blacklists, heavyweight classifiers (those that use more features), and lightweight classifiers (those that use fewer features and consume the features from the browser) may all be improved with the use of the suggested machine learning method. The results showed that the suggested model could work with more than 96.6% accuracy for more than 10 million query–answer pairs. MDPI 2023-05-09 /pmc/articles/PMC10204568/ /pubmed/37218783 http://dx.doi.org/10.3390/biomimetics8020197 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Darwish, Saad M.
Farhan, Dheyauldeen A.
Elzoghabi, Adel A.
Building an Effective Classifier for Phishing Web Pages Detection: A Quantum-Inspired Biomimetic Paradigm Suitable for Big Data Analytics of Cyber Attacks
title Building an Effective Classifier for Phishing Web Pages Detection: A Quantum-Inspired Biomimetic Paradigm Suitable for Big Data Analytics of Cyber Attacks
title_full Building an Effective Classifier for Phishing Web Pages Detection: A Quantum-Inspired Biomimetic Paradigm Suitable for Big Data Analytics of Cyber Attacks
title_fullStr Building an Effective Classifier for Phishing Web Pages Detection: A Quantum-Inspired Biomimetic Paradigm Suitable for Big Data Analytics of Cyber Attacks
title_full_unstemmed Building an Effective Classifier for Phishing Web Pages Detection: A Quantum-Inspired Biomimetic Paradigm Suitable for Big Data Analytics of Cyber Attacks
title_short Building an Effective Classifier for Phishing Web Pages Detection: A Quantum-Inspired Biomimetic Paradigm Suitable for Big Data Analytics of Cyber Attacks
title_sort building an effective classifier for phishing web pages detection: a quantum-inspired biomimetic paradigm suitable for big data analytics of cyber attacks
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10204568/
https://www.ncbi.nlm.nih.gov/pubmed/37218783
http://dx.doi.org/10.3390/biomimetics8020197
work_keys_str_mv AT darwishsaadm buildinganeffectiveclassifierforphishingwebpagesdetectionaquantuminspiredbiomimeticparadigmsuitableforbigdataanalyticsofcyberattacks
AT farhandheyauldeena buildinganeffectiveclassifierforphishingwebpagesdetectionaquantuminspiredbiomimeticparadigmsuitableforbigdataanalyticsofcyberattacks
AT elzoghabiadela buildinganeffectiveclassifierforphishingwebpagesdetectionaquantuminspiredbiomimeticparadigmsuitableforbigdataanalyticsofcyberattacks