Cargando…

Phishing Email Detection Based on Binary Search Feature Selection

Phishing has appeared as a critical issue in the cybersecurity domain. Phishers adopt email as one of their major channels of communication to lure potential victims. This paper attempts to detect phishing emails by using binary search feature selection (BSFS) with a Pearson correlation coefficient...

Descripción completa

Detalles Bibliográficos
Autor principal: Sonowal, Gunikhan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Singapore 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7275664/
https://www.ncbi.nlm.nih.gov/pubmed/33063047
http://dx.doi.org/10.1007/s42979-020-00194-z
_version_ 1783542833792155648
author Sonowal, Gunikhan
author_facet Sonowal, Gunikhan
author_sort Sonowal, Gunikhan
collection PubMed
description Phishing has appeared as a critical issue in the cybersecurity domain. Phishers adopt email as one of their major channels of communication to lure potential victims. This paper attempts to detect phishing emails by using binary search feature selection (BSFS) with a Pearson correlation coefficient algorithm as a ranking method. The proposed method utilizes four sets of features from the email subject, the body of the email, hyperlinks, and readability of contents. Overall, 41 features were selected from the aforementioned four dimensions. The result shows that the BSFS method evaluated the accuracy of 97.41% in comparison with SFFS (95.63%) and WFS (95.56%). This exploration shows that the SFFS requires more time to ascertain the optimum features set and the WFS requires the least time; however, the accuracy of WFS is very low in comparison with other algorithms. The significant finding of the experiment is that the BFSF requires the least time to evaluate the best feature set with better accuracy even though few features are removed from the feature corpus.
format Online
Article
Text
id pubmed-7275664
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Springer Singapore
record_format MEDLINE/PubMed
spelling pubmed-72756642020-06-08 Phishing Email Detection Based on Binary Search Feature Selection Sonowal, Gunikhan SN COMPUT. SCI. Original Research Phishing has appeared as a critical issue in the cybersecurity domain. Phishers adopt email as one of their major channels of communication to lure potential victims. This paper attempts to detect phishing emails by using binary search feature selection (BSFS) with a Pearson correlation coefficient algorithm as a ranking method. The proposed method utilizes four sets of features from the email subject, the body of the email, hyperlinks, and readability of contents. Overall, 41 features were selected from the aforementioned four dimensions. The result shows that the BSFS method evaluated the accuracy of 97.41% in comparison with SFFS (95.63%) and WFS (95.56%). This exploration shows that the SFFS requires more time to ascertain the optimum features set and the WFS requires the least time; however, the accuracy of WFS is very low in comparison with other algorithms. The significant finding of the experiment is that the BFSF requires the least time to evaluate the best feature set with better accuracy even though few features are removed from the feature corpus. Springer Singapore 2020-06-06 2020 /pmc/articles/PMC7275664/ /pubmed/33063047 http://dx.doi.org/10.1007/s42979-020-00194-z Text en © Springer Nature Singapore Pte Ltd 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Original Research
Sonowal, Gunikhan
Phishing Email Detection Based on Binary Search Feature Selection
title Phishing Email Detection Based on Binary Search Feature Selection
title_full Phishing Email Detection Based on Binary Search Feature Selection
title_fullStr Phishing Email Detection Based on Binary Search Feature Selection
title_full_unstemmed Phishing Email Detection Based on Binary Search Feature Selection
title_short Phishing Email Detection Based on Binary Search Feature Selection
title_sort phishing email detection based on binary search feature selection
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7275664/
https://www.ncbi.nlm.nih.gov/pubmed/33063047
http://dx.doi.org/10.1007/s42979-020-00194-z
work_keys_str_mv AT sonowalgunikhan phishingemaildetectionbasedonbinarysearchfeatureselection