Cargando…

Detecting phishing webpages via homology analysis of webpage structure

Phishing webpages are often generated by phishing kits or evolved from existing kits. Therefore, the homology analysis of phishing webpages can help curb the proliferation of phishing webpages from the source. Based on the observation that phishing webpages belonging to the same family have similar...

Descripción completa

Detalles Bibliográficos
Autores principales: Feng, Jian, Qiao, Yuqiang, Ye, Ou, Zhang, Ying
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044298/
https://www.ncbi.nlm.nih.gov/pubmed/35494880
http://dx.doi.org/10.7717/peerj-cs.868
_version_ 1784695075637297152
author Feng, Jian
Qiao, Yuqiang
Ye, Ou
Zhang, Ying
author_facet Feng, Jian
Qiao, Yuqiang
Ye, Ou
Zhang, Ying
author_sort Feng, Jian
collection PubMed
description Phishing webpages are often generated by phishing kits or evolved from existing kits. Therefore, the homology analysis of phishing webpages can help curb the proliferation of phishing webpages from the source. Based on the observation that phishing webpages belonging to the same family have similar page structures, a homology detection method based on webpage clustering according to structural similarity is proposed. The method consists of two stages. The first stage realizes model construction. Firstly, it extracts the structural features and style attributes of webpages through the document structure and vectorizes them, and then assigns different weights to different features, and measures the similarity of webpages and guides webpage clustering by webpage difference index. The second phase completes the detection of webpages to be tested. The fingerprint generation algorithm using double compressions generates fingerprints for the centres of the clusters and the webpages to be tested respectively and accelerates the detection process of the webpages to be tested through bitwise comparison. Experiments show that, compared with the existing methods, the proposed method can accurately locate the family of phishing webpages and can detect phishing webpages efficiently.
format Online
Article
Text
id pubmed-9044298
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-90442982022-04-28 Detecting phishing webpages via homology analysis of webpage structure Feng, Jian Qiao, Yuqiang Ye, Ou Zhang, Ying PeerJ Comput Sci Computer Networks and Communications Phishing webpages are often generated by phishing kits or evolved from existing kits. Therefore, the homology analysis of phishing webpages can help curb the proliferation of phishing webpages from the source. Based on the observation that phishing webpages belonging to the same family have similar page structures, a homology detection method based on webpage clustering according to structural similarity is proposed. The method consists of two stages. The first stage realizes model construction. Firstly, it extracts the structural features and style attributes of webpages through the document structure and vectorizes them, and then assigns different weights to different features, and measures the similarity of webpages and guides webpage clustering by webpage difference index. The second phase completes the detection of webpages to be tested. The fingerprint generation algorithm using double compressions generates fingerprints for the centres of the clusters and the webpages to be tested respectively and accelerates the detection process of the webpages to be tested through bitwise comparison. Experiments show that, compared with the existing methods, the proposed method can accurately locate the family of phishing webpages and can detect phishing webpages efficiently. PeerJ Inc. 2022-02-01 /pmc/articles/PMC9044298/ /pubmed/35494880 http://dx.doi.org/10.7717/peerj-cs.868 Text en © 2022 Feng et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Computer Networks and Communications
Feng, Jian
Qiao, Yuqiang
Ye, Ou
Zhang, Ying
Detecting phishing webpages via homology analysis of webpage structure
title Detecting phishing webpages via homology analysis of webpage structure
title_full Detecting phishing webpages via homology analysis of webpage structure
title_fullStr Detecting phishing webpages via homology analysis of webpage structure
title_full_unstemmed Detecting phishing webpages via homology analysis of webpage structure
title_short Detecting phishing webpages via homology analysis of webpage structure
title_sort detecting phishing webpages via homology analysis of webpage structure
topic Computer Networks and Communications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044298/
https://www.ncbi.nlm.nih.gov/pubmed/35494880
http://dx.doi.org/10.7717/peerj-cs.868
work_keys_str_mv AT fengjian detectingphishingwebpagesviahomologyanalysisofwebpagestructure
AT qiaoyuqiang detectingphishingwebpagesviahomologyanalysisofwebpagestructure
AT yeou detectingphishingwebpagesviahomologyanalysisofwebpagestructure
AT zhangying detectingphishingwebpagesviahomologyanalysisofwebpagestructure