Cargando…
Detecting phishing webpages via homology analysis of webpage structure
Phishing webpages are often generated by phishing kits or evolved from existing kits. Therefore, the homology analysis of phishing webpages can help curb the proliferation of phishing webpages from the source. Based on the observation that phishing webpages belonging to the same family have similar...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044298/ https://www.ncbi.nlm.nih.gov/pubmed/35494880 http://dx.doi.org/10.7717/peerj-cs.868 |
_version_ | 1784695075637297152 |
---|---|
author | Feng, Jian Qiao, Yuqiang Ye, Ou Zhang, Ying |
author_facet | Feng, Jian Qiao, Yuqiang Ye, Ou Zhang, Ying |
author_sort | Feng, Jian |
collection | PubMed |
description | Phishing webpages are often generated by phishing kits or evolved from existing kits. Therefore, the homology analysis of phishing webpages can help curb the proliferation of phishing webpages from the source. Based on the observation that phishing webpages belonging to the same family have similar page structures, a homology detection method based on webpage clustering according to structural similarity is proposed. The method consists of two stages. The first stage realizes model construction. Firstly, it extracts the structural features and style attributes of webpages through the document structure and vectorizes them, and then assigns different weights to different features, and measures the similarity of webpages and guides webpage clustering by webpage difference index. The second phase completes the detection of webpages to be tested. The fingerprint generation algorithm using double compressions generates fingerprints for the centres of the clusters and the webpages to be tested respectively and accelerates the detection process of the webpages to be tested through bitwise comparison. Experiments show that, compared with the existing methods, the proposed method can accurately locate the family of phishing webpages and can detect phishing webpages efficiently. |
format | Online Article Text |
id | pubmed-9044298 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-90442982022-04-28 Detecting phishing webpages via homology analysis of webpage structure Feng, Jian Qiao, Yuqiang Ye, Ou Zhang, Ying PeerJ Comput Sci Computer Networks and Communications Phishing webpages are often generated by phishing kits or evolved from existing kits. Therefore, the homology analysis of phishing webpages can help curb the proliferation of phishing webpages from the source. Based on the observation that phishing webpages belonging to the same family have similar page structures, a homology detection method based on webpage clustering according to structural similarity is proposed. The method consists of two stages. The first stage realizes model construction. Firstly, it extracts the structural features and style attributes of webpages through the document structure and vectorizes them, and then assigns different weights to different features, and measures the similarity of webpages and guides webpage clustering by webpage difference index. The second phase completes the detection of webpages to be tested. The fingerprint generation algorithm using double compressions generates fingerprints for the centres of the clusters and the webpages to be tested respectively and accelerates the detection process of the webpages to be tested through bitwise comparison. Experiments show that, compared with the existing methods, the proposed method can accurately locate the family of phishing webpages and can detect phishing webpages efficiently. PeerJ Inc. 2022-02-01 /pmc/articles/PMC9044298/ /pubmed/35494880 http://dx.doi.org/10.7717/peerj-cs.868 Text en © 2022 Feng et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
spellingShingle | Computer Networks and Communications Feng, Jian Qiao, Yuqiang Ye, Ou Zhang, Ying Detecting phishing webpages via homology analysis of webpage structure |
title | Detecting phishing webpages via homology analysis of webpage structure |
title_full | Detecting phishing webpages via homology analysis of webpage structure |
title_fullStr | Detecting phishing webpages via homology analysis of webpage structure |
title_full_unstemmed | Detecting phishing webpages via homology analysis of webpage structure |
title_short | Detecting phishing webpages via homology analysis of webpage structure |
title_sort | detecting phishing webpages via homology analysis of webpage structure |
topic | Computer Networks and Communications |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044298/ https://www.ncbi.nlm.nih.gov/pubmed/35494880 http://dx.doi.org/10.7717/peerj-cs.868 |
work_keys_str_mv | AT fengjian detectingphishingwebpagesviahomologyanalysisofwebpagestructure AT qiaoyuqiang detectingphishingwebpagesviahomologyanalysisofwebpagestructure AT yeou detectingphishingwebpagesviahomologyanalysisofwebpagestructure AT zhangying detectingphishingwebpagesviahomologyanalysisofwebpagestructure |