Cargando…

An improved two-stage label propagation algorithm based on LeaderRank

To solve the problems of poor stability and low modularity (Q) of community division results caused by the randomness of node selection and label update in the traditional label propagation algorithm, an improved two-stage label propagation algorithm based on LeaderRank was proposed in this study. I...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Miaomiao, Yang, Jinyun, Guo, Jingfeng, Chen, Jing, Zhang, Yongsheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9454888/
https://www.ncbi.nlm.nih.gov/pubmed/36091993
http://dx.doi.org/10.7717/peerj-cs.981
_version_ 1784785458752913408
author Liu, Miaomiao
Yang, Jinyun
Guo, Jingfeng
Chen, Jing
Zhang, Yongsheng
author_facet Liu, Miaomiao
Yang, Jinyun
Guo, Jingfeng
Chen, Jing
Zhang, Yongsheng
author_sort Liu, Miaomiao
collection PubMed
description To solve the problems of poor stability and low modularity (Q) of community division results caused by the randomness of node selection and label update in the traditional label propagation algorithm, an improved two-stage label propagation algorithm based on LeaderRank was proposed in this study. In the first stage, the order of node updating was determined by the participation coefficient (PC). Then, a new similarity measure was defined to improve the label selection mechanism so as to solve the problem of label oscillation caused by multiple labels of the node with the most similarity to the node. Moreover, the influence of the nodes was comprehensively used to find the initial community structure. In the second stage, the rough communities obtained in the first stage were regarded as nodes, and their merging sequence was determined by the PC. Next, the non-weak community and the community with the largest number of connected edges were combined. Finally, the community structure was further optimized to improve the modularity so as to obtain the final partition result. Experiments were performed on nine classic realistic networks and 19 artificial datasets with different scales, complexities, and densities. The modularity and normalized mutual information (NMI) were used as evaluation indexes for comparing the improved algorithm with dozens of relevant classic algorithms. The results showed that the proposed algorithm yields superior performance, and the results of community partitioning obtained using the improved algorithm were stable and more accurate than those obtained using other algorithms. In addition, the proposed algorithm always performs well in nine large-scale artificial data sets with 6,000 to 50,000 nodes and three large realistic network datasets, which verifies its computational performance and utility in community detection for large-scale networks.
format Online
Article
Text
id pubmed-9454888
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-94548882022-09-09 An improved two-stage label propagation algorithm based on LeaderRank Liu, Miaomiao Yang, Jinyun Guo, Jingfeng Chen, Jing Zhang, Yongsheng PeerJ Comput Sci Artificial Intelligence To solve the problems of poor stability and low modularity (Q) of community division results caused by the randomness of node selection and label update in the traditional label propagation algorithm, an improved two-stage label propagation algorithm based on LeaderRank was proposed in this study. In the first stage, the order of node updating was determined by the participation coefficient (PC). Then, a new similarity measure was defined to improve the label selection mechanism so as to solve the problem of label oscillation caused by multiple labels of the node with the most similarity to the node. Moreover, the influence of the nodes was comprehensively used to find the initial community structure. In the second stage, the rough communities obtained in the first stage were regarded as nodes, and their merging sequence was determined by the PC. Next, the non-weak community and the community with the largest number of connected edges were combined. Finally, the community structure was further optimized to improve the modularity so as to obtain the final partition result. Experiments were performed on nine classic realistic networks and 19 artificial datasets with different scales, complexities, and densities. The modularity and normalized mutual information (NMI) were used as evaluation indexes for comparing the improved algorithm with dozens of relevant classic algorithms. The results showed that the proposed algorithm yields superior performance, and the results of community partitioning obtained using the improved algorithm were stable and more accurate than those obtained using other algorithms. In addition, the proposed algorithm always performs well in nine large-scale artificial data sets with 6,000 to 50,000 nodes and three large realistic network datasets, which verifies its computational performance and utility in community detection for large-scale networks. PeerJ Inc. 2022-05-18 /pmc/articles/PMC9454888/ /pubmed/36091993 http://dx.doi.org/10.7717/peerj-cs.981 Text en © 2022 Liu et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Artificial Intelligence
Liu, Miaomiao
Yang, Jinyun
Guo, Jingfeng
Chen, Jing
Zhang, Yongsheng
An improved two-stage label propagation algorithm based on LeaderRank
title An improved two-stage label propagation algorithm based on LeaderRank
title_full An improved two-stage label propagation algorithm based on LeaderRank
title_fullStr An improved two-stage label propagation algorithm based on LeaderRank
title_full_unstemmed An improved two-stage label propagation algorithm based on LeaderRank
title_short An improved two-stage label propagation algorithm based on LeaderRank
title_sort improved two-stage label propagation algorithm based on leaderrank
topic Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9454888/
https://www.ncbi.nlm.nih.gov/pubmed/36091993
http://dx.doi.org/10.7717/peerj-cs.981
work_keys_str_mv AT liumiaomiao animprovedtwostagelabelpropagationalgorithmbasedonleaderrank
AT yangjinyun animprovedtwostagelabelpropagationalgorithmbasedonleaderrank
AT guojingfeng animprovedtwostagelabelpropagationalgorithmbasedonleaderrank
AT chenjing animprovedtwostagelabelpropagationalgorithmbasedonleaderrank
AT zhangyongsheng animprovedtwostagelabelpropagationalgorithmbasedonleaderrank
AT liumiaomiao improvedtwostagelabelpropagationalgorithmbasedonleaderrank
AT yangjinyun improvedtwostagelabelpropagationalgorithmbasedonleaderrank
AT guojingfeng improvedtwostagelabelpropagationalgorithmbasedonleaderrank
AT chenjing improvedtwostagelabelpropagationalgorithmbasedonleaderrank
AT zhangyongsheng improvedtwostagelabelpropagationalgorithmbasedonleaderrank