Cargando…

Link Clustering with Extended Link Similarity and EQ Evaluation Division

Link Clustering (LC) is a relatively new method for detecting overlapping communities in networks. The basic principle of LC is to derive a transform matrix whose elements are composed of the link similarity of neighbor links based on the Jaccard distance calculation; then it applies hierarchical cl...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Lan, Wang, Guishen, Wang, Yan, Blanzieri, Enrico, Su, Chao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3686866/
https://www.ncbi.nlm.nih.gov/pubmed/23840390
http://dx.doi.org/10.1371/journal.pone.0066005
_version_ 1782273849590546432
author Huang, Lan
Wang, Guishen
Wang, Yan
Blanzieri, Enrico
Su, Chao
author_facet Huang, Lan
Wang, Guishen
Wang, Yan
Blanzieri, Enrico
Su, Chao
author_sort Huang, Lan
collection PubMed
description Link Clustering (LC) is a relatively new method for detecting overlapping communities in networks. The basic principle of LC is to derive a transform matrix whose elements are composed of the link similarity of neighbor links based on the Jaccard distance calculation; then it applies hierarchical clustering to the transform matrix and uses a measure of partition density on the resulting dendrogram to determine the cut level for best community detection. However, the original link clustering method does not consider the link similarity of non-neighbor links, and the partition density tends to divide the communities into many small communities. In this paper, an Extended Link Clustering method (ELC) for overlapping community detection is proposed. The improved method employs a new link similarity, Extended Link Similarity (ELS), to produce a denser transform matrix, and uses the maximum value of EQ (an extended measure of quality of modularity) as a means to optimally cut the dendrogram for better partitioning of the original network space. Since ELS uses more link information, the resulting transform matrix provides a superior basis for clustering and analysis. Further, using the EQ value to find the best level for the hierarchical clustering dendrogram division, we obtain communities that are more sensible and reasonable than the ones obtained by the partition density evaluation. Experimentation on five real-world networks and artificially-generated networks shows that the ELC method achieves higher EQ and In-group Proportion (IGP) values. Additionally, communities are more realistic than those generated by either of the original LC method or the classical CPM method.
format Online
Article
Text
id pubmed-3686866
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-36868662013-07-09 Link Clustering with Extended Link Similarity and EQ Evaluation Division Huang, Lan Wang, Guishen Wang, Yan Blanzieri, Enrico Su, Chao PLoS One Research Article Link Clustering (LC) is a relatively new method for detecting overlapping communities in networks. The basic principle of LC is to derive a transform matrix whose elements are composed of the link similarity of neighbor links based on the Jaccard distance calculation; then it applies hierarchical clustering to the transform matrix and uses a measure of partition density on the resulting dendrogram to determine the cut level for best community detection. However, the original link clustering method does not consider the link similarity of non-neighbor links, and the partition density tends to divide the communities into many small communities. In this paper, an Extended Link Clustering method (ELC) for overlapping community detection is proposed. The improved method employs a new link similarity, Extended Link Similarity (ELS), to produce a denser transform matrix, and uses the maximum value of EQ (an extended measure of quality of modularity) as a means to optimally cut the dendrogram for better partitioning of the original network space. Since ELS uses more link information, the resulting transform matrix provides a superior basis for clustering and analysis. Further, using the EQ value to find the best level for the hierarchical clustering dendrogram division, we obtain communities that are more sensible and reasonable than the ones obtained by the partition density evaluation. Experimentation on five real-world networks and artificially-generated networks shows that the ELC method achieves higher EQ and In-group Proportion (IGP) values. Additionally, communities are more realistic than those generated by either of the original LC method or the classical CPM method. Public Library of Science 2013-06-19 /pmc/articles/PMC3686866/ /pubmed/23840390 http://dx.doi.org/10.1371/journal.pone.0066005 Text en © 2013 Huang et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Huang, Lan
Wang, Guishen
Wang, Yan
Blanzieri, Enrico
Su, Chao
Link Clustering with Extended Link Similarity and EQ Evaluation Division
title Link Clustering with Extended Link Similarity and EQ Evaluation Division
title_full Link Clustering with Extended Link Similarity and EQ Evaluation Division
title_fullStr Link Clustering with Extended Link Similarity and EQ Evaluation Division
title_full_unstemmed Link Clustering with Extended Link Similarity and EQ Evaluation Division
title_short Link Clustering with Extended Link Similarity and EQ Evaluation Division
title_sort link clustering with extended link similarity and eq evaluation division
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3686866/
https://www.ncbi.nlm.nih.gov/pubmed/23840390
http://dx.doi.org/10.1371/journal.pone.0066005
work_keys_str_mv AT huanglan linkclusteringwithextendedlinksimilarityandeqevaluationdivision
AT wangguishen linkclusteringwithextendedlinksimilarityandeqevaluationdivision
AT wangyan linkclusteringwithextendedlinksimilarityandeqevaluationdivision
AT blanzierienrico linkclusteringwithextendedlinksimilarityandeqevaluationdivision
AT suchao linkclusteringwithextendedlinksimilarityandeqevaluationdivision