Cargando…

Active link selection for efficient semi-supervised community detection

Several semi-supervised community detection algorithms have been proposed recently to improve the performance of traditional topology-based methods. However, most of them focus on how to integrate supervised information with topology information; few of them pay attention to which information is cri...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Liang, Jin, Di, Wang, Xiao, Cao, Xiaochun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4649850/
https://www.ncbi.nlm.nih.gov/pubmed/25761385
http://dx.doi.org/10.1038/srep09039
_version_ 1782401430185836544
author Yang, Liang
Jin, Di
Wang, Xiao
Cao, Xiaochun
author_facet Yang, Liang
Jin, Di
Wang, Xiao
Cao, Xiaochun
author_sort Yang, Liang
collection PubMed
description Several semi-supervised community detection algorithms have been proposed recently to improve the performance of traditional topology-based methods. However, most of them focus on how to integrate supervised information with topology information; few of them pay attention to which information is critical for performance improvement. This leads to large amounts of demand for supervised information, which is expensive or difficult to obtain in most fields. For this problem we propose an active link selection framework, that is we actively select the most uncertain and informative links for human labeling for the efficient utilization of the supervised information. We also disconnect the most likely inter-community edges to further improve the efficiency. Our main idea is that, by connecting uncertain nodes to their community hubs and disconnecting the inter-community edges, one can sharpen the block structure of adjacency matrix more efficiently than randomly labeling links as the existing methods did. Experiments on both synthetic and real networks demonstrate that our new approach significantly outperforms the existing methods in terms of the efficiency of using supervised information. It needs ~13% of the supervised information to achieve a performance similar to that of the original semi-supervised approaches.
format Online
Article
Text
id pubmed-4649850
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-46498502015-11-23 Active link selection for efficient semi-supervised community detection Yang, Liang Jin, Di Wang, Xiao Cao, Xiaochun Sci Rep Article Several semi-supervised community detection algorithms have been proposed recently to improve the performance of traditional topology-based methods. However, most of them focus on how to integrate supervised information with topology information; few of them pay attention to which information is critical for performance improvement. This leads to large amounts of demand for supervised information, which is expensive or difficult to obtain in most fields. For this problem we propose an active link selection framework, that is we actively select the most uncertain and informative links for human labeling for the efficient utilization of the supervised information. We also disconnect the most likely inter-community edges to further improve the efficiency. Our main idea is that, by connecting uncertain nodes to their community hubs and disconnecting the inter-community edges, one can sharpen the block structure of adjacency matrix more efficiently than randomly labeling links as the existing methods did. Experiments on both synthetic and real networks demonstrate that our new approach significantly outperforms the existing methods in terms of the efficiency of using supervised information. It needs ~13% of the supervised information to achieve a performance similar to that of the original semi-supervised approaches. Nature Publishing Group 2015-03-12 /pmc/articles/PMC4649850/ /pubmed/25761385 http://dx.doi.org/10.1038/srep09039 Text en Copyright © 2015, Macmillan Publishers Limited. All rights reserved http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder in order to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Yang, Liang
Jin, Di
Wang, Xiao
Cao, Xiaochun
Active link selection for efficient semi-supervised community detection
title Active link selection for efficient semi-supervised community detection
title_full Active link selection for efficient semi-supervised community detection
title_fullStr Active link selection for efficient semi-supervised community detection
title_full_unstemmed Active link selection for efficient semi-supervised community detection
title_short Active link selection for efficient semi-supervised community detection
title_sort active link selection for efficient semi-supervised community detection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4649850/
https://www.ncbi.nlm.nih.gov/pubmed/25761385
http://dx.doi.org/10.1038/srep09039
work_keys_str_mv AT yangliang activelinkselectionforefficientsemisupervisedcommunitydetection
AT jindi activelinkselectionforefficientsemisupervisedcommunitydetection
AT wangxiao activelinkselectionforefficientsemisupervisedcommunitydetection
AT caoxiaochun activelinkselectionforefficientsemisupervisedcommunitydetection