Cargando…
Active link selection for efficient semi-supervised community detection
Several semi-supervised community detection algorithms have been proposed recently to improve the performance of traditional topology-based methods. However, most of them focus on how to integrate supervised information with topology information; few of them pay attention to which information is cri...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4649850/ https://www.ncbi.nlm.nih.gov/pubmed/25761385 http://dx.doi.org/10.1038/srep09039 |
_version_ | 1782401430185836544 |
---|---|
author | Yang, Liang Jin, Di Wang, Xiao Cao, Xiaochun |
author_facet | Yang, Liang Jin, Di Wang, Xiao Cao, Xiaochun |
author_sort | Yang, Liang |
collection | PubMed |
description | Several semi-supervised community detection algorithms have been proposed recently to improve the performance of traditional topology-based methods. However, most of them focus on how to integrate supervised information with topology information; few of them pay attention to which information is critical for performance improvement. This leads to large amounts of demand for supervised information, which is expensive or difficult to obtain in most fields. For this problem we propose an active link selection framework, that is we actively select the most uncertain and informative links for human labeling for the efficient utilization of the supervised information. We also disconnect the most likely inter-community edges to further improve the efficiency. Our main idea is that, by connecting uncertain nodes to their community hubs and disconnecting the inter-community edges, one can sharpen the block structure of adjacency matrix more efficiently than randomly labeling links as the existing methods did. Experiments on both synthetic and real networks demonstrate that our new approach significantly outperforms the existing methods in terms of the efficiency of using supervised information. It needs ~13% of the supervised information to achieve a performance similar to that of the original semi-supervised approaches. |
format | Online Article Text |
id | pubmed-4649850 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-46498502015-11-23 Active link selection for efficient semi-supervised community detection Yang, Liang Jin, Di Wang, Xiao Cao, Xiaochun Sci Rep Article Several semi-supervised community detection algorithms have been proposed recently to improve the performance of traditional topology-based methods. However, most of them focus on how to integrate supervised information with topology information; few of them pay attention to which information is critical for performance improvement. This leads to large amounts of demand for supervised information, which is expensive or difficult to obtain in most fields. For this problem we propose an active link selection framework, that is we actively select the most uncertain and informative links for human labeling for the efficient utilization of the supervised information. We also disconnect the most likely inter-community edges to further improve the efficiency. Our main idea is that, by connecting uncertain nodes to their community hubs and disconnecting the inter-community edges, one can sharpen the block structure of adjacency matrix more efficiently than randomly labeling links as the existing methods did. Experiments on both synthetic and real networks demonstrate that our new approach significantly outperforms the existing methods in terms of the efficiency of using supervised information. It needs ~13% of the supervised information to achieve a performance similar to that of the original semi-supervised approaches. Nature Publishing Group 2015-03-12 /pmc/articles/PMC4649850/ /pubmed/25761385 http://dx.doi.org/10.1038/srep09039 Text en Copyright © 2015, Macmillan Publishers Limited. All rights reserved http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder in order to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Yang, Liang Jin, Di Wang, Xiao Cao, Xiaochun Active link selection for efficient semi-supervised community detection |
title | Active link selection for efficient semi-supervised community detection |
title_full | Active link selection for efficient semi-supervised community detection |
title_fullStr | Active link selection for efficient semi-supervised community detection |
title_full_unstemmed | Active link selection for efficient semi-supervised community detection |
title_short | Active link selection for efficient semi-supervised community detection |
title_sort | active link selection for efficient semi-supervised community detection |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4649850/ https://www.ncbi.nlm.nih.gov/pubmed/25761385 http://dx.doi.org/10.1038/srep09039 |
work_keys_str_mv | AT yangliang activelinkselectionforefficientsemisupervisedcommunitydetection AT jindi activelinkselectionforefficientsemisupervisedcommunitydetection AT wangxiao activelinkselectionforefficientsemisupervisedcommunitydetection AT caoxiaochun activelinkselectionforefficientsemisupervisedcommunitydetection |