Cargando…
Multilabel user classification using the community structure of online networks
We study the problem of semi-supervised, multi-label user classification of networked data in the online social platform setting. We propose a framework that combines unsupervised community extraction and supervised, community-based feature weighting before training a classifier. We introduce Approx...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5344401/ https://www.ncbi.nlm.nih.gov/pubmed/28278242 http://dx.doi.org/10.1371/journal.pone.0173347 |
_version_ | 1782513534552244224 |
---|---|
author | Rizos, Georgios Papadopoulos, Symeon Kompatsiaris, Yiannis |
author_facet | Rizos, Georgios Papadopoulos, Symeon Kompatsiaris, Yiannis |
author_sort | Rizos, Georgios |
collection | PubMed |
description | We study the problem of semi-supervised, multi-label user classification of networked data in the online social platform setting. We propose a framework that combines unsupervised community extraction and supervised, community-based feature weighting before training a classifier. We introduce Approximate Regularized Commute-Time Embedding (ARCTE), an algorithm that projects the users of a social graph onto a latent space, but instead of packing the global structure into a matrix of predefined rank, as many spectral and neural representation learning methods do, it extracts local communities for all users in the graph in order to learn a sparse embedding. To this end, we employ an improvement of personalized PageRank algorithms for searching locally in each user’s graph structure. Then, we perform supervised community feature weighting in order to boost the importance of highly predictive communities. We assess our method performance on the problem of user classification by performing an extensive comparative study among various recent methods based on graph embeddings. The comparison shows that ARCTE significantly outperforms the competition in almost all cases, achieving up to 35% relative improvement compared to the second best competing method in terms of F1-score. |
format | Online Article Text |
id | pubmed-5344401 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-53444012017-03-29 Multilabel user classification using the community structure of online networks Rizos, Georgios Papadopoulos, Symeon Kompatsiaris, Yiannis PLoS One Research Article We study the problem of semi-supervised, multi-label user classification of networked data in the online social platform setting. We propose a framework that combines unsupervised community extraction and supervised, community-based feature weighting before training a classifier. We introduce Approximate Regularized Commute-Time Embedding (ARCTE), an algorithm that projects the users of a social graph onto a latent space, but instead of packing the global structure into a matrix of predefined rank, as many spectral and neural representation learning methods do, it extracts local communities for all users in the graph in order to learn a sparse embedding. To this end, we employ an improvement of personalized PageRank algorithms for searching locally in each user’s graph structure. Then, we perform supervised community feature weighting in order to boost the importance of highly predictive communities. We assess our method performance on the problem of user classification by performing an extensive comparative study among various recent methods based on graph embeddings. The comparison shows that ARCTE significantly outperforms the competition in almost all cases, achieving up to 35% relative improvement compared to the second best competing method in terms of F1-score. Public Library of Science 2017-03-09 /pmc/articles/PMC5344401/ /pubmed/28278242 http://dx.doi.org/10.1371/journal.pone.0173347 Text en © 2017 Rizos et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Rizos, Georgios Papadopoulos, Symeon Kompatsiaris, Yiannis Multilabel user classification using the community structure of online networks |
title | Multilabel user classification using the community structure of online networks |
title_full | Multilabel user classification using the community structure of online networks |
title_fullStr | Multilabel user classification using the community structure of online networks |
title_full_unstemmed | Multilabel user classification using the community structure of online networks |
title_short | Multilabel user classification using the community structure of online networks |
title_sort | multilabel user classification using the community structure of online networks |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5344401/ https://www.ncbi.nlm.nih.gov/pubmed/28278242 http://dx.doi.org/10.1371/journal.pone.0173347 |
work_keys_str_mv | AT rizosgeorgios multilabeluserclassificationusingthecommunitystructureofonlinenetworks AT papadopoulossymeon multilabeluserclassificationusingthecommunitystructureofonlinenetworks AT kompatsiarisyiannis multilabeluserclassificationusingthecommunitystructureofonlinenetworks |