Cargando…

Multilabel user classification using the community structure of online networks

We study the problem of semi-supervised, multi-label user classification of networked data in the online social platform setting. We propose a framework that combines unsupervised community extraction and supervised, community-based feature weighting before training a classifier. We introduce Approx...

Descripción completa

Detalles Bibliográficos
Autores principales: Rizos, Georgios, Papadopoulos, Symeon, Kompatsiaris, Yiannis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5344401/
https://www.ncbi.nlm.nih.gov/pubmed/28278242
http://dx.doi.org/10.1371/journal.pone.0173347
_version_ 1782513534552244224
author Rizos, Georgios
Papadopoulos, Symeon
Kompatsiaris, Yiannis
author_facet Rizos, Georgios
Papadopoulos, Symeon
Kompatsiaris, Yiannis
author_sort Rizos, Georgios
collection PubMed
description We study the problem of semi-supervised, multi-label user classification of networked data in the online social platform setting. We propose a framework that combines unsupervised community extraction and supervised, community-based feature weighting before training a classifier. We introduce Approximate Regularized Commute-Time Embedding (ARCTE), an algorithm that projects the users of a social graph onto a latent space, but instead of packing the global structure into a matrix of predefined rank, as many spectral and neural representation learning methods do, it extracts local communities for all users in the graph in order to learn a sparse embedding. To this end, we employ an improvement of personalized PageRank algorithms for searching locally in each user’s graph structure. Then, we perform supervised community feature weighting in order to boost the importance of highly predictive communities. We assess our method performance on the problem of user classification by performing an extensive comparative study among various recent methods based on graph embeddings. The comparison shows that ARCTE significantly outperforms the competition in almost all cases, achieving up to 35% relative improvement compared to the second best competing method in terms of F1-score.
format Online
Article
Text
id pubmed-5344401
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-53444012017-03-29 Multilabel user classification using the community structure of online networks Rizos, Georgios Papadopoulos, Symeon Kompatsiaris, Yiannis PLoS One Research Article We study the problem of semi-supervised, multi-label user classification of networked data in the online social platform setting. We propose a framework that combines unsupervised community extraction and supervised, community-based feature weighting before training a classifier. We introduce Approximate Regularized Commute-Time Embedding (ARCTE), an algorithm that projects the users of a social graph onto a latent space, but instead of packing the global structure into a matrix of predefined rank, as many spectral and neural representation learning methods do, it extracts local communities for all users in the graph in order to learn a sparse embedding. To this end, we employ an improvement of personalized PageRank algorithms for searching locally in each user’s graph structure. Then, we perform supervised community feature weighting in order to boost the importance of highly predictive communities. We assess our method performance on the problem of user classification by performing an extensive comparative study among various recent methods based on graph embeddings. The comparison shows that ARCTE significantly outperforms the competition in almost all cases, achieving up to 35% relative improvement compared to the second best competing method in terms of F1-score. Public Library of Science 2017-03-09 /pmc/articles/PMC5344401/ /pubmed/28278242 http://dx.doi.org/10.1371/journal.pone.0173347 Text en © 2017 Rizos et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Rizos, Georgios
Papadopoulos, Symeon
Kompatsiaris, Yiannis
Multilabel user classification using the community structure of online networks
title Multilabel user classification using the community structure of online networks
title_full Multilabel user classification using the community structure of online networks
title_fullStr Multilabel user classification using the community structure of online networks
title_full_unstemmed Multilabel user classification using the community structure of online networks
title_short Multilabel user classification using the community structure of online networks
title_sort multilabel user classification using the community structure of online networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5344401/
https://www.ncbi.nlm.nih.gov/pubmed/28278242
http://dx.doi.org/10.1371/journal.pone.0173347
work_keys_str_mv AT rizosgeorgios multilabeluserclassificationusingthecommunitystructureofonlinenetworks
AT papadopoulossymeon multilabeluserclassificationusingthecommunitystructureofonlinenetworks
AT kompatsiarisyiannis multilabeluserclassificationusingthecommunitystructureofonlinenetworks