Cargando…

Multilevel Hierarchical Kernel Spectral Clustering for Real-Life Large Scale Complex Networks

Kernel spectral clustering corresponds to a weighted kernel principal component analysis problem in a constrained optimization framework. The primal formulation leads to an eigen-decomposition of a centered Laplacian matrix at the dual level. The dual formulation allows to build a model on a represe...

Descripción completa

Detalles Bibliográficos
Autores principales: Mall, Raghvendra, Langone, Rocco, Suykens, Johan A. K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4065034/
https://www.ncbi.nlm.nih.gov/pubmed/24949877
http://dx.doi.org/10.1371/journal.pone.0099966
_version_ 1782322016298205184
author Mall, Raghvendra
Langone, Rocco
Suykens, Johan A. K.
author_facet Mall, Raghvendra
Langone, Rocco
Suykens, Johan A. K.
author_sort Mall, Raghvendra
collection PubMed
description Kernel spectral clustering corresponds to a weighted kernel principal component analysis problem in a constrained optimization framework. The primal formulation leads to an eigen-decomposition of a centered Laplacian matrix at the dual level. The dual formulation allows to build a model on a representative subgraph of the large scale network in the training phase and the model parameters are estimated in the validation stage. The KSC model has a powerful out-of-sample extension property which allows cluster affiliation for the unseen nodes of the big data network. In this paper we exploit the structure of the projections in the eigenspace during the validation stage to automatically determine a set of increasing distance thresholds. We use these distance thresholds in the test phase to obtain multiple levels of hierarchy for the large scale network. The hierarchical structure in the network is determined in a bottom-up fashion. We empirically showcase that real-world networks have multilevel hierarchical organization which cannot be detected efficiently by several state-of-the-art large scale hierarchical community detection techniques like the Louvain, OSLOM and Infomap methods. We show that a major advantage of our proposed approach is the ability to locate good quality clusters at both the finer and coarser levels of hierarchy using internal cluster quality metrics on 7 real-life networks.
format Online
Article
Text
id pubmed-4065034
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-40650342014-06-25 Multilevel Hierarchical Kernel Spectral Clustering for Real-Life Large Scale Complex Networks Mall, Raghvendra Langone, Rocco Suykens, Johan A. K. PLoS One Research Article Kernel spectral clustering corresponds to a weighted kernel principal component analysis problem in a constrained optimization framework. The primal formulation leads to an eigen-decomposition of a centered Laplacian matrix at the dual level. The dual formulation allows to build a model on a representative subgraph of the large scale network in the training phase and the model parameters are estimated in the validation stage. The KSC model has a powerful out-of-sample extension property which allows cluster affiliation for the unseen nodes of the big data network. In this paper we exploit the structure of the projections in the eigenspace during the validation stage to automatically determine a set of increasing distance thresholds. We use these distance thresholds in the test phase to obtain multiple levels of hierarchy for the large scale network. The hierarchical structure in the network is determined in a bottom-up fashion. We empirically showcase that real-world networks have multilevel hierarchical organization which cannot be detected efficiently by several state-of-the-art large scale hierarchical community detection techniques like the Louvain, OSLOM and Infomap methods. We show that a major advantage of our proposed approach is the ability to locate good quality clusters at both the finer and coarser levels of hierarchy using internal cluster quality metrics on 7 real-life networks. Public Library of Science 2014-06-20 /pmc/articles/PMC4065034/ /pubmed/24949877 http://dx.doi.org/10.1371/journal.pone.0099966 Text en © 2014 Mall et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Mall, Raghvendra
Langone, Rocco
Suykens, Johan A. K.
Multilevel Hierarchical Kernel Spectral Clustering for Real-Life Large Scale Complex Networks
title Multilevel Hierarchical Kernel Spectral Clustering for Real-Life Large Scale Complex Networks
title_full Multilevel Hierarchical Kernel Spectral Clustering for Real-Life Large Scale Complex Networks
title_fullStr Multilevel Hierarchical Kernel Spectral Clustering for Real-Life Large Scale Complex Networks
title_full_unstemmed Multilevel Hierarchical Kernel Spectral Clustering for Real-Life Large Scale Complex Networks
title_short Multilevel Hierarchical Kernel Spectral Clustering for Real-Life Large Scale Complex Networks
title_sort multilevel hierarchical kernel spectral clustering for real-life large scale complex networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4065034/
https://www.ncbi.nlm.nih.gov/pubmed/24949877
http://dx.doi.org/10.1371/journal.pone.0099966
work_keys_str_mv AT mallraghvendra multilevelhierarchicalkernelspectralclusteringforreallifelargescalecomplexnetworks
AT langonerocco multilevelhierarchicalkernelspectralclusteringforreallifelargescalecomplexnetworks
AT suykensjohanak multilevelhierarchicalkernelspectralclusteringforreallifelargescalecomplexnetworks