Cargando…

Spectral Clustering by Subspace Randomization and Graph Fusion for High-Dimensional Data

Subspace clustering has been gaining increasing attention in recent years due to its promising ability in dealing with high-dimensional data. However, most of the existing subspace clustering methods tend to only exploit the subspace information to construct a single affinity graph (typically for sp...

Descripción completa

Detalles Bibliográficos
Autores principales: Cai, Xiaosha, Huang, Dong, Wang, Chang-Dong, Kwoh, Chee-Keong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206270/
http://dx.doi.org/10.1007/978-3-030-47426-3_26
_version_ 1783530382413529088
author Cai, Xiaosha
Huang, Dong
Wang, Chang-Dong
Kwoh, Chee-Keong
author_facet Cai, Xiaosha
Huang, Dong
Wang, Chang-Dong
Kwoh, Chee-Keong
author_sort Cai, Xiaosha
collection PubMed
description Subspace clustering has been gaining increasing attention in recent years due to its promising ability in dealing with high-dimensional data. However, most of the existing subspace clustering methods tend to only exploit the subspace information to construct a single affinity graph (typically for spectral clustering), which often lack the ability to go beyond a single graph to explore multiple graphs built in various subspaces in high-dimensional space. To address this, this paper presents a new spectral clustering approach based on subspace randomization and graph fusion (SC-SRGF) for high-dimensional data. In particular, a set of random subspaces are first generated by performing random sampling on the original feature space. Then, multiple K-nearest neighbor (K-NN) affinity graphs are constructed to capture the local structures in the generated subspaces. To fuse the multiple affinity graphs from multiple subspaces, an iterative similarity network fusion scheme is utilized to achieve a unified graph for the final spectral clustering. Experiments on twelve real-world high-dimensional datasets demonstrate the superiority of the proposed approach. The MATLAB source code is available at https://www.researchgate.net/publication/338864134.
format Online
Article
Text
id pubmed-7206270
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-72062702020-05-08 Spectral Clustering by Subspace Randomization and Graph Fusion for High-Dimensional Data Cai, Xiaosha Huang, Dong Wang, Chang-Dong Kwoh, Chee-Keong Advances in Knowledge Discovery and Data Mining Article Subspace clustering has been gaining increasing attention in recent years due to its promising ability in dealing with high-dimensional data. However, most of the existing subspace clustering methods tend to only exploit the subspace information to construct a single affinity graph (typically for spectral clustering), which often lack the ability to go beyond a single graph to explore multiple graphs built in various subspaces in high-dimensional space. To address this, this paper presents a new spectral clustering approach based on subspace randomization and graph fusion (SC-SRGF) for high-dimensional data. In particular, a set of random subspaces are first generated by performing random sampling on the original feature space. Then, multiple K-nearest neighbor (K-NN) affinity graphs are constructed to capture the local structures in the generated subspaces. To fuse the multiple affinity graphs from multiple subspaces, an iterative similarity network fusion scheme is utilized to achieve a unified graph for the final spectral clustering. Experiments on twelve real-world high-dimensional datasets demonstrate the superiority of the proposed approach. The MATLAB source code is available at https://www.researchgate.net/publication/338864134. 2020-04-17 /pmc/articles/PMC7206270/ http://dx.doi.org/10.1007/978-3-030-47426-3_26 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Cai, Xiaosha
Huang, Dong
Wang, Chang-Dong
Kwoh, Chee-Keong
Spectral Clustering by Subspace Randomization and Graph Fusion for High-Dimensional Data
title Spectral Clustering by Subspace Randomization and Graph Fusion for High-Dimensional Data
title_full Spectral Clustering by Subspace Randomization and Graph Fusion for High-Dimensional Data
title_fullStr Spectral Clustering by Subspace Randomization and Graph Fusion for High-Dimensional Data
title_full_unstemmed Spectral Clustering by Subspace Randomization and Graph Fusion for High-Dimensional Data
title_short Spectral Clustering by Subspace Randomization and Graph Fusion for High-Dimensional Data
title_sort spectral clustering by subspace randomization and graph fusion for high-dimensional data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206270/
http://dx.doi.org/10.1007/978-3-030-47426-3_26
work_keys_str_mv AT caixiaosha spectralclusteringbysubspacerandomizationandgraphfusionforhighdimensionaldata
AT huangdong spectralclusteringbysubspacerandomizationandgraphfusionforhighdimensionaldata
AT wangchangdong spectralclusteringbysubspacerandomizationandgraphfusionforhighdimensionaldata
AT kwohcheekeong spectralclusteringbysubspacerandomizationandgraphfusionforhighdimensionaldata