Cargando…
Spectral Clustering by Subspace Randomization and Graph Fusion for High-Dimensional Data
Subspace clustering has been gaining increasing attention in recent years due to its promising ability in dealing with high-dimensional data. However, most of the existing subspace clustering methods tend to only exploit the subspace information to construct a single affinity graph (typically for sp...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206270/ http://dx.doi.org/10.1007/978-3-030-47426-3_26 |
_version_ | 1783530382413529088 |
---|---|
author | Cai, Xiaosha Huang, Dong Wang, Chang-Dong Kwoh, Chee-Keong |
author_facet | Cai, Xiaosha Huang, Dong Wang, Chang-Dong Kwoh, Chee-Keong |
author_sort | Cai, Xiaosha |
collection | PubMed |
description | Subspace clustering has been gaining increasing attention in recent years due to its promising ability in dealing with high-dimensional data. However, most of the existing subspace clustering methods tend to only exploit the subspace information to construct a single affinity graph (typically for spectral clustering), which often lack the ability to go beyond a single graph to explore multiple graphs built in various subspaces in high-dimensional space. To address this, this paper presents a new spectral clustering approach based on subspace randomization and graph fusion (SC-SRGF) for high-dimensional data. In particular, a set of random subspaces are first generated by performing random sampling on the original feature space. Then, multiple K-nearest neighbor (K-NN) affinity graphs are constructed to capture the local structures in the generated subspaces. To fuse the multiple affinity graphs from multiple subspaces, an iterative similarity network fusion scheme is utilized to achieve a unified graph for the final spectral clustering. Experiments on twelve real-world high-dimensional datasets demonstrate the superiority of the proposed approach. The MATLAB source code is available at https://www.researchgate.net/publication/338864134. |
format | Online Article Text |
id | pubmed-7206270 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-72062702020-05-08 Spectral Clustering by Subspace Randomization and Graph Fusion for High-Dimensional Data Cai, Xiaosha Huang, Dong Wang, Chang-Dong Kwoh, Chee-Keong Advances in Knowledge Discovery and Data Mining Article Subspace clustering has been gaining increasing attention in recent years due to its promising ability in dealing with high-dimensional data. However, most of the existing subspace clustering methods tend to only exploit the subspace information to construct a single affinity graph (typically for spectral clustering), which often lack the ability to go beyond a single graph to explore multiple graphs built in various subspaces in high-dimensional space. To address this, this paper presents a new spectral clustering approach based on subspace randomization and graph fusion (SC-SRGF) for high-dimensional data. In particular, a set of random subspaces are first generated by performing random sampling on the original feature space. Then, multiple K-nearest neighbor (K-NN) affinity graphs are constructed to capture the local structures in the generated subspaces. To fuse the multiple affinity graphs from multiple subspaces, an iterative similarity network fusion scheme is utilized to achieve a unified graph for the final spectral clustering. Experiments on twelve real-world high-dimensional datasets demonstrate the superiority of the proposed approach. The MATLAB source code is available at https://www.researchgate.net/publication/338864134. 2020-04-17 /pmc/articles/PMC7206270/ http://dx.doi.org/10.1007/978-3-030-47426-3_26 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Cai, Xiaosha Huang, Dong Wang, Chang-Dong Kwoh, Chee-Keong Spectral Clustering by Subspace Randomization and Graph Fusion for High-Dimensional Data |
title | Spectral Clustering by Subspace Randomization and Graph Fusion for High-Dimensional Data |
title_full | Spectral Clustering by Subspace Randomization and Graph Fusion for High-Dimensional Data |
title_fullStr | Spectral Clustering by Subspace Randomization and Graph Fusion for High-Dimensional Data |
title_full_unstemmed | Spectral Clustering by Subspace Randomization and Graph Fusion for High-Dimensional Data |
title_short | Spectral Clustering by Subspace Randomization and Graph Fusion for High-Dimensional Data |
title_sort | spectral clustering by subspace randomization and graph fusion for high-dimensional data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206270/ http://dx.doi.org/10.1007/978-3-030-47426-3_26 |
work_keys_str_mv | AT caixiaosha spectralclusteringbysubspacerandomizationandgraphfusionforhighdimensionaldata AT huangdong spectralclusteringbysubspacerandomizationandgraphfusionforhighdimensionaldata AT wangchangdong spectralclusteringbysubspacerandomizationandgraphfusionforhighdimensionaldata AT kwohcheekeong spectralclusteringbysubspacerandomizationandgraphfusionforhighdimensionaldata |