Cargando…

Projected Affinity Values for Nyström Spectral Clustering

In kernel methods, Nyström approximation is a popular way of calculating out-of-sample extensions and can be further applied to large-scale data clustering and classification tasks. Given a new data point, Nyström employs its empirical affinity vector, k, for calculation. This vector is assumed to b...

Descripción completa

Detalles Bibliográficos
Autores principales: He, Li, Zhu, Haifei, Zhang, Tao, Yang, Honghong, Guan, Yisheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7513044/
https://www.ncbi.nlm.nih.gov/pubmed/33265608
http://dx.doi.org/10.3390/e20070519
_version_ 1783586297031426048
author He, Li
Zhu, Haifei
Zhang, Tao
Yang, Honghong
Guan, Yisheng
author_facet He, Li
Zhu, Haifei
Zhang, Tao
Yang, Honghong
Guan, Yisheng
author_sort He, Li
collection PubMed
description In kernel methods, Nyström approximation is a popular way of calculating out-of-sample extensions and can be further applied to large-scale data clustering and classification tasks. Given a new data point, Nyström employs its empirical affinity vector, k, for calculation. This vector is assumed to be a proper measurement of the similarity between the new point and the training set. In this paper, we suggest replacing the affinity vector by its projections on the leading eigenvectors learned from the training set, i.e., using [Formula: see text] instead, where [Formula: see text] is the i-th eigenvector of the training set and c is the number of eigenvectors used, which is typically equal to the number of classes designed by users. Our work is motivated by the constraints that in kernel space, the kernel-mapped new point should (a) also lie on the unit sphere defined by the Gaussian kernel and (b) generate training set affinity values close to k. These two constraints define a Quadratic Optimization Over a Sphere (QOOS) problem. In this paper, we prove that the projection on the leading eigenvectors, rather than the original affinity vector, is the solution to the QOOS problem. The experimental results show that the proposed replacement of k by [Formula: see text] slightly improves the performance of the Nyström approximation. Compared with other affinity matrix modification methods, our [Formula: see text] obtains comparable or higher clustering performance in terms of accuracy and Normalized Mutual Information (NMI).
format Online
Article
Text
id pubmed-7513044
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75130442020-11-09 Projected Affinity Values for Nyström Spectral Clustering He, Li Zhu, Haifei Zhang, Tao Yang, Honghong Guan, Yisheng Entropy (Basel) Article In kernel methods, Nyström approximation is a popular way of calculating out-of-sample extensions and can be further applied to large-scale data clustering and classification tasks. Given a new data point, Nyström employs its empirical affinity vector, k, for calculation. This vector is assumed to be a proper measurement of the similarity between the new point and the training set. In this paper, we suggest replacing the affinity vector by its projections on the leading eigenvectors learned from the training set, i.e., using [Formula: see text] instead, where [Formula: see text] is the i-th eigenvector of the training set and c is the number of eigenvectors used, which is typically equal to the number of classes designed by users. Our work is motivated by the constraints that in kernel space, the kernel-mapped new point should (a) also lie on the unit sphere defined by the Gaussian kernel and (b) generate training set affinity values close to k. These two constraints define a Quadratic Optimization Over a Sphere (QOOS) problem. In this paper, we prove that the projection on the leading eigenvectors, rather than the original affinity vector, is the solution to the QOOS problem. The experimental results show that the proposed replacement of k by [Formula: see text] slightly improves the performance of the Nyström approximation. Compared with other affinity matrix modification methods, our [Formula: see text] obtains comparable or higher clustering performance in terms of accuracy and Normalized Mutual Information (NMI). MDPI 2018-07-10 /pmc/articles/PMC7513044/ /pubmed/33265608 http://dx.doi.org/10.3390/e20070519 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
He, Li
Zhu, Haifei
Zhang, Tao
Yang, Honghong
Guan, Yisheng
Projected Affinity Values for Nyström Spectral Clustering
title Projected Affinity Values for Nyström Spectral Clustering
title_full Projected Affinity Values for Nyström Spectral Clustering
title_fullStr Projected Affinity Values for Nyström Spectral Clustering
title_full_unstemmed Projected Affinity Values for Nyström Spectral Clustering
title_short Projected Affinity Values for Nyström Spectral Clustering
title_sort projected affinity values for nyström spectral clustering
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7513044/
https://www.ncbi.nlm.nih.gov/pubmed/33265608
http://dx.doi.org/10.3390/e20070519
work_keys_str_mv AT heli projectedaffinityvaluesfornystromspectralclustering
AT zhuhaifei projectedaffinityvaluesfornystromspectralclustering
AT zhangtao projectedaffinityvaluesfornystromspectralclustering
AT yanghonghong projectedaffinityvaluesfornystromspectralclustering
AT guanyisheng projectedaffinityvaluesfornystromspectralclustering