Cargando…

Kernel Probabilistic K-Means Clustering

Kernel fuzzy c-means (KFCM) is a significantly improved version of fuzzy c-means (FCM) for processing linearly inseparable datasets. However, for fuzzification parameter [Formula: see text] , the problem of KFCM (kernel fuzzy c-means) cannot be solved by Lagrangian optimization. To solve this proble...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Bowen, Zhang, Ting, Li, Yujian, Liu, Zhaoying, Zhang, Zhilin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7962817/
https://www.ncbi.nlm.nih.gov/pubmed/33800353
http://dx.doi.org/10.3390/s21051892
_version_ 1783665526642311168
author Liu, Bowen
Zhang, Ting
Li, Yujian
Liu, Zhaoying
Zhang, Zhilin
author_facet Liu, Bowen
Zhang, Ting
Li, Yujian
Liu, Zhaoying
Zhang, Zhilin
author_sort Liu, Bowen
collection PubMed
description Kernel fuzzy c-means (KFCM) is a significantly improved version of fuzzy c-means (FCM) for processing linearly inseparable datasets. However, for fuzzification parameter [Formula: see text] , the problem of KFCM (kernel fuzzy c-means) cannot be solved by Lagrangian optimization. To solve this problem, an equivalent model, called kernel probabilistic k-means (KPKM), is proposed here. The novel model relates KFCM to kernel k-means (KKM) in a unified mathematic framework. Moreover, the proposed KPKM can be addressed by the active gradient projection (AGP) method, which is a nonlinear programming technique with constraints of linear equalities and linear inequalities. To accelerate the AGP method, a fast AGP (FAGP) algorithm was designed. The proposed FAGP uses a maximum-step strategy to estimate the step length, and uses an iterative method to update the projection matrix. Experiments demonstrated the effectiveness of the proposed method through a performance comparison of KPKM with KFCM, KKM, FCM and k-means. Experiments showed that the proposed KPKM is able to find nonlinearly separable structures in synthetic datasets. Ten real UCI datasets were used in this study, and KPKM had better clustering performance on at least six datsets. The proposed fast AGP requires less running time than the original AGP, and it reduced running time by 76–95% on real datasets.
format Online
Article
Text
id pubmed-7962817
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-79628172021-03-17 Kernel Probabilistic K-Means Clustering Liu, Bowen Zhang, Ting Li, Yujian Liu, Zhaoying Zhang, Zhilin Sensors (Basel) Article Kernel fuzzy c-means (KFCM) is a significantly improved version of fuzzy c-means (FCM) for processing linearly inseparable datasets. However, for fuzzification parameter [Formula: see text] , the problem of KFCM (kernel fuzzy c-means) cannot be solved by Lagrangian optimization. To solve this problem, an equivalent model, called kernel probabilistic k-means (KPKM), is proposed here. The novel model relates KFCM to kernel k-means (KKM) in a unified mathematic framework. Moreover, the proposed KPKM can be addressed by the active gradient projection (AGP) method, which is a nonlinear programming technique with constraints of linear equalities and linear inequalities. To accelerate the AGP method, a fast AGP (FAGP) algorithm was designed. The proposed FAGP uses a maximum-step strategy to estimate the step length, and uses an iterative method to update the projection matrix. Experiments demonstrated the effectiveness of the proposed method through a performance comparison of KPKM with KFCM, KKM, FCM and k-means. Experiments showed that the proposed KPKM is able to find nonlinearly separable structures in synthetic datasets. Ten real UCI datasets were used in this study, and KPKM had better clustering performance on at least six datsets. The proposed fast AGP requires less running time than the original AGP, and it reduced running time by 76–95% on real datasets. MDPI 2021-03-08 /pmc/articles/PMC7962817/ /pubmed/33800353 http://dx.doi.org/10.3390/s21051892 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Liu, Bowen
Zhang, Ting
Li, Yujian
Liu, Zhaoying
Zhang, Zhilin
Kernel Probabilistic K-Means Clustering
title Kernel Probabilistic K-Means Clustering
title_full Kernel Probabilistic K-Means Clustering
title_fullStr Kernel Probabilistic K-Means Clustering
title_full_unstemmed Kernel Probabilistic K-Means Clustering
title_short Kernel Probabilistic K-Means Clustering
title_sort kernel probabilistic k-means clustering
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7962817/
https://www.ncbi.nlm.nih.gov/pubmed/33800353
http://dx.doi.org/10.3390/s21051892
work_keys_str_mv AT liubowen kernelprobabilistickmeansclustering
AT zhangting kernelprobabilistickmeansclustering
AT liyujian kernelprobabilistickmeansclustering
AT liuzhaoying kernelprobabilistickmeansclustering
AT zhangzhilin kernelprobabilistickmeansclustering