Cargando…

An Entropy Regularization k-Means Algorithm with a New Measure of between-Cluster Distance in Subspace Clustering

Although within-cluster information is commonly used in most clustering approaches, other important information such as between-cluster information is rarely considered in some cases. Hence, in this study, we propose a new novel measure of between-cluster distance in subspace, which is to maximize t...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiong, Liyan, Wang, Cheng, Huang, Xiaohui, Zeng, Hui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7515186/
https://www.ncbi.nlm.nih.gov/pubmed/33267397
http://dx.doi.org/10.3390/e21070683
_version_ 1783586761389113344
author Xiong, Liyan
Wang, Cheng
Huang, Xiaohui
Zeng, Hui
author_facet Xiong, Liyan
Wang, Cheng
Huang, Xiaohui
Zeng, Hui
author_sort Xiong, Liyan
collection PubMed
description Although within-cluster information is commonly used in most clustering approaches, other important information such as between-cluster information is rarely considered in some cases. Hence, in this study, we propose a new novel measure of between-cluster distance in subspace, which is to maximize the distance between the center of a cluster and the points that do not belong to this cluster. Based on this idea, we firstly design an optimization objective function integrating the between-cluster distance and entropy regularization in this paper. Then, updating rules are given by theoretical analysis. In the following, the properties of our proposed algorithm are investigated, and the performance is evaluated experimentally using two synthetic and seven real-life datasets. Finally, the experimental studies demonstrate that the results of the proposed algorithm (ERKM) outperform most existing state-of-the-art k-means-type clustering algorithms in most cases.
format Online
Article
Text
id pubmed-7515186
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75151862020-11-09 An Entropy Regularization k-Means Algorithm with a New Measure of between-Cluster Distance in Subspace Clustering Xiong, Liyan Wang, Cheng Huang, Xiaohui Zeng, Hui Entropy (Basel) Article Although within-cluster information is commonly used in most clustering approaches, other important information such as between-cluster information is rarely considered in some cases. Hence, in this study, we propose a new novel measure of between-cluster distance in subspace, which is to maximize the distance between the center of a cluster and the points that do not belong to this cluster. Based on this idea, we firstly design an optimization objective function integrating the between-cluster distance and entropy regularization in this paper. Then, updating rules are given by theoretical analysis. In the following, the properties of our proposed algorithm are investigated, and the performance is evaluated experimentally using two synthetic and seven real-life datasets. Finally, the experimental studies demonstrate that the results of the proposed algorithm (ERKM) outperform most existing state-of-the-art k-means-type clustering algorithms in most cases. MDPI 2019-07-12 /pmc/articles/PMC7515186/ /pubmed/33267397 http://dx.doi.org/10.3390/e21070683 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Xiong, Liyan
Wang, Cheng
Huang, Xiaohui
Zeng, Hui
An Entropy Regularization k-Means Algorithm with a New Measure of between-Cluster Distance in Subspace Clustering
title An Entropy Regularization k-Means Algorithm with a New Measure of between-Cluster Distance in Subspace Clustering
title_full An Entropy Regularization k-Means Algorithm with a New Measure of between-Cluster Distance in Subspace Clustering
title_fullStr An Entropy Regularization k-Means Algorithm with a New Measure of between-Cluster Distance in Subspace Clustering
title_full_unstemmed An Entropy Regularization k-Means Algorithm with a New Measure of between-Cluster Distance in Subspace Clustering
title_short An Entropy Regularization k-Means Algorithm with a New Measure of between-Cluster Distance in Subspace Clustering
title_sort entropy regularization k-means algorithm with a new measure of between-cluster distance in subspace clustering
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7515186/
https://www.ncbi.nlm.nih.gov/pubmed/33267397
http://dx.doi.org/10.3390/e21070683
work_keys_str_mv AT xiongliyan anentropyregularizationkmeansalgorithmwithanewmeasureofbetweenclusterdistanceinsubspaceclustering
AT wangcheng anentropyregularizationkmeansalgorithmwithanewmeasureofbetweenclusterdistanceinsubspaceclustering
AT huangxiaohui anentropyregularizationkmeansalgorithmwithanewmeasureofbetweenclusterdistanceinsubspaceclustering
AT zenghui anentropyregularizationkmeansalgorithmwithanewmeasureofbetweenclusterdistanceinsubspaceclustering
AT xiongliyan entropyregularizationkmeansalgorithmwithanewmeasureofbetweenclusterdistanceinsubspaceclustering
AT wangcheng entropyregularizationkmeansalgorithmwithanewmeasureofbetweenclusterdistanceinsubspaceclustering
AT huangxiaohui entropyregularizationkmeansalgorithmwithanewmeasureofbetweenclusterdistanceinsubspaceclustering
AT zenghui entropyregularizationkmeansalgorithmwithanewmeasureofbetweenclusterdistanceinsubspaceclustering