Cargando…

An Improved K-Means Algorithm Based on Evidence Distance

The main influencing factors of the clustering effect of the k-means algorithm are the selection of the initial clustering center and the distance measurement between the sample points. The traditional k-mean algorithm uses Euclidean distance to measure the distance between sample points, thus it su...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Ailin, Hua, Zexi, Shi, Yu, Tang, Yongchuan, Miao, Lingwei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8625371/
https://www.ncbi.nlm.nih.gov/pubmed/34828248
http://dx.doi.org/10.3390/e23111550
_version_ 1784606405087461376
author Zhu, Ailin
Hua, Zexi
Shi, Yu
Tang, Yongchuan
Miao, Lingwei
author_facet Zhu, Ailin
Hua, Zexi
Shi, Yu
Tang, Yongchuan
Miao, Lingwei
author_sort Zhu, Ailin
collection PubMed
description The main influencing factors of the clustering effect of the k-means algorithm are the selection of the initial clustering center and the distance measurement between the sample points. The traditional k-mean algorithm uses Euclidean distance to measure the distance between sample points, thus it suffers from low differentiation of attributes between sample points and is prone to local optimal solutions. For this feature, this paper proposes an improved k-means algorithm based on evidence distance. Firstly, the attribute values of sample points are modelled as the basic probability assignment (BPA) of sample points. Then, the traditional Euclidean distance is replaced by the evidence distance for measuring the distance between sample points, and finally k-means clustering is carried out using UCI data. Experimental comparisons are made with the traditional k-means algorithm, the k-means algorithm based on the aggregation distance parameter, and the Gaussian mixture model. The experimental results show that the improved k-means algorithm based on evidence distance proposed in this paper has a better clustering effect and the convergence of the algorithm is also better.
format Online
Article
Text
id pubmed-8625371
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-86253712021-11-27 An Improved K-Means Algorithm Based on Evidence Distance Zhu, Ailin Hua, Zexi Shi, Yu Tang, Yongchuan Miao, Lingwei Entropy (Basel) Article The main influencing factors of the clustering effect of the k-means algorithm are the selection of the initial clustering center and the distance measurement between the sample points. The traditional k-mean algorithm uses Euclidean distance to measure the distance between sample points, thus it suffers from low differentiation of attributes between sample points and is prone to local optimal solutions. For this feature, this paper proposes an improved k-means algorithm based on evidence distance. Firstly, the attribute values of sample points are modelled as the basic probability assignment (BPA) of sample points. Then, the traditional Euclidean distance is replaced by the evidence distance for measuring the distance between sample points, and finally k-means clustering is carried out using UCI data. Experimental comparisons are made with the traditional k-means algorithm, the k-means algorithm based on the aggregation distance parameter, and the Gaussian mixture model. The experimental results show that the improved k-means algorithm based on evidence distance proposed in this paper has a better clustering effect and the convergence of the algorithm is also better. MDPI 2021-11-21 /pmc/articles/PMC8625371/ /pubmed/34828248 http://dx.doi.org/10.3390/e23111550 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhu, Ailin
Hua, Zexi
Shi, Yu
Tang, Yongchuan
Miao, Lingwei
An Improved K-Means Algorithm Based on Evidence Distance
title An Improved K-Means Algorithm Based on Evidence Distance
title_full An Improved K-Means Algorithm Based on Evidence Distance
title_fullStr An Improved K-Means Algorithm Based on Evidence Distance
title_full_unstemmed An Improved K-Means Algorithm Based on Evidence Distance
title_short An Improved K-Means Algorithm Based on Evidence Distance
title_sort improved k-means algorithm based on evidence distance
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8625371/
https://www.ncbi.nlm.nih.gov/pubmed/34828248
http://dx.doi.org/10.3390/e23111550
work_keys_str_mv AT zhuailin animprovedkmeansalgorithmbasedonevidencedistance
AT huazexi animprovedkmeansalgorithmbasedonevidencedistance
AT shiyu animprovedkmeansalgorithmbasedonevidencedistance
AT tangyongchuan animprovedkmeansalgorithmbasedonevidencedistance
AT miaolingwei animprovedkmeansalgorithmbasedonevidencedistance
AT zhuailin improvedkmeansalgorithmbasedonevidencedistance
AT huazexi improvedkmeansalgorithmbasedonevidencedistance
AT shiyu improvedkmeansalgorithmbasedonevidencedistance
AT tangyongchuan improvedkmeansalgorithmbasedonevidencedistance
AT miaolingwei improvedkmeansalgorithmbasedonevidencedistance