Cargando…

A Global-Relationship Dissimilarity Measure for the k-Modes Clustering Algorithm

The k-modes clustering algorithm has been widely used to cluster categorical data. In this paper, we firstly analyzed the k-modes algorithm and its dissimilarity measure. Based on this, we then proposed a novel dissimilarity measure, which is named as GRD. GRD considers not only the relationships be...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Hongfang, Zhang, Yihui, Liu, Yibin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5387825/
https://www.ncbi.nlm.nih.gov/pubmed/28458686
http://dx.doi.org/10.1155/2017/3691316
Descripción
Sumario:The k-modes clustering algorithm has been widely used to cluster categorical data. In this paper, we firstly analyzed the k-modes algorithm and its dissimilarity measure. Based on this, we then proposed a novel dissimilarity measure, which is named as GRD. GRD considers not only the relationships between the object and all cluster modes but also the differences of different attributes. Finally the experiments were made on four real data sets from UCI. And the corresponding results show that GRD achieves better performance than two existing dissimilarity measures used in k-modes and Cao's algorithms.