Cargando…

A Novel Artificial Bee Colony Based Clustering Algorithm for Categorical Data

Data with categorical attributes are ubiquitous in the real world. However, existing partitional clustering algorithms for categorical data are prone to fall into local optima. To address this issue, in this paper we propose a novel clustering algorithm, ABC-K-Modes (Artificial Bee Colony clustering...

Descripción completa

Detalles Bibliográficos
Autores principales: Ji, Jinchao, Pang, Wei, Zheng, Yanlin, Wang, Zhe, Ma, Zhiqiang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4439097/
https://www.ncbi.nlm.nih.gov/pubmed/25993469
http://dx.doi.org/10.1371/journal.pone.0127125
_version_ 1782372453824069632
author Ji, Jinchao
Pang, Wei
Zheng, Yanlin
Wang, Zhe
Ma, Zhiqiang
author_facet Ji, Jinchao
Pang, Wei
Zheng, Yanlin
Wang, Zhe
Ma, Zhiqiang
author_sort Ji, Jinchao
collection PubMed
description Data with categorical attributes are ubiquitous in the real world. However, existing partitional clustering algorithms for categorical data are prone to fall into local optima. To address this issue, in this paper we propose a novel clustering algorithm, ABC-K-Modes (Artificial Bee Colony clustering based on K-Modes), based on the traditional k-modes clustering algorithm and the artificial bee colony approach. In our approach, we first introduce a one-step k-modes procedure, and then integrate this procedure with the artificial bee colony approach to deal with categorical data. In the search process performed by scout bees, we adopt the multi-source search inspired by the idea of batch processing to accelerate the convergence of ABC-K-Modes. The performance of ABC-K-Modes is evaluated by a series of experiments in comparison with that of the other popular algorithms for categorical data.
format Online
Article
Text
id pubmed-4439097
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-44390972015-05-29 A Novel Artificial Bee Colony Based Clustering Algorithm for Categorical Data Ji, Jinchao Pang, Wei Zheng, Yanlin Wang, Zhe Ma, Zhiqiang PLoS One Research Article Data with categorical attributes are ubiquitous in the real world. However, existing partitional clustering algorithms for categorical data are prone to fall into local optima. To address this issue, in this paper we propose a novel clustering algorithm, ABC-K-Modes (Artificial Bee Colony clustering based on K-Modes), based on the traditional k-modes clustering algorithm and the artificial bee colony approach. In our approach, we first introduce a one-step k-modes procedure, and then integrate this procedure with the artificial bee colony approach to deal with categorical data. In the search process performed by scout bees, we adopt the multi-source search inspired by the idea of batch processing to accelerate the convergence of ABC-K-Modes. The performance of ABC-K-Modes is evaluated by a series of experiments in comparison with that of the other popular algorithms for categorical data. Public Library of Science 2015-05-20 /pmc/articles/PMC4439097/ /pubmed/25993469 http://dx.doi.org/10.1371/journal.pone.0127125 Text en © 2015 Ji et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Ji, Jinchao
Pang, Wei
Zheng, Yanlin
Wang, Zhe
Ma, Zhiqiang
A Novel Artificial Bee Colony Based Clustering Algorithm for Categorical Data
title A Novel Artificial Bee Colony Based Clustering Algorithm for Categorical Data
title_full A Novel Artificial Bee Colony Based Clustering Algorithm for Categorical Data
title_fullStr A Novel Artificial Bee Colony Based Clustering Algorithm for Categorical Data
title_full_unstemmed A Novel Artificial Bee Colony Based Clustering Algorithm for Categorical Data
title_short A Novel Artificial Bee Colony Based Clustering Algorithm for Categorical Data
title_sort novel artificial bee colony based clustering algorithm for categorical data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4439097/
https://www.ncbi.nlm.nih.gov/pubmed/25993469
http://dx.doi.org/10.1371/journal.pone.0127125
work_keys_str_mv AT jijinchao anovelartificialbeecolonybasedclusteringalgorithmforcategoricaldata
AT pangwei anovelartificialbeecolonybasedclusteringalgorithmforcategoricaldata
AT zhengyanlin anovelartificialbeecolonybasedclusteringalgorithmforcategoricaldata
AT wangzhe anovelartificialbeecolonybasedclusteringalgorithmforcategoricaldata
AT mazhiqiang anovelartificialbeecolonybasedclusteringalgorithmforcategoricaldata
AT jijinchao novelartificialbeecolonybasedclusteringalgorithmforcategoricaldata
AT pangwei novelartificialbeecolonybasedclusteringalgorithmforcategoricaldata
AT zhengyanlin novelartificialbeecolonybasedclusteringalgorithmforcategoricaldata
AT wangzhe novelartificialbeecolonybasedclusteringalgorithmforcategoricaldata
AT mazhiqiang novelartificialbeecolonybasedclusteringalgorithmforcategoricaldata