Cargando…

Adaptive Initialization Method for K-Means Algorithm

The K-means algorithm is a widely used clustering algorithm that offers simplicity and efficiency. However, the traditional K-means algorithm uses a random method to determine the initial cluster centers, which make clustering results prone to local optima and then result in worse clustering perform...

Descripción completa

Detalles Bibliográficos
Autores principales:	Yang, Jie, Wang, Yu-Kai, Yao, Xin, Lin, Chin-Teng
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2021
Materias:	Artificial Intelligence
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8656690/ https://www.ncbi.nlm.nih.gov/pubmed/34901837 http://dx.doi.org/10.3389/frai.2021.740817

_version_	1784612340446003200
author	Yang, Jie Wang, Yu-Kai Yao, Xin Lin, Chin-Teng
author_facet	Yang, Jie Wang, Yu-Kai Yao, Xin Lin, Chin-Teng
author_sort	Yang, Jie
collection	PubMed
description	The K-means algorithm is a widely used clustering algorithm that offers simplicity and efficiency. However, the traditional K-means algorithm uses a random method to determine the initial cluster centers, which make clustering results prone to local optima and then result in worse clustering performance. In this research, we propose an adaptive initialization method for the K-means algorithm (AIMK) which can adapt to the various characteristics in different datasets and obtain better clustering performance with stable results. For larger or higher-dimensional datasets, we even leverage random sampling in AIMK (name as AIMK-RS) to reduce the time complexity. 22 real-world datasets were applied for performance comparisons. The experimental results show AIMK and AIMK-RS outperform the current initialization methods and several well-known clustering algorithms. Specifically, AIMK-RS can significantly reduce the time complexity to O (n). Moreover, we exploit AIMK to initialize K-medoids and spectral clustering, and better performance is also explored. The above results demonstrate superior performance and good scalability by AIMK or AIMK-RS. In the future, we would like to apply AIMK to more partition-based clustering algorithms to solve real-life practical problems.
format	Online Article Text
id	pubmed-8656690
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-86566902021-12-10 Adaptive Initialization Method for K-Means Algorithm Yang, Jie Wang, Yu-Kai Yao, Xin Lin, Chin-Teng Front Artif Intell Artificial Intelligence The K-means algorithm is a widely used clustering algorithm that offers simplicity and efficiency. However, the traditional K-means algorithm uses a random method to determine the initial cluster centers, which make clustering results prone to local optima and then result in worse clustering performance. In this research, we propose an adaptive initialization method for the K-means algorithm (AIMK) which can adapt to the various characteristics in different datasets and obtain better clustering performance with stable results. For larger or higher-dimensional datasets, we even leverage random sampling in AIMK (name as AIMK-RS) to reduce the time complexity. 22 real-world datasets were applied for performance comparisons. The experimental results show AIMK and AIMK-RS outperform the current initialization methods and several well-known clustering algorithms. Specifically, AIMK-RS can significantly reduce the time complexity to O (n). Moreover, we exploit AIMK to initialize K-medoids and spectral clustering, and better performance is also explored. The above results demonstrate superior performance and good scalability by AIMK or AIMK-RS. In the future, we would like to apply AIMK to more partition-based clustering algorithms to solve real-life practical problems. Frontiers Media S.A. 2021-11-25 /pmc/articles/PMC8656690/ /pubmed/34901837 http://dx.doi.org/10.3389/frai.2021.740817 Text en Copyright © 2021 Yang, Wang, Yao and Lin. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Artificial Intelligence Yang, Jie Wang, Yu-Kai Yao, Xin Lin, Chin-Teng Adaptive Initialization Method for K-Means Algorithm
title	Adaptive Initialization Method for K-Means Algorithm
title_full	Adaptive Initialization Method for K-Means Algorithm
title_fullStr	Adaptive Initialization Method for K-Means Algorithm
title_full_unstemmed	Adaptive Initialization Method for K-Means Algorithm
title_short	Adaptive Initialization Method for K-Means Algorithm
title_sort	adaptive initialization method for k-means algorithm
topic	Artificial Intelligence
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8656690/ https://www.ncbi.nlm.nih.gov/pubmed/34901837 http://dx.doi.org/10.3389/frai.2021.740817
work_keys_str_mv	AT yangjie adaptiveinitializationmethodforkmeansalgorithm AT wangyukai adaptiveinitializationmethodforkmeansalgorithm AT yaoxin adaptiveinitializationmethodforkmeansalgorithm AT linchinteng adaptiveinitializationmethodforkmeansalgorithm

Adaptive Initialization Method for K-Means Algorithm

Ejemplares similares