Cargando…

Convalescing Cluster Configuration Using a Superlative Framework

Competent data mining methods are vital to discover knowledge from databases which are built as a result of enormous growth of data. Various techniques of data mining are applied to obtain knowledge from these databases. Data clustering is one such descriptive data mining technique which guides in p...

Descripción completa

Detalles Bibliográficos
Autores principales: Sabitha, R., Karthik, S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4620246/
https://www.ncbi.nlm.nih.gov/pubmed/26543895
http://dx.doi.org/10.1155/2015/180749
_version_ 1782397254918733824
author Sabitha, R.
Karthik, S.
author_facet Sabitha, R.
Karthik, S.
author_sort Sabitha, R.
collection PubMed
description Competent data mining methods are vital to discover knowledge from databases which are built as a result of enormous growth of data. Various techniques of data mining are applied to obtain knowledge from these databases. Data clustering is one such descriptive data mining technique which guides in partitioning data objects into disjoint segments. K-means algorithm is a versatile algorithm among the various approaches used in data clustering. The algorithm and its diverse adaptation methods suffer certain problems in their performance. To overcome these issues a superlative algorithm has been proposed in this paper to perform data clustering. The specific feature of the proposed algorithm is discretizing the dataset, thereby improving the accuracy of clustering, and also adopting the binary search initialization method to generate cluster centroids. The generated centroids are fed as input to K-means approach which iteratively segments the data objects into respective clusters. The clustered results are measured for accuracy and validity. Experiments conducted by testing the approach on datasets from the UC Irvine Machine Learning Repository evidently show that the accuracy and validity measure is higher than the other two approaches, namely, simple K-means and Binary Search method. Thus, the proposed approach proves that discretization process will improve the efficacy of descriptive data mining tasks.
format Online
Article
Text
id pubmed-4620246
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-46202462015-11-05 Convalescing Cluster Configuration Using a Superlative Framework Sabitha, R. Karthik, S. ScientificWorldJournal Research Article Competent data mining methods are vital to discover knowledge from databases which are built as a result of enormous growth of data. Various techniques of data mining are applied to obtain knowledge from these databases. Data clustering is one such descriptive data mining technique which guides in partitioning data objects into disjoint segments. K-means algorithm is a versatile algorithm among the various approaches used in data clustering. The algorithm and its diverse adaptation methods suffer certain problems in their performance. To overcome these issues a superlative algorithm has been proposed in this paper to perform data clustering. The specific feature of the proposed algorithm is discretizing the dataset, thereby improving the accuracy of clustering, and also adopting the binary search initialization method to generate cluster centroids. The generated centroids are fed as input to K-means approach which iteratively segments the data objects into respective clusters. The clustered results are measured for accuracy and validity. Experiments conducted by testing the approach on datasets from the UC Irvine Machine Learning Repository evidently show that the accuracy and validity measure is higher than the other two approaches, namely, simple K-means and Binary Search method. Thus, the proposed approach proves that discretization process will improve the efficacy of descriptive data mining tasks. Hindawi Publishing Corporation 2015 2015-10-12 /pmc/articles/PMC4620246/ /pubmed/26543895 http://dx.doi.org/10.1155/2015/180749 Text en Copyright © 2015 R. Sabitha and S. Karthik. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Sabitha, R.
Karthik, S.
Convalescing Cluster Configuration Using a Superlative Framework
title Convalescing Cluster Configuration Using a Superlative Framework
title_full Convalescing Cluster Configuration Using a Superlative Framework
title_fullStr Convalescing Cluster Configuration Using a Superlative Framework
title_full_unstemmed Convalescing Cluster Configuration Using a Superlative Framework
title_short Convalescing Cluster Configuration Using a Superlative Framework
title_sort convalescing cluster configuration using a superlative framework
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4620246/
https://www.ncbi.nlm.nih.gov/pubmed/26543895
http://dx.doi.org/10.1155/2015/180749
work_keys_str_mv AT sabithar convalescingclusterconfigurationusingasuperlativeframework
AT karthiks convalescingclusterconfigurationusingasuperlativeframework