Cargando…

A New Validity Index Based on Fuzzy Energy and Fuzzy Entropy Measures in Fuzzy Clustering Problems

Two well-known drawbacks in fuzzy clustering are the requirement of assigning in advance the number of clusters and random initialization of cluster centers. The quality of the final fuzzy clusters depends heavily on the initial choice of the number of clusters and the initialization of the clusters...

Descripción completa

Detalles Bibliográficos
Autores principales: Martino, Ferdinando Di, Sessa, Salvatore
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7712182/
https://www.ncbi.nlm.nih.gov/pubmed/33286968
http://dx.doi.org/10.3390/e22111200
_version_ 1783618315259740160
author Martino, Ferdinando Di
Sessa, Salvatore
author_facet Martino, Ferdinando Di
Sessa, Salvatore
author_sort Martino, Ferdinando Di
collection PubMed
description Two well-known drawbacks in fuzzy clustering are the requirement of assigning in advance the number of clusters and random initialization of cluster centers. The quality of the final fuzzy clusters depends heavily on the initial choice of the number of clusters and the initialization of the clusters, then, it is necessary to apply a validity index to measure the compactness and the separability of the final clusters and run the clustering algorithm several times. We propose a new fuzzy C-means algorithm in which a validity index based on the concepts of maximum fuzzy energy and minimum fuzzy entropy is applied to initialize the cluster centers and to find the optimal number of clusters and initial cluster centers in order to obtain a good clustering quality, without increasing time consumption. We test our algorithm on UCI (University of California at Irvine) machine learning classification datasets comparing the results with the ones obtained by using well-known validity indices and variations of fuzzy C-means by using optimization algorithms in the initialization phase. The comparison results show that our algorithm represents an optimal trade-off between the quality of clustering and the time consumption.
format Online
Article
Text
id pubmed-7712182
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-77121822021-02-24 A New Validity Index Based on Fuzzy Energy and Fuzzy Entropy Measures in Fuzzy Clustering Problems Martino, Ferdinando Di Sessa, Salvatore Entropy (Basel) Article Two well-known drawbacks in fuzzy clustering are the requirement of assigning in advance the number of clusters and random initialization of cluster centers. The quality of the final fuzzy clusters depends heavily on the initial choice of the number of clusters and the initialization of the clusters, then, it is necessary to apply a validity index to measure the compactness and the separability of the final clusters and run the clustering algorithm several times. We propose a new fuzzy C-means algorithm in which a validity index based on the concepts of maximum fuzzy energy and minimum fuzzy entropy is applied to initialize the cluster centers and to find the optimal number of clusters and initial cluster centers in order to obtain a good clustering quality, without increasing time consumption. We test our algorithm on UCI (University of California at Irvine) machine learning classification datasets comparing the results with the ones obtained by using well-known validity indices and variations of fuzzy C-means by using optimization algorithms in the initialization phase. The comparison results show that our algorithm represents an optimal trade-off between the quality of clustering and the time consumption. MDPI 2020-10-23 /pmc/articles/PMC7712182/ /pubmed/33286968 http://dx.doi.org/10.3390/e22111200 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Martino, Ferdinando Di
Sessa, Salvatore
A New Validity Index Based on Fuzzy Energy and Fuzzy Entropy Measures in Fuzzy Clustering Problems
title A New Validity Index Based on Fuzzy Energy and Fuzzy Entropy Measures in Fuzzy Clustering Problems
title_full A New Validity Index Based on Fuzzy Energy and Fuzzy Entropy Measures in Fuzzy Clustering Problems
title_fullStr A New Validity Index Based on Fuzzy Energy and Fuzzy Entropy Measures in Fuzzy Clustering Problems
title_full_unstemmed A New Validity Index Based on Fuzzy Energy and Fuzzy Entropy Measures in Fuzzy Clustering Problems
title_short A New Validity Index Based on Fuzzy Energy and Fuzzy Entropy Measures in Fuzzy Clustering Problems
title_sort new validity index based on fuzzy energy and fuzzy entropy measures in fuzzy clustering problems
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7712182/
https://www.ncbi.nlm.nih.gov/pubmed/33286968
http://dx.doi.org/10.3390/e22111200
work_keys_str_mv AT martinoferdinandodi anewvalidityindexbasedonfuzzyenergyandfuzzyentropymeasuresinfuzzyclusteringproblems
AT sessasalvatore anewvalidityindexbasedonfuzzyenergyandfuzzyentropymeasuresinfuzzyclusteringproblems
AT martinoferdinandodi newvalidityindexbasedonfuzzyenergyandfuzzyentropymeasuresinfuzzyclusteringproblems
AT sessasalvatore newvalidityindexbasedonfuzzyenergyandfuzzyentropymeasuresinfuzzyclusteringproblems