Cargando…

Divergence-Based Locally Weighted Ensemble Clustering with Dictionary Learning and L(2,1)-Norm

Accurate clustering is a challenging task with unlabeled data. Ensemble clustering aims to combine sets of base clusterings to obtain a better and more stable clustering and has shown its ability to improve clustering accuracy. Dense representation ensemble clustering (DREC) and entropy-based locall...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Jiaxuan, Wu, Jiang, Li, Taiyong, Nan, Yang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9601663/
https://www.ncbi.nlm.nih.gov/pubmed/37420344
http://dx.doi.org/10.3390/e24101324
_version_ 1784817121179467776
author Xu, Jiaxuan
Wu, Jiang
Li, Taiyong
Nan, Yang
author_facet Xu, Jiaxuan
Wu, Jiang
Li, Taiyong
Nan, Yang
author_sort Xu, Jiaxuan
collection PubMed
description Accurate clustering is a challenging task with unlabeled data. Ensemble clustering aims to combine sets of base clusterings to obtain a better and more stable clustering and has shown its ability to improve clustering accuracy. Dense representation ensemble clustering (DREC) and entropy-based locally weighted ensemble clustering (ELWEC) are two typical methods for ensemble clustering. However, DREC treats each microcluster equally and hence, ignores the differences between each microcluster, while ELWEC conducts clustering on clusters rather than microclusters and ignores the sample–cluster relationship. To address these issues, a divergence-based locally weighted ensemble clustering with dictionary learning (DLWECDL) is proposed in this paper. Specifically, the DLWECDL consists of four phases. First, the clusters from the base clustering are used to generate microclusters. Second, a Kullback–Leibler divergence-based ensemble-driven cluster index is used to measure the weight of each microcluster. With these weights, an ensemble clustering algorithm with dictionary learning and the [Formula: see text]-norm is employed in the third phase. Meanwhile, the objective function is resolved by optimizing four subproblems and a similarity matrix is learned. Finally, a normalized cut (Ncut) is used to partition the similarity matrix and the ensemble clustering results are obtained. In this study, the proposed DLWECDL was validated on 20 widely used datasets and compared to some other state-of-the-art ensemble clustering methods. The experimental results demonstrated that the proposed DLWECDL is a very promising method for ensemble clustering.
format Online
Article
Text
id pubmed-9601663
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96016632022-10-27 Divergence-Based Locally Weighted Ensemble Clustering with Dictionary Learning and L(2,1)-Norm Xu, Jiaxuan Wu, Jiang Li, Taiyong Nan, Yang Entropy (Basel) Article Accurate clustering is a challenging task with unlabeled data. Ensemble clustering aims to combine sets of base clusterings to obtain a better and more stable clustering and has shown its ability to improve clustering accuracy. Dense representation ensemble clustering (DREC) and entropy-based locally weighted ensemble clustering (ELWEC) are two typical methods for ensemble clustering. However, DREC treats each microcluster equally and hence, ignores the differences between each microcluster, while ELWEC conducts clustering on clusters rather than microclusters and ignores the sample–cluster relationship. To address these issues, a divergence-based locally weighted ensemble clustering with dictionary learning (DLWECDL) is proposed in this paper. Specifically, the DLWECDL consists of four phases. First, the clusters from the base clustering are used to generate microclusters. Second, a Kullback–Leibler divergence-based ensemble-driven cluster index is used to measure the weight of each microcluster. With these weights, an ensemble clustering algorithm with dictionary learning and the [Formula: see text]-norm is employed in the third phase. Meanwhile, the objective function is resolved by optimizing four subproblems and a similarity matrix is learned. Finally, a normalized cut (Ncut) is used to partition the similarity matrix and the ensemble clustering results are obtained. In this study, the proposed DLWECDL was validated on 20 widely used datasets and compared to some other state-of-the-art ensemble clustering methods. The experimental results demonstrated that the proposed DLWECDL is a very promising method for ensemble clustering. MDPI 2022-09-21 /pmc/articles/PMC9601663/ /pubmed/37420344 http://dx.doi.org/10.3390/e24101324 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Xu, Jiaxuan
Wu, Jiang
Li, Taiyong
Nan, Yang
Divergence-Based Locally Weighted Ensemble Clustering with Dictionary Learning and L(2,1)-Norm
title Divergence-Based Locally Weighted Ensemble Clustering with Dictionary Learning and L(2,1)-Norm
title_full Divergence-Based Locally Weighted Ensemble Clustering with Dictionary Learning and L(2,1)-Norm
title_fullStr Divergence-Based Locally Weighted Ensemble Clustering with Dictionary Learning and L(2,1)-Norm
title_full_unstemmed Divergence-Based Locally Weighted Ensemble Clustering with Dictionary Learning and L(2,1)-Norm
title_short Divergence-Based Locally Weighted Ensemble Clustering with Dictionary Learning and L(2,1)-Norm
title_sort divergence-based locally weighted ensemble clustering with dictionary learning and l(2,1)-norm
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9601663/
https://www.ncbi.nlm.nih.gov/pubmed/37420344
http://dx.doi.org/10.3390/e24101324
work_keys_str_mv AT xujiaxuan divergencebasedlocallyweightedensembleclusteringwithdictionarylearningandl21norm
AT wujiang divergencebasedlocallyweightedensembleclusteringwithdictionarylearningandl21norm
AT litaiyong divergencebasedlocallyweightedensembleclusteringwithdictionarylearningandl21norm
AT nanyang divergencebasedlocallyweightedensembleclusteringwithdictionarylearningandl21norm