Cargando…

An Ensemble and Multi-View Clustering Method Based on Kolmogorov Complexity

The ability to build more robust clustering from many clustering models with different solutions is relevant in scenarios with privacy-preserving constraints, where data features have a different nature or where these features are not available in a single computation unit. Additionally, with the bo...

Descripción completa

Detalles Bibliográficos
Autores principales: Zamora, Juan, Sublime, Jérémie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9955949/
https://www.ncbi.nlm.nih.gov/pubmed/36832736
http://dx.doi.org/10.3390/e25020371
_version_ 1784894472605138944
author Zamora, Juan
Sublime, Jérémie
author_facet Zamora, Juan
Sublime, Jérémie
author_sort Zamora, Juan
collection PubMed
description The ability to build more robust clustering from many clustering models with different solutions is relevant in scenarios with privacy-preserving constraints, where data features have a different nature or where these features are not available in a single computation unit. Additionally, with the booming number of multi-view data, but also of clustering algorithms capable of producing a wide variety of representations for the same objects, merging clustering partitions to achieve a single clustering result has become a complex problem with numerous applications. To tackle this problem, we propose a clustering fusion algorithm that takes existing clustering partitions acquired from multiple vector space models, sources, or views, and merges them into a single partition. Our merging method relies on an information theory model based on Kolmogorov complexity that was originally proposed for unsupervised multi-view learning. Our proposed algorithm features a stable merging process and shows competitive results over several real and artificial datasets in comparison with other state-of-the-art methods that have similar goals.
format Online
Article
Text
id pubmed-9955949
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-99559492023-02-25 An Ensemble and Multi-View Clustering Method Based on Kolmogorov Complexity Zamora, Juan Sublime, Jérémie Entropy (Basel) Article The ability to build more robust clustering from many clustering models with different solutions is relevant in scenarios with privacy-preserving constraints, where data features have a different nature or where these features are not available in a single computation unit. Additionally, with the booming number of multi-view data, but also of clustering algorithms capable of producing a wide variety of representations for the same objects, merging clustering partitions to achieve a single clustering result has become a complex problem with numerous applications. To tackle this problem, we propose a clustering fusion algorithm that takes existing clustering partitions acquired from multiple vector space models, sources, or views, and merges them into a single partition. Our merging method relies on an information theory model based on Kolmogorov complexity that was originally proposed for unsupervised multi-view learning. Our proposed algorithm features a stable merging process and shows competitive results over several real and artificial datasets in comparison with other state-of-the-art methods that have similar goals. MDPI 2023-02-17 /pmc/articles/PMC9955949/ /pubmed/36832736 http://dx.doi.org/10.3390/e25020371 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zamora, Juan
Sublime, Jérémie
An Ensemble and Multi-View Clustering Method Based on Kolmogorov Complexity
title An Ensemble and Multi-View Clustering Method Based on Kolmogorov Complexity
title_full An Ensemble and Multi-View Clustering Method Based on Kolmogorov Complexity
title_fullStr An Ensemble and Multi-View Clustering Method Based on Kolmogorov Complexity
title_full_unstemmed An Ensemble and Multi-View Clustering Method Based on Kolmogorov Complexity
title_short An Ensemble and Multi-View Clustering Method Based on Kolmogorov Complexity
title_sort ensemble and multi-view clustering method based on kolmogorov complexity
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9955949/
https://www.ncbi.nlm.nih.gov/pubmed/36832736
http://dx.doi.org/10.3390/e25020371
work_keys_str_mv AT zamorajuan anensembleandmultiviewclusteringmethodbasedonkolmogorovcomplexity
AT sublimejeremie anensembleandmultiviewclusteringmethodbasedonkolmogorovcomplexity
AT zamorajuan ensembleandmultiviewclusteringmethodbasedonkolmogorovcomplexity
AT sublimejeremie ensembleandmultiviewclusteringmethodbasedonkolmogorovcomplexity