Cargando…

A new clustering method based on multipartite networks

The clustering problem is one of the most studied and challenging in machine learning, as it attempts to identify similarities within data without any prior knowledge. Among modern clustering algorithms, the network-based ones are some of the most popular. Most of them convert the data into a graph...

Descripción completa

Detalles Bibliográficos
Autor principal: Lung, Rodica-Ioana
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10588715/
https://www.ncbi.nlm.nih.gov/pubmed/37869453
http://dx.doi.org/10.7717/peerj-cs.1621
_version_ 1785123638711681024
author Lung, Rodica-Ioana
author_facet Lung, Rodica-Ioana
author_sort Lung, Rodica-Ioana
collection PubMed
description The clustering problem is one of the most studied and challenging in machine learning, as it attempts to identify similarities within data without any prior knowledge. Among modern clustering algorithms, the network-based ones are some of the most popular. Most of them convert the data into a graph in which instances of the data represent the nodes and a similarity measure is used to add edges. This article proposes a novel approach that uses a multipartite network in which layers correspond to attributes of the data and nodes represent intervals for the data. Clusters are intuitively constructed based on the information provided by the paths in the network. Numerical experiments performed on synthetic and real-world benchmarks are used to illustrate the performance of the approach. As a real application, the method is used to group countries based on health, nutrition, and population information from the World Bank database. The results indicate that the proposed method is comparable in performance with some of the state-of-the-art clustering methods, outperforming them for some data sets.
format Online
Article
Text
id pubmed-10588715
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-105887152023-10-21 A new clustering method based on multipartite networks Lung, Rodica-Ioana PeerJ Comput Sci Artificial Intelligence The clustering problem is one of the most studied and challenging in machine learning, as it attempts to identify similarities within data without any prior knowledge. Among modern clustering algorithms, the network-based ones are some of the most popular. Most of them convert the data into a graph in which instances of the data represent the nodes and a similarity measure is used to add edges. This article proposes a novel approach that uses a multipartite network in which layers correspond to attributes of the data and nodes represent intervals for the data. Clusters are intuitively constructed based on the information provided by the paths in the network. Numerical experiments performed on synthetic and real-world benchmarks are used to illustrate the performance of the approach. As a real application, the method is used to group countries based on health, nutrition, and population information from the World Bank database. The results indicate that the proposed method is comparable in performance with some of the state-of-the-art clustering methods, outperforming them for some data sets. PeerJ Inc. 2023-10-13 /pmc/articles/PMC10588715/ /pubmed/37869453 http://dx.doi.org/10.7717/peerj-cs.1621 Text en ©2023 Lung https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Artificial Intelligence
Lung, Rodica-Ioana
A new clustering method based on multipartite networks
title A new clustering method based on multipartite networks
title_full A new clustering method based on multipartite networks
title_fullStr A new clustering method based on multipartite networks
title_full_unstemmed A new clustering method based on multipartite networks
title_short A new clustering method based on multipartite networks
title_sort new clustering method based on multipartite networks
topic Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10588715/
https://www.ncbi.nlm.nih.gov/pubmed/37869453
http://dx.doi.org/10.7717/peerj-cs.1621
work_keys_str_mv AT lungrodicaioana anewclusteringmethodbasedonmultipartitenetworks
AT lungrodicaioana newclusteringmethodbasedonmultipartitenetworks