Cargando…

Hierarchical Information Clustering by Means of Topologically Embedded Graphs

We introduce a graph-theoretic approach to extract clusters and hierarchies in complex data-sets in an unsupervised and deterministic manner, without the use of any prior information. This is achieved by building topologically embedded networks containing the subset of most significant links and ana...

Descripción completa

Detalles Bibliográficos
Autores principales: Song, Won-Min, Di Matteo, T., Aste, Tomaso
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3302882/
https://www.ncbi.nlm.nih.gov/pubmed/22427814
http://dx.doi.org/10.1371/journal.pone.0031929
_version_ 1782226698166599680
author Song, Won-Min
Di Matteo, T.
Aste, Tomaso
author_facet Song, Won-Min
Di Matteo, T.
Aste, Tomaso
author_sort Song, Won-Min
collection PubMed
description We introduce a graph-theoretic approach to extract clusters and hierarchies in complex data-sets in an unsupervised and deterministic manner, without the use of any prior information. This is achieved by building topologically embedded networks containing the subset of most significant links and analyzing the network structure. For a planar embedding, this method provides both the intra-cluster hierarchy, which describes the way clusters are composed, and the inter-cluster hierarchy which describes how clusters gather together. We discuss performance, robustness and reliability of this method by first investigating several artificial data-sets, finding that it can outperform significantly other established approaches. Then we show that our method can successfully differentiate meaningful clusters and hierarchies in a variety of real data-sets. In particular, we find that the application to gene expression patterns of lymphoma samples uncovers biologically significant groups of genes which play key-roles in diagnosis, prognosis and treatment of some of the most relevant human lymphoid malignancies.
format Online
Article
Text
id pubmed-3302882
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-33028822012-03-16 Hierarchical Information Clustering by Means of Topologically Embedded Graphs Song, Won-Min Di Matteo, T. Aste, Tomaso PLoS One Research Article We introduce a graph-theoretic approach to extract clusters and hierarchies in complex data-sets in an unsupervised and deterministic manner, without the use of any prior information. This is achieved by building topologically embedded networks containing the subset of most significant links and analyzing the network structure. For a planar embedding, this method provides both the intra-cluster hierarchy, which describes the way clusters are composed, and the inter-cluster hierarchy which describes how clusters gather together. We discuss performance, robustness and reliability of this method by first investigating several artificial data-sets, finding that it can outperform significantly other established approaches. Then we show that our method can successfully differentiate meaningful clusters and hierarchies in a variety of real data-sets. In particular, we find that the application to gene expression patterns of lymphoma samples uncovers biologically significant groups of genes which play key-roles in diagnosis, prognosis and treatment of some of the most relevant human lymphoid malignancies. Public Library of Science 2012-03-09 /pmc/articles/PMC3302882/ /pubmed/22427814 http://dx.doi.org/10.1371/journal.pone.0031929 Text en Song et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Song, Won-Min
Di Matteo, T.
Aste, Tomaso
Hierarchical Information Clustering by Means of Topologically Embedded Graphs
title Hierarchical Information Clustering by Means of Topologically Embedded Graphs
title_full Hierarchical Information Clustering by Means of Topologically Embedded Graphs
title_fullStr Hierarchical Information Clustering by Means of Topologically Embedded Graphs
title_full_unstemmed Hierarchical Information Clustering by Means of Topologically Embedded Graphs
title_short Hierarchical Information Clustering by Means of Topologically Embedded Graphs
title_sort hierarchical information clustering by means of topologically embedded graphs
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3302882/
https://www.ncbi.nlm.nih.gov/pubmed/22427814
http://dx.doi.org/10.1371/journal.pone.0031929
work_keys_str_mv AT songwonmin hierarchicalinformationclusteringbymeansoftopologicallyembeddedgraphs
AT dimatteot hierarchicalinformationclusteringbymeansoftopologicallyembeddedgraphs
AT astetomaso hierarchicalinformationclusteringbymeansoftopologicallyembeddedgraphs