Cargando…

Jerarca: Efficient Analysis of Complex Networks Using Hierarchical Clustering

BACKGROUND: How to extract useful information from complex biological networks is a major goal in many fields, especially in genomics and proteomics. We have shown in several works that iterative hierarchical clustering, as implemented in the UVCluster program, is a powerful tool to analyze many of...

Descripción completa

Detalles Bibliográficos
Autores principales: Aldecoa, Rodrigo, Marín, Ignacio
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2904377/
https://www.ncbi.nlm.nih.gov/pubmed/20644733
http://dx.doi.org/10.1371/journal.pone.0011585
_version_ 1782183873209171968
author Aldecoa, Rodrigo
Marín, Ignacio
author_facet Aldecoa, Rodrigo
Marín, Ignacio
author_sort Aldecoa, Rodrigo
collection PubMed
description BACKGROUND: How to extract useful information from complex biological networks is a major goal in many fields, especially in genomics and proteomics. We have shown in several works that iterative hierarchical clustering, as implemented in the UVCluster program, is a powerful tool to analyze many of those networks. However, the amount of computation time required to perform UVCluster analyses imposed significant limitations to its use. METHODOLOGY/PRINCIPAL FINDINGS: We describe the suite Jerarca, designed to efficiently convert networks of interacting units into dendrograms by means of iterative hierarchical clustering. Jerarca is divided into three main sections. First, weighted distances among units are computed using up to three different approaches: a more efficient version of UVCluster and two new, related algorithms called RCluster and SCluster. Second, Jerarca builds dendrograms based on those distances, using well-known phylogenetic algorithms, such as UPGMA or Neighbor-Joining. Finally, Jerarca provides optimal partitions of the trees using statistical criteria based on the distribution of intra- and intercluster connections. Outputs compatible with the phylogenetic software MEGA and the Cytoscape package are generated, allowing the results to be easily visualized. CONCLUSIONS/SIGNIFICANCE: The four main advantages of Jerarca in respect to UVCluster are: 1) Improved speed of a novel UVCluster algorithm; 2) Additional, alternative strategies to perform iterative hierarchical clustering; 3) Automatic evaluation of the hierarchical trees to obtain optimal partitions; and, 4) Outputs compatible with popular software such as MEGA and Cytoscape.
format Text
id pubmed-2904377
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-29043772010-07-19 Jerarca: Efficient Analysis of Complex Networks Using Hierarchical Clustering Aldecoa, Rodrigo Marín, Ignacio PLoS One Research Article BACKGROUND: How to extract useful information from complex biological networks is a major goal in many fields, especially in genomics and proteomics. We have shown in several works that iterative hierarchical clustering, as implemented in the UVCluster program, is a powerful tool to analyze many of those networks. However, the amount of computation time required to perform UVCluster analyses imposed significant limitations to its use. METHODOLOGY/PRINCIPAL FINDINGS: We describe the suite Jerarca, designed to efficiently convert networks of interacting units into dendrograms by means of iterative hierarchical clustering. Jerarca is divided into three main sections. First, weighted distances among units are computed using up to three different approaches: a more efficient version of UVCluster and two new, related algorithms called RCluster and SCluster. Second, Jerarca builds dendrograms based on those distances, using well-known phylogenetic algorithms, such as UPGMA or Neighbor-Joining. Finally, Jerarca provides optimal partitions of the trees using statistical criteria based on the distribution of intra- and intercluster connections. Outputs compatible with the phylogenetic software MEGA and the Cytoscape package are generated, allowing the results to be easily visualized. CONCLUSIONS/SIGNIFICANCE: The four main advantages of Jerarca in respect to UVCluster are: 1) Improved speed of a novel UVCluster algorithm; 2) Additional, alternative strategies to perform iterative hierarchical clustering; 3) Automatic evaluation of the hierarchical trees to obtain optimal partitions; and, 4) Outputs compatible with popular software such as MEGA and Cytoscape. Public Library of Science 2010-07-14 /pmc/articles/PMC2904377/ /pubmed/20644733 http://dx.doi.org/10.1371/journal.pone.0011585 Text en Aldecoa, Marín. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Aldecoa, Rodrigo
Marín, Ignacio
Jerarca: Efficient Analysis of Complex Networks Using Hierarchical Clustering
title Jerarca: Efficient Analysis of Complex Networks Using Hierarchical Clustering
title_full Jerarca: Efficient Analysis of Complex Networks Using Hierarchical Clustering
title_fullStr Jerarca: Efficient Analysis of Complex Networks Using Hierarchical Clustering
title_full_unstemmed Jerarca: Efficient Analysis of Complex Networks Using Hierarchical Clustering
title_short Jerarca: Efficient Analysis of Complex Networks Using Hierarchical Clustering
title_sort jerarca: efficient analysis of complex networks using hierarchical clustering
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2904377/
https://www.ncbi.nlm.nih.gov/pubmed/20644733
http://dx.doi.org/10.1371/journal.pone.0011585
work_keys_str_mv AT aldecoarodrigo jerarcaefficientanalysisofcomplexnetworksusinghierarchicalclustering
AT marinignacio jerarcaefficientanalysisofcomplexnetworksusinghierarchicalclustering