Cargando…
Jerarca: Efficient Analysis of Complex Networks Using Hierarchical Clustering
BACKGROUND: How to extract useful information from complex biological networks is a major goal in many fields, especially in genomics and proteomics. We have shown in several works that iterative hierarchical clustering, as implemented in the UVCluster program, is a powerful tool to analyze many of...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2904377/ https://www.ncbi.nlm.nih.gov/pubmed/20644733 http://dx.doi.org/10.1371/journal.pone.0011585 |
_version_ | 1782183873209171968 |
---|---|
author | Aldecoa, Rodrigo Marín, Ignacio |
author_facet | Aldecoa, Rodrigo Marín, Ignacio |
author_sort | Aldecoa, Rodrigo |
collection | PubMed |
description | BACKGROUND: How to extract useful information from complex biological networks is a major goal in many fields, especially in genomics and proteomics. We have shown in several works that iterative hierarchical clustering, as implemented in the UVCluster program, is a powerful tool to analyze many of those networks. However, the amount of computation time required to perform UVCluster analyses imposed significant limitations to its use. METHODOLOGY/PRINCIPAL FINDINGS: We describe the suite Jerarca, designed to efficiently convert networks of interacting units into dendrograms by means of iterative hierarchical clustering. Jerarca is divided into three main sections. First, weighted distances among units are computed using up to three different approaches: a more efficient version of UVCluster and two new, related algorithms called RCluster and SCluster. Second, Jerarca builds dendrograms based on those distances, using well-known phylogenetic algorithms, such as UPGMA or Neighbor-Joining. Finally, Jerarca provides optimal partitions of the trees using statistical criteria based on the distribution of intra- and intercluster connections. Outputs compatible with the phylogenetic software MEGA and the Cytoscape package are generated, allowing the results to be easily visualized. CONCLUSIONS/SIGNIFICANCE: The four main advantages of Jerarca in respect to UVCluster are: 1) Improved speed of a novel UVCluster algorithm; 2) Additional, alternative strategies to perform iterative hierarchical clustering; 3) Automatic evaluation of the hierarchical trees to obtain optimal partitions; and, 4) Outputs compatible with popular software such as MEGA and Cytoscape. |
format | Text |
id | pubmed-2904377 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-29043772010-07-19 Jerarca: Efficient Analysis of Complex Networks Using Hierarchical Clustering Aldecoa, Rodrigo Marín, Ignacio PLoS One Research Article BACKGROUND: How to extract useful information from complex biological networks is a major goal in many fields, especially in genomics and proteomics. We have shown in several works that iterative hierarchical clustering, as implemented in the UVCluster program, is a powerful tool to analyze many of those networks. However, the amount of computation time required to perform UVCluster analyses imposed significant limitations to its use. METHODOLOGY/PRINCIPAL FINDINGS: We describe the suite Jerarca, designed to efficiently convert networks of interacting units into dendrograms by means of iterative hierarchical clustering. Jerarca is divided into three main sections. First, weighted distances among units are computed using up to three different approaches: a more efficient version of UVCluster and two new, related algorithms called RCluster and SCluster. Second, Jerarca builds dendrograms based on those distances, using well-known phylogenetic algorithms, such as UPGMA or Neighbor-Joining. Finally, Jerarca provides optimal partitions of the trees using statistical criteria based on the distribution of intra- and intercluster connections. Outputs compatible with the phylogenetic software MEGA and the Cytoscape package are generated, allowing the results to be easily visualized. CONCLUSIONS/SIGNIFICANCE: The four main advantages of Jerarca in respect to UVCluster are: 1) Improved speed of a novel UVCluster algorithm; 2) Additional, alternative strategies to perform iterative hierarchical clustering; 3) Automatic evaluation of the hierarchical trees to obtain optimal partitions; and, 4) Outputs compatible with popular software such as MEGA and Cytoscape. Public Library of Science 2010-07-14 /pmc/articles/PMC2904377/ /pubmed/20644733 http://dx.doi.org/10.1371/journal.pone.0011585 Text en Aldecoa, Marín. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Aldecoa, Rodrigo Marín, Ignacio Jerarca: Efficient Analysis of Complex Networks Using Hierarchical Clustering |
title | Jerarca: Efficient Analysis of Complex Networks Using Hierarchical Clustering |
title_full | Jerarca: Efficient Analysis of Complex Networks Using Hierarchical Clustering |
title_fullStr | Jerarca: Efficient Analysis of Complex Networks Using Hierarchical Clustering |
title_full_unstemmed | Jerarca: Efficient Analysis of Complex Networks Using Hierarchical Clustering |
title_short | Jerarca: Efficient Analysis of Complex Networks Using Hierarchical Clustering |
title_sort | jerarca: efficient analysis of complex networks using hierarchical clustering |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2904377/ https://www.ncbi.nlm.nih.gov/pubmed/20644733 http://dx.doi.org/10.1371/journal.pone.0011585 |
work_keys_str_mv | AT aldecoarodrigo jerarcaefficientanalysisofcomplexnetworksusinghierarchicalclustering AT marinignacio jerarcaefficientanalysisofcomplexnetworksusinghierarchicalclustering |