Cargando…

A general strategy to determine the congruence between a hierarchical and a non-hierarchical classification

BACKGROUND: Classification procedures are widely used in phylogenetic inference, the analysis of expression profiles, the study of biological networks, etc. Many algorithms have been proposed to establish the similarity between two different classifications of the same elements. However, methods to...

Descripción completa

Detalles Bibliográficos
Autores principales: Marco, Antonio, Marín, Ignacio
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2213689/
https://www.ncbi.nlm.nih.gov/pubmed/18005402
http://dx.doi.org/10.1186/1471-2105-8-442
_version_ 1782148935117176832
author Marco, Antonio
Marín, Ignacio
author_facet Marco, Antonio
Marín, Ignacio
author_sort Marco, Antonio
collection PubMed
description BACKGROUND: Classification procedures are widely used in phylogenetic inference, the analysis of expression profiles, the study of biological networks, etc. Many algorithms have been proposed to establish the similarity between two different classifications of the same elements. However, methods to determine significant coincidences between hierarchical and non-hierarchical partitions are still poorly developed, in spite of the fact that the search for such coincidences is implicit in many analyses of massive data. RESULTS: We describe a novel strategy to compare a hierarchical and a dichotomic non-hierarchical classification of elements, in order to find clusters in a hierarchical tree in which elements of a given "flat" partition are overrepresented. The key improvement of our strategy respect to previous methods is using permutation analyses of ranked clusters to determine whether regions of the dendrograms present a significant enrichment. We show that this method is more sensitive than previously developed strategies and how it can be applied to several real cases, including microarray and interactome data. Particularly, we use it to compare a hierarchical representation of the yeast mitochondrial interactome and a catalogue of known mitochondrial protein complexes, demonstrating a high level of congruence between those two classifications. We also discuss extensions of this method to other cases which are conceptually related. CONCLUSION: Our method is highly sensitive and outperforms previously described strategies. A PERL script that implements it is available at .
format Text
id pubmed-2213689
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-22136892008-01-25 A general strategy to determine the congruence between a hierarchical and a non-hierarchical classification Marco, Antonio Marín, Ignacio BMC Bioinformatics Methodology Article BACKGROUND: Classification procedures are widely used in phylogenetic inference, the analysis of expression profiles, the study of biological networks, etc. Many algorithms have been proposed to establish the similarity between two different classifications of the same elements. However, methods to determine significant coincidences between hierarchical and non-hierarchical partitions are still poorly developed, in spite of the fact that the search for such coincidences is implicit in many analyses of massive data. RESULTS: We describe a novel strategy to compare a hierarchical and a dichotomic non-hierarchical classification of elements, in order to find clusters in a hierarchical tree in which elements of a given "flat" partition are overrepresented. The key improvement of our strategy respect to previous methods is using permutation analyses of ranked clusters to determine whether regions of the dendrograms present a significant enrichment. We show that this method is more sensitive than previously developed strategies and how it can be applied to several real cases, including microarray and interactome data. Particularly, we use it to compare a hierarchical representation of the yeast mitochondrial interactome and a catalogue of known mitochondrial protein complexes, demonstrating a high level of congruence between those two classifications. We also discuss extensions of this method to other cases which are conceptually related. CONCLUSION: Our method is highly sensitive and outperforms previously described strategies. A PERL script that implements it is available at . BioMed Central 2007-11-15 /pmc/articles/PMC2213689/ /pubmed/18005402 http://dx.doi.org/10.1186/1471-2105-8-442 Text en Copyright © 2007 Marco and Marín; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Marco, Antonio
Marín, Ignacio
A general strategy to determine the congruence between a hierarchical and a non-hierarchical classification
title A general strategy to determine the congruence between a hierarchical and a non-hierarchical classification
title_full A general strategy to determine the congruence between a hierarchical and a non-hierarchical classification
title_fullStr A general strategy to determine the congruence between a hierarchical and a non-hierarchical classification
title_full_unstemmed A general strategy to determine the congruence between a hierarchical and a non-hierarchical classification
title_short A general strategy to determine the congruence between a hierarchical and a non-hierarchical classification
title_sort general strategy to determine the congruence between a hierarchical and a non-hierarchical classification
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2213689/
https://www.ncbi.nlm.nih.gov/pubmed/18005402
http://dx.doi.org/10.1186/1471-2105-8-442
work_keys_str_mv AT marcoantonio ageneralstrategytodeterminethecongruencebetweenahierarchicalandanonhierarchicalclassification
AT marinignacio ageneralstrategytodeterminethecongruencebetweenahierarchicalandanonhierarchicalclassification
AT marcoantonio generalstrategytodeterminethecongruencebetweenahierarchicalandanonhierarchicalclassification
AT marinignacio generalstrategytodeterminethecongruencebetweenahierarchicalandanonhierarchicalclassification