Cargando…

Defining functional distances over Gene Ontology

BACKGROUND: A fundamental problem when trying to define the functional relationships between proteins is the difficulty in quantifying functional similarities, even when well-structured ontologies exist regarding the activity of proteins (i.e. 'gene ontology' -GO-). However, functional met...

Descripción completa

Detalles Bibliográficos
Autores principales: del Pozo, Angela, Pazos, Florencio, Valencia, Alfonso
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2375122/
https://www.ncbi.nlm.nih.gov/pubmed/18221506
http://dx.doi.org/10.1186/1471-2105-9-50
_version_ 1782154582534651904
author del Pozo, Angela
Pazos, Florencio
Valencia, Alfonso
author_facet del Pozo, Angela
Pazos, Florencio
Valencia, Alfonso
author_sort del Pozo, Angela
collection PubMed
description BACKGROUND: A fundamental problem when trying to define the functional relationships between proteins is the difficulty in quantifying functional similarities, even when well-structured ontologies exist regarding the activity of proteins (i.e. 'gene ontology' -GO-). However, functional metrics can overcome the problems in the comparing and evaluating functional assignments and predictions. As a reference of proximity, previous approaches to compare GO terms considered linkage in terms of ontology weighted by a probability distribution that balances the non-uniform 'richness' of different parts of the Direct Acyclic Graph. Here, we have followed a different approach to quantify functional similarities between GO terms. RESULTS: We propose a new method to derive 'functional distances' between GO terms that is based on the simultaneous occurrence of terms in the same set of Interpro entries, instead of relying on the structure of the GO. The coincidence of GO terms reveals natural biological links between the GO functions and defines a distance model D(f )which fulfils the properties of a Metric Space. The distances obtained in this way can be represented as a hierarchical 'Functional Tree'. CONCLUSION: The method proposed provides a new definition of distance that enables the similarity between GO terms to be quantified. Additionally, the 'Functional Tree' defines groups with biological meaning enhancing its utility for protein function comparison and prediction. Finally, this approach could be for function-based protein searches in databases, and for analysing the gene clusters produced by DNA array experiments.
format Text
id pubmed-2375122
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-23751222008-05-12 Defining functional distances over Gene Ontology del Pozo, Angela Pazos, Florencio Valencia, Alfonso BMC Bioinformatics Methodology Article BACKGROUND: A fundamental problem when trying to define the functional relationships between proteins is the difficulty in quantifying functional similarities, even when well-structured ontologies exist regarding the activity of proteins (i.e. 'gene ontology' -GO-). However, functional metrics can overcome the problems in the comparing and evaluating functional assignments and predictions. As a reference of proximity, previous approaches to compare GO terms considered linkage in terms of ontology weighted by a probability distribution that balances the non-uniform 'richness' of different parts of the Direct Acyclic Graph. Here, we have followed a different approach to quantify functional similarities between GO terms. RESULTS: We propose a new method to derive 'functional distances' between GO terms that is based on the simultaneous occurrence of terms in the same set of Interpro entries, instead of relying on the structure of the GO. The coincidence of GO terms reveals natural biological links between the GO functions and defines a distance model D(f )which fulfils the properties of a Metric Space. The distances obtained in this way can be represented as a hierarchical 'Functional Tree'. CONCLUSION: The method proposed provides a new definition of distance that enables the similarity between GO terms to be quantified. Additionally, the 'Functional Tree' defines groups with biological meaning enhancing its utility for protein function comparison and prediction. Finally, this approach could be for function-based protein searches in databases, and for analysing the gene clusters produced by DNA array experiments. BioMed Central 2008-01-25 /pmc/articles/PMC2375122/ /pubmed/18221506 http://dx.doi.org/10.1186/1471-2105-9-50 Text en Copyright © 2008 del Pozo et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
del Pozo, Angela
Pazos, Florencio
Valencia, Alfonso
Defining functional distances over Gene Ontology
title Defining functional distances over Gene Ontology
title_full Defining functional distances over Gene Ontology
title_fullStr Defining functional distances over Gene Ontology
title_full_unstemmed Defining functional distances over Gene Ontology
title_short Defining functional distances over Gene Ontology
title_sort defining functional distances over gene ontology
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2375122/
https://www.ncbi.nlm.nih.gov/pubmed/18221506
http://dx.doi.org/10.1186/1471-2105-9-50
work_keys_str_mv AT delpozoangela definingfunctionaldistancesovergeneontology
AT pazosflorencio definingfunctionaldistancesovergeneontology
AT valenciaalfonso definingfunctionaldistancesovergeneontology