Cargando…

FACT: Functional annotation transfer between proteins with similar feature architectures

BACKGROUND: The increasing number of sequenced genomes provides the basis for exploring the genetic and functional diversity within the tree of life. Only a tiny fraction of the encoded proteins undergoes a thorough experimental characterization. For the remainder, bioinformatics annotation tools ar...

Descripción completa

Detalles Bibliográficos
Autores principales: Koestler, Tina, von Haeseler, Arndt, Ebersberger, Ingo
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2931517/
https://www.ncbi.nlm.nih.gov/pubmed/20696036
http://dx.doi.org/10.1186/1471-2105-11-417
_version_ 1782186055923924992
author Koestler, Tina
von Haeseler, Arndt
Ebersberger, Ingo
author_facet Koestler, Tina
von Haeseler, Arndt
Ebersberger, Ingo
author_sort Koestler, Tina
collection PubMed
description BACKGROUND: The increasing number of sequenced genomes provides the basis for exploring the genetic and functional diversity within the tree of life. Only a tiny fraction of the encoded proteins undergoes a thorough experimental characterization. For the remainder, bioinformatics annotation tools are the only means to infer their function. Exploiting significant sequence similarities to already characterized proteins, commonly taken as evidence for homology, is the prevalent method to deduce functional equivalence. Such methods fail when homologs are too diverged, or when they have assumed a different function. Finally, due to convergent evolution, functional equivalence is not necessarily linked to common ancestry. Therefore complementary approaches are required to identify functional equivalents. RESULTS: We present the Feature Architecture Comparison Tool http://www.cibiv.at/FACT to search for functionally equivalent proteins. FACT uses the similarity between feature architectures of two proteins, i.e., the arrangements of functional domains, secondary structure elements and compositional properties, as a proxy for their functional equivalence. A scoring function measures feature architecture similarities, which enables searching for functional equivalents in entire proteomes. Our evaluation of 9,570 EC classified enzymes revealed that FACT, using the full feature, set outperformed the existing architecture-based approaches by identifying significantly more functional equivalents as highest scoring proteins. We show that FACT can identify functional equivalents that share no significant sequence similarity. However, when the highest scoring protein of FACT is also the protein with the highest local sequence similarity, it is in 99% of the cases functionally equivalent to the query. We demonstrate the versatility of FACT by identifying a missing link in the yeast glutathione metabolism and also by searching for the human GolgA5 equivalent in Trypanosoma brucei. CONCLUSIONS: FACT facilitates a quick and sensitive search for functionally equivalent proteins in entire proteomes. FACT is complementary to approaches using sequence similarity to identify proteins with the same function. Thus, FACT is particularly useful when functional equivalents need to be identified in evolutionarily distant species, or when functional equivalents are not homologous. The most reliable annotation transfers, however, are achieved when feature architecture similarity and sequence similarity are jointly taken into account.
format Text
id pubmed-2931517
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29315172010-09-07 FACT: Functional annotation transfer between proteins with similar feature architectures Koestler, Tina von Haeseler, Arndt Ebersberger, Ingo BMC Bioinformatics Methodology Article BACKGROUND: The increasing number of sequenced genomes provides the basis for exploring the genetic and functional diversity within the tree of life. Only a tiny fraction of the encoded proteins undergoes a thorough experimental characterization. For the remainder, bioinformatics annotation tools are the only means to infer their function. Exploiting significant sequence similarities to already characterized proteins, commonly taken as evidence for homology, is the prevalent method to deduce functional equivalence. Such methods fail when homologs are too diverged, or when they have assumed a different function. Finally, due to convergent evolution, functional equivalence is not necessarily linked to common ancestry. Therefore complementary approaches are required to identify functional equivalents. RESULTS: We present the Feature Architecture Comparison Tool http://www.cibiv.at/FACT to search for functionally equivalent proteins. FACT uses the similarity between feature architectures of two proteins, i.e., the arrangements of functional domains, secondary structure elements and compositional properties, as a proxy for their functional equivalence. A scoring function measures feature architecture similarities, which enables searching for functional equivalents in entire proteomes. Our evaluation of 9,570 EC classified enzymes revealed that FACT, using the full feature, set outperformed the existing architecture-based approaches by identifying significantly more functional equivalents as highest scoring proteins. We show that FACT can identify functional equivalents that share no significant sequence similarity. However, when the highest scoring protein of FACT is also the protein with the highest local sequence similarity, it is in 99% of the cases functionally equivalent to the query. We demonstrate the versatility of FACT by identifying a missing link in the yeast glutathione metabolism and also by searching for the human GolgA5 equivalent in Trypanosoma brucei. CONCLUSIONS: FACT facilitates a quick and sensitive search for functionally equivalent proteins in entire proteomes. FACT is complementary to approaches using sequence similarity to identify proteins with the same function. Thus, FACT is particularly useful when functional equivalents need to be identified in evolutionarily distant species, or when functional equivalents are not homologous. The most reliable annotation transfers, however, are achieved when feature architecture similarity and sequence similarity are jointly taken into account. BioMed Central 2010-08-09 /pmc/articles/PMC2931517/ /pubmed/20696036 http://dx.doi.org/10.1186/1471-2105-11-417 Text en Copyright ©2010 Koestler et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Koestler, Tina
von Haeseler, Arndt
Ebersberger, Ingo
FACT: Functional annotation transfer between proteins with similar feature architectures
title FACT: Functional annotation transfer between proteins with similar feature architectures
title_full FACT: Functional annotation transfer between proteins with similar feature architectures
title_fullStr FACT: Functional annotation transfer between proteins with similar feature architectures
title_full_unstemmed FACT: Functional annotation transfer between proteins with similar feature architectures
title_short FACT: Functional annotation transfer between proteins with similar feature architectures
title_sort fact: functional annotation transfer between proteins with similar feature architectures
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2931517/
https://www.ncbi.nlm.nih.gov/pubmed/20696036
http://dx.doi.org/10.1186/1471-2105-11-417
work_keys_str_mv AT koestlertina factfunctionalannotationtransferbetweenproteinswithsimilarfeaturearchitectures
AT vonhaeselerarndt factfunctionalannotationtransferbetweenproteinswithsimilarfeaturearchitectures
AT ebersbergeringo factfunctionalannotationtransferbetweenproteinswithsimilarfeaturearchitectures