Cargando…

BrEPS: a flexible and automatic protocol to compute enzyme-specific sequence profiles for functional annotation

BACKGROUND: Models for the simulation of metabolic networks require the accurate prediction of enzyme function. Based on a genomic sequence, enzymatic functions of gene products are today mainly predicted by sequence database searching and operon analysis. Other methods can support these techniques:...

Descripción completa

Detalles Bibliográficos
Autores principales: Bannert, C, Welfle, A, aus dem Spring, C, Schomburg, D
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3009691/
https://www.ncbi.nlm.nih.gov/pubmed/21122127
http://dx.doi.org/10.1186/1471-2105-11-589
_version_ 1782194730772201472
author Bannert, C
Welfle, A
aus dem Spring, C
Schomburg, D
author_facet Bannert, C
Welfle, A
aus dem Spring, C
Schomburg, D
author_sort Bannert, C
collection PubMed
description BACKGROUND: Models for the simulation of metabolic networks require the accurate prediction of enzyme function. Based on a genomic sequence, enzymatic functions of gene products are today mainly predicted by sequence database searching and operon analysis. Other methods can support these techniques: We have developed an automatic method "BrEPS" that creates highly specific sequence patterns for the functional annotation of enzymes. RESULTS: The enzymes in the UniprotKB are identified and their sequences compared against each other with BLAST. The enzymes are then clustered into a number of trees, where each tree node is associated with a set of EC-numbers. The enzyme sequences in the tree nodes are aligned with ClustalW. The conserved columns of the resulting multiple alignments are used to construct sequence patterns. In the last step, we verify the quality of the patterns by computing their specificity. Patterns with low specificity are omitted and recomputed further down in the tree. The final high-quality patterns can be used for functional annotation. We ran our protocol on a recent Swiss-Prot release and show statistics, as well as a comparison to PRIAM, a probabilistic method that is also specialized on the functional annotation of enzymes. We determine the amount of true positive annotations for five common microorganisms with data from BRENDA and AMENDA serving as standard of truth. BrEPS is almost on par with PRIAM, a fact which we discuss in the context of five manually investigated cases. CONCLUSIONS: Our protocol computes highly specific sequence patterns that can be used to support the functional annotation of enzymes. The main advantages of our method are that it is automatic and unsupervised, and quite fast once the patterns are evaluated. The results show that BrEPS can be a valuable addition to the reconstruction of metabolic networks.
format Text
id pubmed-3009691
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30096912010-12-24 BrEPS: a flexible and automatic protocol to compute enzyme-specific sequence profiles for functional annotation Bannert, C Welfle, A aus dem Spring, C Schomburg, D BMC Bioinformatics Methodology Article BACKGROUND: Models for the simulation of metabolic networks require the accurate prediction of enzyme function. Based on a genomic sequence, enzymatic functions of gene products are today mainly predicted by sequence database searching and operon analysis. Other methods can support these techniques: We have developed an automatic method "BrEPS" that creates highly specific sequence patterns for the functional annotation of enzymes. RESULTS: The enzymes in the UniprotKB are identified and their sequences compared against each other with BLAST. The enzymes are then clustered into a number of trees, where each tree node is associated with a set of EC-numbers. The enzyme sequences in the tree nodes are aligned with ClustalW. The conserved columns of the resulting multiple alignments are used to construct sequence patterns. In the last step, we verify the quality of the patterns by computing their specificity. Patterns with low specificity are omitted and recomputed further down in the tree. The final high-quality patterns can be used for functional annotation. We ran our protocol on a recent Swiss-Prot release and show statistics, as well as a comparison to PRIAM, a probabilistic method that is also specialized on the functional annotation of enzymes. We determine the amount of true positive annotations for five common microorganisms with data from BRENDA and AMENDA serving as standard of truth. BrEPS is almost on par with PRIAM, a fact which we discuss in the context of five manually investigated cases. CONCLUSIONS: Our protocol computes highly specific sequence patterns that can be used to support the functional annotation of enzymes. The main advantages of our method are that it is automatic and unsupervised, and quite fast once the patterns are evaluated. The results show that BrEPS can be a valuable addition to the reconstruction of metabolic networks. BioMed Central 2010-12-01 /pmc/articles/PMC3009691/ /pubmed/21122127 http://dx.doi.org/10.1186/1471-2105-11-589 Text en Copyright ©2010 Bannert et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Bannert, C
Welfle, A
aus dem Spring, C
Schomburg, D
BrEPS: a flexible and automatic protocol to compute enzyme-specific sequence profiles for functional annotation
title BrEPS: a flexible and automatic protocol to compute enzyme-specific sequence profiles for functional annotation
title_full BrEPS: a flexible and automatic protocol to compute enzyme-specific sequence profiles for functional annotation
title_fullStr BrEPS: a flexible and automatic protocol to compute enzyme-specific sequence profiles for functional annotation
title_full_unstemmed BrEPS: a flexible and automatic protocol to compute enzyme-specific sequence profiles for functional annotation
title_short BrEPS: a flexible and automatic protocol to compute enzyme-specific sequence profiles for functional annotation
title_sort breps: a flexible and automatic protocol to compute enzyme-specific sequence profiles for functional annotation
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3009691/
https://www.ncbi.nlm.nih.gov/pubmed/21122127
http://dx.doi.org/10.1186/1471-2105-11-589
work_keys_str_mv AT bannertc brepsaflexibleandautomaticprotocoltocomputeenzymespecificsequenceprofilesforfunctionalannotation
AT welflea brepsaflexibleandautomaticprotocoltocomputeenzymespecificsequenceprofilesforfunctionalannotation
AT ausdemspringc brepsaflexibleandautomaticprotocoltocomputeenzymespecificsequenceprofilesforfunctionalannotation
AT schomburgd brepsaflexibleandautomaticprotocoltocomputeenzymespecificsequenceprofilesforfunctionalannotation