Cargando…

CORRIE: enzyme sequence annotation with confidence estimates

Using a previously developed automated method for enzyme annotation, we report the re-annotation of the ENZYME database and the analysis of local error rates per class. In control experiments, we demonstrate that the method is able to correctly re-annotate 91% of all Enzyme Classification (EC) class...

Descripción completa

Detalles Bibliográficos
Autores principales: Audit, Benjamin, Levy, Emmanuel D, Gilks, Wally R, Goldovsky, Leon, Ouzounis, Christos A
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1892082/
https://www.ncbi.nlm.nih.gov/pubmed/17570146
http://dx.doi.org/10.1186/1471-2105-8-S4-S3
_version_ 1782133822995824640
author Audit, Benjamin
Levy, Emmanuel D
Gilks, Wally R
Goldovsky, Leon
Ouzounis, Christos A
author_facet Audit, Benjamin
Levy, Emmanuel D
Gilks, Wally R
Goldovsky, Leon
Ouzounis, Christos A
author_sort Audit, Benjamin
collection PubMed
description Using a previously developed automated method for enzyme annotation, we report the re-annotation of the ENZYME database and the analysis of local error rates per class. In control experiments, we demonstrate that the method is able to correctly re-annotate 91% of all Enzyme Classification (EC) classes with high coverage (755 out of 827). Only 44 enzyme classes are found to contain false positives, while the remaining 28 enzyme classes are not represented. We also show cases where the re-annotation procedure results in partial overlaps for those few enzyme classes where a certain inconsistency might appear between homologous proteins, mostly due to function specificity. Our results allow the interactive exploration of the EC hierarchy for known enzyme families as well as putative enzyme sequences that may need to be classified within the EC hierarchy. These aspects of our framework have been incorporated into a web-server, called CORRIE, which stands for Correspondence Indicator Estimation and allows the interactive prediction of a functional class for putative enzymes from sequence alone, supported by probabilistic measures in the context of the pre-calculated Correspondence Indicators of known enzymes with the functional classes of the EC hierarchy. The CORRIE server is available at: .
format Text
id pubmed-1892082
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-18920822007-06-15 CORRIE: enzyme sequence annotation with confidence estimates Audit, Benjamin Levy, Emmanuel D Gilks, Wally R Goldovsky, Leon Ouzounis, Christos A BMC Bioinformatics Proceedings Using a previously developed automated method for enzyme annotation, we report the re-annotation of the ENZYME database and the analysis of local error rates per class. In control experiments, we demonstrate that the method is able to correctly re-annotate 91% of all Enzyme Classification (EC) classes with high coverage (755 out of 827). Only 44 enzyme classes are found to contain false positives, while the remaining 28 enzyme classes are not represented. We also show cases where the re-annotation procedure results in partial overlaps for those few enzyme classes where a certain inconsistency might appear between homologous proteins, mostly due to function specificity. Our results allow the interactive exploration of the EC hierarchy for known enzyme families as well as putative enzyme sequences that may need to be classified within the EC hierarchy. These aspects of our framework have been incorporated into a web-server, called CORRIE, which stands for Correspondence Indicator Estimation and allows the interactive prediction of a functional class for putative enzymes from sequence alone, supported by probabilistic measures in the context of the pre-calculated Correspondence Indicators of known enzymes with the functional classes of the EC hierarchy. The CORRIE server is available at: . BioMed Central 2007-05-22 /pmc/articles/PMC1892082/ /pubmed/17570146 http://dx.doi.org/10.1186/1471-2105-8-S4-S3 Text en Copyright © 2007 Audit et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Audit, Benjamin
Levy, Emmanuel D
Gilks, Wally R
Goldovsky, Leon
Ouzounis, Christos A
CORRIE: enzyme sequence annotation with confidence estimates
title CORRIE: enzyme sequence annotation with confidence estimates
title_full CORRIE: enzyme sequence annotation with confidence estimates
title_fullStr CORRIE: enzyme sequence annotation with confidence estimates
title_full_unstemmed CORRIE: enzyme sequence annotation with confidence estimates
title_short CORRIE: enzyme sequence annotation with confidence estimates
title_sort corrie: enzyme sequence annotation with confidence estimates
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1892082/
https://www.ncbi.nlm.nih.gov/pubmed/17570146
http://dx.doi.org/10.1186/1471-2105-8-S4-S3
work_keys_str_mv AT auditbenjamin corrieenzymesequenceannotationwithconfidenceestimates
AT levyemmanueld corrieenzymesequenceannotationwithconfidenceestimates
AT gilkswallyr corrieenzymesequenceannotationwithconfidenceestimates
AT goldovskyleon corrieenzymesequenceannotationwithconfidenceestimates
AT ouzounischristosa corrieenzymesequenceannotationwithconfidenceestimates