Cargando…

Function prediction from networks of local evolutionary similarity in protein structure

BACKGROUND: Annotating protein function with both high accuracy and sensitivity remains a major challenge in structural genomics. One proven computational strategy has been to group a few key functional amino acids into templates and search for these templates in other protein structures, so as to t...

Descripción completa

Detalles Bibliográficos
Autores principales: Erdin, Serkan, Venner, Eric, Lisewski, Andreas Martin, Lichtarge, Olivier
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3584919/
https://www.ncbi.nlm.nih.gov/pubmed/23514548
http://dx.doi.org/10.1186/1471-2105-14-S3-S6
_version_ 1782261077338226688
author Erdin, Serkan
Venner, Eric
Lisewski, Andreas Martin
Lichtarge, Olivier
author_facet Erdin, Serkan
Venner, Eric
Lisewski, Andreas Martin
Lichtarge, Olivier
author_sort Erdin, Serkan
collection PubMed
description BACKGROUND: Annotating protein function with both high accuracy and sensitivity remains a major challenge in structural genomics. One proven computational strategy has been to group a few key functional amino acids into templates and search for these templates in other protein structures, so as to transfer function when a match is found. To this end, we previously developed Evolutionary Trace Annotation (ETA) and showed that diffusing known annotations over a network of template matches on a structural genomic scale improved predictions of function. In order to further increase sensitivity, we now let each protein contribute multiple templates rather than just one, and also let the template size vary. RESULTS: Retrospective benchmarks in 605 Structural Genomics enzymes showed that multiple templates increased sensitivity by up to 14% when combined with single template predictions even as they maintained the accuracy over 91%. Diffusing function globally on networks of single and multiple template matches marginally increased the area under the ROC curve over 0.97, but in a subset of proteins that could not be annotated by ETA, the network approach recovered annotations for the most confident 20-23 of 91 cases with 100% accuracy. CONCLUSIONS: We improve the accuracy and sensitivity of predictions by using multiple templates per protein structure when constructing networks of ETA matches and diffusing annotations.
format Online
Article
Text
id pubmed-3584919
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35849192013-03-11 Function prediction from networks of local evolutionary similarity in protein structure Erdin, Serkan Venner, Eric Lisewski, Andreas Martin Lichtarge, Olivier BMC Bioinformatics Proceedings BACKGROUND: Annotating protein function with both high accuracy and sensitivity remains a major challenge in structural genomics. One proven computational strategy has been to group a few key functional amino acids into templates and search for these templates in other protein structures, so as to transfer function when a match is found. To this end, we previously developed Evolutionary Trace Annotation (ETA) and showed that diffusing known annotations over a network of template matches on a structural genomic scale improved predictions of function. In order to further increase sensitivity, we now let each protein contribute multiple templates rather than just one, and also let the template size vary. RESULTS: Retrospective benchmarks in 605 Structural Genomics enzymes showed that multiple templates increased sensitivity by up to 14% when combined with single template predictions even as they maintained the accuracy over 91%. Diffusing function globally on networks of single and multiple template matches marginally increased the area under the ROC curve over 0.97, but in a subset of proteins that could not be annotated by ETA, the network approach recovered annotations for the most confident 20-23 of 91 cases with 100% accuracy. CONCLUSIONS: We improve the accuracy and sensitivity of predictions by using multiple templates per protein structure when constructing networks of ETA matches and diffusing annotations. BioMed Central 2013-02-28 /pmc/articles/PMC3584919/ /pubmed/23514548 http://dx.doi.org/10.1186/1471-2105-14-S3-S6 Text en Copyright ©2013 Erdin et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Erdin, Serkan
Venner, Eric
Lisewski, Andreas Martin
Lichtarge, Olivier
Function prediction from networks of local evolutionary similarity in protein structure
title Function prediction from networks of local evolutionary similarity in protein structure
title_full Function prediction from networks of local evolutionary similarity in protein structure
title_fullStr Function prediction from networks of local evolutionary similarity in protein structure
title_full_unstemmed Function prediction from networks of local evolutionary similarity in protein structure
title_short Function prediction from networks of local evolutionary similarity in protein structure
title_sort function prediction from networks of local evolutionary similarity in protein structure
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3584919/
https://www.ncbi.nlm.nih.gov/pubmed/23514548
http://dx.doi.org/10.1186/1471-2105-14-S3-S6
work_keys_str_mv AT erdinserkan functionpredictionfromnetworksoflocalevolutionarysimilarityinproteinstructure
AT vennereric functionpredictionfromnetworksoflocalevolutionarysimilarityinproteinstructure
AT lisewskiandreasmartin functionpredictionfromnetworksoflocalevolutionarysimilarityinproteinstructure
AT lichtargeolivier functionpredictionfromnetworksoflocalevolutionarysimilarityinproteinstructure