Cargando…

Accurate Protein Structure Annotation through Competitive Diffusion of Enzymatic Functions over a Network of Local Evolutionary Similarities

High-throughput Structural Genomics yields many new protein structures without known molecular function. This study aims to uncover these missing annotations by globally comparing select functional residues across the structural proteome. First, Evolutionary Trace Annotation, or ETA, identifies whic...

Descripción completa

Detalles Bibliográficos
Autores principales: Venner, Eric, Lisewski, Andreas Martin, Erdin, Serkan, Ward, R. Matthew, Amin, Shivas R., Lichtarge, Olivier
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3001439/
https://www.ncbi.nlm.nih.gov/pubmed/21179190
http://dx.doi.org/10.1371/journal.pone.0014286
_version_ 1782193627273887744
author Venner, Eric
Lisewski, Andreas Martin
Erdin, Serkan
Ward, R. Matthew
Amin, Shivas R.
Lichtarge, Olivier
author_facet Venner, Eric
Lisewski, Andreas Martin
Erdin, Serkan
Ward, R. Matthew
Amin, Shivas R.
Lichtarge, Olivier
author_sort Venner, Eric
collection PubMed
description High-throughput Structural Genomics yields many new protein structures without known molecular function. This study aims to uncover these missing annotations by globally comparing select functional residues across the structural proteome. First, Evolutionary Trace Annotation, or ETA, identifies which proteins have local evolutionary and structural features in common; next, these proteins are linked together into a proteomic network of ETA similarities; then, starting from proteins with known functions, competing functional labels diffuse link-by-link over the entire network. Every node is thus assigned a likelihood z-score for every function, and the most significant one at each node wins and defines its annotation. In high-throughput controls, this competitive diffusion process recovered enzyme activity annotations with 99% and 97% accuracy at half-coverage for the third and fourth Enzyme Commission (EC) levels, respectively. This corresponds to false positive rates 4-fold lower than nearest-neighbor and 5-fold lower than sequence-based annotations. In practice, experimental validation of the predicted carboxylesterase activity in a protein from Staphylococcus aureus illustrated the effectiveness of this approach in the context of an increasingly drug-resistant microbe. This study further links molecular function to a small number of evolutionarily important residues recognizable by Evolutionary Tracing and it points to the specificity and sensitivity of functional annotation by competitive global network diffusion. A web server is at http://mammoth.bcm.tmc.edu/networks.
format Text
id pubmed-3001439
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-30014392010-12-21 Accurate Protein Structure Annotation through Competitive Diffusion of Enzymatic Functions over a Network of Local Evolutionary Similarities Venner, Eric Lisewski, Andreas Martin Erdin, Serkan Ward, R. Matthew Amin, Shivas R. Lichtarge, Olivier PLoS One Research Article High-throughput Structural Genomics yields many new protein structures without known molecular function. This study aims to uncover these missing annotations by globally comparing select functional residues across the structural proteome. First, Evolutionary Trace Annotation, or ETA, identifies which proteins have local evolutionary and structural features in common; next, these proteins are linked together into a proteomic network of ETA similarities; then, starting from proteins with known functions, competing functional labels diffuse link-by-link over the entire network. Every node is thus assigned a likelihood z-score for every function, and the most significant one at each node wins and defines its annotation. In high-throughput controls, this competitive diffusion process recovered enzyme activity annotations with 99% and 97% accuracy at half-coverage for the third and fourth Enzyme Commission (EC) levels, respectively. This corresponds to false positive rates 4-fold lower than nearest-neighbor and 5-fold lower than sequence-based annotations. In practice, experimental validation of the predicted carboxylesterase activity in a protein from Staphylococcus aureus illustrated the effectiveness of this approach in the context of an increasingly drug-resistant microbe. This study further links molecular function to a small number of evolutionarily important residues recognizable by Evolutionary Tracing and it points to the specificity and sensitivity of functional annotation by competitive global network diffusion. A web server is at http://mammoth.bcm.tmc.edu/networks. Public Library of Science 2010-12-13 /pmc/articles/PMC3001439/ /pubmed/21179190 http://dx.doi.org/10.1371/journal.pone.0014286 Text en Venner et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Venner, Eric
Lisewski, Andreas Martin
Erdin, Serkan
Ward, R. Matthew
Amin, Shivas R.
Lichtarge, Olivier
Accurate Protein Structure Annotation through Competitive Diffusion of Enzymatic Functions over a Network of Local Evolutionary Similarities
title Accurate Protein Structure Annotation through Competitive Diffusion of Enzymatic Functions over a Network of Local Evolutionary Similarities
title_full Accurate Protein Structure Annotation through Competitive Diffusion of Enzymatic Functions over a Network of Local Evolutionary Similarities
title_fullStr Accurate Protein Structure Annotation through Competitive Diffusion of Enzymatic Functions over a Network of Local Evolutionary Similarities
title_full_unstemmed Accurate Protein Structure Annotation through Competitive Diffusion of Enzymatic Functions over a Network of Local Evolutionary Similarities
title_short Accurate Protein Structure Annotation through Competitive Diffusion of Enzymatic Functions over a Network of Local Evolutionary Similarities
title_sort accurate protein structure annotation through competitive diffusion of enzymatic functions over a network of local evolutionary similarities
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3001439/
https://www.ncbi.nlm.nih.gov/pubmed/21179190
http://dx.doi.org/10.1371/journal.pone.0014286
work_keys_str_mv AT vennereric accurateproteinstructureannotationthroughcompetitivediffusionofenzymaticfunctionsoveranetworkoflocalevolutionarysimilarities
AT lisewskiandreasmartin accurateproteinstructureannotationthroughcompetitivediffusionofenzymaticfunctionsoveranetworkoflocalevolutionarysimilarities
AT erdinserkan accurateproteinstructureannotationthroughcompetitivediffusionofenzymaticfunctionsoveranetworkoflocalevolutionarysimilarities
AT wardrmatthew accurateproteinstructureannotationthroughcompetitivediffusionofenzymaticfunctionsoveranetworkoflocalevolutionarysimilarities
AT aminshivasr accurateproteinstructureannotationthroughcompetitivediffusionofenzymaticfunctionsoveranetworkoflocalevolutionarysimilarities
AT lichtargeolivier accurateproteinstructureannotationthroughcompetitivediffusionofenzymaticfunctionsoveranetworkoflocalevolutionarysimilarities