Cargando…

A Comprehensive Analysis of the Structure-Function Relationship in Proteins Based on Local Structure Similarity

BACKGROUND: Sequence similarity to characterized proteins provides testable functional hypotheses for less than 50% of the proteins identified by genome sequencing projects. With structural genomics it is believed that structural similarities may give functional hypotheses for many of the remaining...

Descripción completa

Detalles Bibliográficos
Autores principales: Hvidsten, Torgeir R., Lægreid, Astrid, Kryshtafovych, Andriy, Andersson, Gunnar, Fidelis, Krzysztof, Komorowski, Jan
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2705683/
https://www.ncbi.nlm.nih.gov/pubmed/19603073
http://dx.doi.org/10.1371/journal.pone.0006266
_version_ 1782169017789710336
author Hvidsten, Torgeir R.
Lægreid, Astrid
Kryshtafovych, Andriy
Andersson, Gunnar
Fidelis, Krzysztof
Komorowski, Jan
author_facet Hvidsten, Torgeir R.
Lægreid, Astrid
Kryshtafovych, Andriy
Andersson, Gunnar
Fidelis, Krzysztof
Komorowski, Jan
author_sort Hvidsten, Torgeir R.
collection PubMed
description BACKGROUND: Sequence similarity to characterized proteins provides testable functional hypotheses for less than 50% of the proteins identified by genome sequencing projects. With structural genomics it is believed that structural similarities may give functional hypotheses for many of the remaining proteins. METHODOLOGY/PRINCIPAL FINDINGS: We provide a systematic analysis of the structure-function relationship in proteins using the novel concept of local descriptors of protein structure. A local descriptor is a small substructure of a protein which includes both short- and long-range interactions. We employ a library of commonly reoccurring local descriptors general enough to assemble most existing protein structures. We then model the relationship between these local shapes and Gene Ontology using rule-based learning. Our IF-THEN rule model offers legible, high resolution descriptions that combine local substructures and is able to discriminate functions even for functionally versatile folds such as the frequently occurring TIM barrel and Rossmann fold. By evaluating the predictive performance of the model, we provide a comprehensive quantification of the structure-function relationship based only on local structure similarity. Our findings are, among others, that conserved structure is a stronger prerequisite for enzymatic activity than for binding specificity, and that structure-based predictions complement sequence-based predictions. The model is capable of generating correct hypotheses, as confirmed by a literature study, even when no significant sequence similarity to characterized proteins exists. CONCLUSIONS/SIGNIFICANCE: Our approach offers a new and complete description and quantification of the structure-function relationship in proteins. By demonstrating how our predictions offer higher sensitivity than using global structure, and complement the use of sequence, we show that the presented ideas could advance the development of meta-servers in function prediction.
format Text
id pubmed-2705683
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-27056832009-07-15 A Comprehensive Analysis of the Structure-Function Relationship in Proteins Based on Local Structure Similarity Hvidsten, Torgeir R. Lægreid, Astrid Kryshtafovych, Andriy Andersson, Gunnar Fidelis, Krzysztof Komorowski, Jan PLoS One Research Article BACKGROUND: Sequence similarity to characterized proteins provides testable functional hypotheses for less than 50% of the proteins identified by genome sequencing projects. With structural genomics it is believed that structural similarities may give functional hypotheses for many of the remaining proteins. METHODOLOGY/PRINCIPAL FINDINGS: We provide a systematic analysis of the structure-function relationship in proteins using the novel concept of local descriptors of protein structure. A local descriptor is a small substructure of a protein which includes both short- and long-range interactions. We employ a library of commonly reoccurring local descriptors general enough to assemble most existing protein structures. We then model the relationship between these local shapes and Gene Ontology using rule-based learning. Our IF-THEN rule model offers legible, high resolution descriptions that combine local substructures and is able to discriminate functions even for functionally versatile folds such as the frequently occurring TIM barrel and Rossmann fold. By evaluating the predictive performance of the model, we provide a comprehensive quantification of the structure-function relationship based only on local structure similarity. Our findings are, among others, that conserved structure is a stronger prerequisite for enzymatic activity than for binding specificity, and that structure-based predictions complement sequence-based predictions. The model is capable of generating correct hypotheses, as confirmed by a literature study, even when no significant sequence similarity to characterized proteins exists. CONCLUSIONS/SIGNIFICANCE: Our approach offers a new and complete description and quantification of the structure-function relationship in proteins. By demonstrating how our predictions offer higher sensitivity than using global structure, and complement the use of sequence, we show that the presented ideas could advance the development of meta-servers in function prediction. Public Library of Science 2009-07-15 /pmc/articles/PMC2705683/ /pubmed/19603073 http://dx.doi.org/10.1371/journal.pone.0006266 Text en This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. https://creativecommons.org/publicdomain/zero/1.0/ This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration, which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose.
spellingShingle Research Article
Hvidsten, Torgeir R.
Lægreid, Astrid
Kryshtafovych, Andriy
Andersson, Gunnar
Fidelis, Krzysztof
Komorowski, Jan
A Comprehensive Analysis of the Structure-Function Relationship in Proteins Based on Local Structure Similarity
title A Comprehensive Analysis of the Structure-Function Relationship in Proteins Based on Local Structure Similarity
title_full A Comprehensive Analysis of the Structure-Function Relationship in Proteins Based on Local Structure Similarity
title_fullStr A Comprehensive Analysis of the Structure-Function Relationship in Proteins Based on Local Structure Similarity
title_full_unstemmed A Comprehensive Analysis of the Structure-Function Relationship in Proteins Based on Local Structure Similarity
title_short A Comprehensive Analysis of the Structure-Function Relationship in Proteins Based on Local Structure Similarity
title_sort comprehensive analysis of the structure-function relationship in proteins based on local structure similarity
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2705683/
https://www.ncbi.nlm.nih.gov/pubmed/19603073
http://dx.doi.org/10.1371/journal.pone.0006266
work_keys_str_mv AT hvidstentorgeirr acomprehensiveanalysisofthestructurefunctionrelationshipinproteinsbasedonlocalstructuresimilarity
AT lægreidastrid acomprehensiveanalysisofthestructurefunctionrelationshipinproteinsbasedonlocalstructuresimilarity
AT kryshtafovychandriy acomprehensiveanalysisofthestructurefunctionrelationshipinproteinsbasedonlocalstructuresimilarity
AT anderssongunnar acomprehensiveanalysisofthestructurefunctionrelationshipinproteinsbasedonlocalstructuresimilarity
AT fideliskrzysztof acomprehensiveanalysisofthestructurefunctionrelationshipinproteinsbasedonlocalstructuresimilarity
AT komorowskijan acomprehensiveanalysisofthestructurefunctionrelationshipinproteinsbasedonlocalstructuresimilarity
AT hvidstentorgeirr comprehensiveanalysisofthestructurefunctionrelationshipinproteinsbasedonlocalstructuresimilarity
AT lægreidastrid comprehensiveanalysisofthestructurefunctionrelationshipinproteinsbasedonlocalstructuresimilarity
AT kryshtafovychandriy comprehensiveanalysisofthestructurefunctionrelationshipinproteinsbasedonlocalstructuresimilarity
AT anderssongunnar comprehensiveanalysisofthestructurefunctionrelationshipinproteinsbasedonlocalstructuresimilarity
AT fideliskrzysztof comprehensiveanalysisofthestructurefunctionrelationshipinproteinsbasedonlocalstructuresimilarity
AT komorowskijan comprehensiveanalysisofthestructurefunctionrelationshipinproteinsbasedonlocalstructuresimilarity