Cargando…
A Comprehensive Analysis of the Structure-Function Relationship in Proteins Based on Local Structure Similarity
BACKGROUND: Sequence similarity to characterized proteins provides testable functional hypotheses for less than 50% of the proteins identified by genome sequencing projects. With structural genomics it is believed that structural similarities may give functional hypotheses for many of the remaining...
Autores principales: | , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2705683/ https://www.ncbi.nlm.nih.gov/pubmed/19603073 http://dx.doi.org/10.1371/journal.pone.0006266 |
_version_ | 1782169017789710336 |
---|---|
author | Hvidsten, Torgeir R. Lægreid, Astrid Kryshtafovych, Andriy Andersson, Gunnar Fidelis, Krzysztof Komorowski, Jan |
author_facet | Hvidsten, Torgeir R. Lægreid, Astrid Kryshtafovych, Andriy Andersson, Gunnar Fidelis, Krzysztof Komorowski, Jan |
author_sort | Hvidsten, Torgeir R. |
collection | PubMed |
description | BACKGROUND: Sequence similarity to characterized proteins provides testable functional hypotheses for less than 50% of the proteins identified by genome sequencing projects. With structural genomics it is believed that structural similarities may give functional hypotheses for many of the remaining proteins. METHODOLOGY/PRINCIPAL FINDINGS: We provide a systematic analysis of the structure-function relationship in proteins using the novel concept of local descriptors of protein structure. A local descriptor is a small substructure of a protein which includes both short- and long-range interactions. We employ a library of commonly reoccurring local descriptors general enough to assemble most existing protein structures. We then model the relationship between these local shapes and Gene Ontology using rule-based learning. Our IF-THEN rule model offers legible, high resolution descriptions that combine local substructures and is able to discriminate functions even for functionally versatile folds such as the frequently occurring TIM barrel and Rossmann fold. By evaluating the predictive performance of the model, we provide a comprehensive quantification of the structure-function relationship based only on local structure similarity. Our findings are, among others, that conserved structure is a stronger prerequisite for enzymatic activity than for binding specificity, and that structure-based predictions complement sequence-based predictions. The model is capable of generating correct hypotheses, as confirmed by a literature study, even when no significant sequence similarity to characterized proteins exists. CONCLUSIONS/SIGNIFICANCE: Our approach offers a new and complete description and quantification of the structure-function relationship in proteins. By demonstrating how our predictions offer higher sensitivity than using global structure, and complement the use of sequence, we show that the presented ideas could advance the development of meta-servers in function prediction. |
format | Text |
id | pubmed-2705683 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-27056832009-07-15 A Comprehensive Analysis of the Structure-Function Relationship in Proteins Based on Local Structure Similarity Hvidsten, Torgeir R. Lægreid, Astrid Kryshtafovych, Andriy Andersson, Gunnar Fidelis, Krzysztof Komorowski, Jan PLoS One Research Article BACKGROUND: Sequence similarity to characterized proteins provides testable functional hypotheses for less than 50% of the proteins identified by genome sequencing projects. With structural genomics it is believed that structural similarities may give functional hypotheses for many of the remaining proteins. METHODOLOGY/PRINCIPAL FINDINGS: We provide a systematic analysis of the structure-function relationship in proteins using the novel concept of local descriptors of protein structure. A local descriptor is a small substructure of a protein which includes both short- and long-range interactions. We employ a library of commonly reoccurring local descriptors general enough to assemble most existing protein structures. We then model the relationship between these local shapes and Gene Ontology using rule-based learning. Our IF-THEN rule model offers legible, high resolution descriptions that combine local substructures and is able to discriminate functions even for functionally versatile folds such as the frequently occurring TIM barrel and Rossmann fold. By evaluating the predictive performance of the model, we provide a comprehensive quantification of the structure-function relationship based only on local structure similarity. Our findings are, among others, that conserved structure is a stronger prerequisite for enzymatic activity than for binding specificity, and that structure-based predictions complement sequence-based predictions. The model is capable of generating correct hypotheses, as confirmed by a literature study, even when no significant sequence similarity to characterized proteins exists. CONCLUSIONS/SIGNIFICANCE: Our approach offers a new and complete description and quantification of the structure-function relationship in proteins. By demonstrating how our predictions offer higher sensitivity than using global structure, and complement the use of sequence, we show that the presented ideas could advance the development of meta-servers in function prediction. Public Library of Science 2009-07-15 /pmc/articles/PMC2705683/ /pubmed/19603073 http://dx.doi.org/10.1371/journal.pone.0006266 Text en This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. https://creativecommons.org/publicdomain/zero/1.0/ This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration, which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. |
spellingShingle | Research Article Hvidsten, Torgeir R. Lægreid, Astrid Kryshtafovych, Andriy Andersson, Gunnar Fidelis, Krzysztof Komorowski, Jan A Comprehensive Analysis of the Structure-Function Relationship in Proteins Based on Local Structure Similarity |
title | A Comprehensive Analysis of the Structure-Function Relationship in Proteins Based on Local Structure Similarity |
title_full | A Comprehensive Analysis of the Structure-Function Relationship in Proteins Based on Local Structure Similarity |
title_fullStr | A Comprehensive Analysis of the Structure-Function Relationship in Proteins Based on Local Structure Similarity |
title_full_unstemmed | A Comprehensive Analysis of the Structure-Function Relationship in Proteins Based on Local Structure Similarity |
title_short | A Comprehensive Analysis of the Structure-Function Relationship in Proteins Based on Local Structure Similarity |
title_sort | comprehensive analysis of the structure-function relationship in proteins based on local structure similarity |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2705683/ https://www.ncbi.nlm.nih.gov/pubmed/19603073 http://dx.doi.org/10.1371/journal.pone.0006266 |
work_keys_str_mv | AT hvidstentorgeirr acomprehensiveanalysisofthestructurefunctionrelationshipinproteinsbasedonlocalstructuresimilarity AT lægreidastrid acomprehensiveanalysisofthestructurefunctionrelationshipinproteinsbasedonlocalstructuresimilarity AT kryshtafovychandriy acomprehensiveanalysisofthestructurefunctionrelationshipinproteinsbasedonlocalstructuresimilarity AT anderssongunnar acomprehensiveanalysisofthestructurefunctionrelationshipinproteinsbasedonlocalstructuresimilarity AT fideliskrzysztof acomprehensiveanalysisofthestructurefunctionrelationshipinproteinsbasedonlocalstructuresimilarity AT komorowskijan acomprehensiveanalysisofthestructurefunctionrelationshipinproteinsbasedonlocalstructuresimilarity AT hvidstentorgeirr comprehensiveanalysisofthestructurefunctionrelationshipinproteinsbasedonlocalstructuresimilarity AT lægreidastrid comprehensiveanalysisofthestructurefunctionrelationshipinproteinsbasedonlocalstructuresimilarity AT kryshtafovychandriy comprehensiveanalysisofthestructurefunctionrelationshipinproteinsbasedonlocalstructuresimilarity AT anderssongunnar comprehensiveanalysisofthestructurefunctionrelationshipinproteinsbasedonlocalstructuresimilarity AT fideliskrzysztof comprehensiveanalysisofthestructurefunctionrelationshipinproteinsbasedonlocalstructuresimilarity AT komorowskijan comprehensiveanalysisofthestructurefunctionrelationshipinproteinsbasedonlocalstructuresimilarity |