Cargando…
Universal Architectural Concepts Underlying Protein Folding Patterns
What is the architectural “basis set” of the observed universe of protein structures? Using information-theoretic inference, we answer this question with a dictionary of 1,493 substructures—called concepts—typically at a subdomain level, based on an unbiased subset of known protein structures. Each...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8120156/ https://www.ncbi.nlm.nih.gov/pubmed/33996891 http://dx.doi.org/10.3389/fmolb.2020.612920 |
_version_ | 1783691996844523520 |
---|---|
author | Konagurthu, Arun S. Subramanian, Ramanan Allison, Lloyd Abramson, David Stuckey, Peter J. Garcia de la Banda, Maria Lesk, Arthur M. |
author_facet | Konagurthu, Arun S. Subramanian, Ramanan Allison, Lloyd Abramson, David Stuckey, Peter J. Garcia de la Banda, Maria Lesk, Arthur M. |
author_sort | Konagurthu, Arun S. |
collection | PubMed |
description | What is the architectural “basis set” of the observed universe of protein structures? Using information-theoretic inference, we answer this question with a dictionary of 1,493 substructures—called concepts—typically at a subdomain level, based on an unbiased subset of known protein structures. Each concept represents a topologically conserved assembly of helices and strands that make contact. Any protein structure can be dissected into instances of concepts from this dictionary. We dissected the Protein Data Bank and completely inventoried all the concept instances. This yields many insights, including correlations between concepts and catalytic activities or binding sites, useful for rational drug design; local amino-acid sequence–structure correlations, useful for ab initio structure prediction methods; and information supporting the recognition and exploration of evolutionary relationships, useful for structural studies. An interactive site, Proçodic, at http://lcb.infotech.monash.edu.au/prosodic (click), provides access to and navigation of the entire dictionary of concepts and their usages, and all associated information. This report is part of a continuing programme with the goal of elucidating fundamental principles of protein architecture, in the spirit of the work of Cyrus Chothia. |
format | Online Article Text |
id | pubmed-8120156 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-81201562021-05-15 Universal Architectural Concepts Underlying Protein Folding Patterns Konagurthu, Arun S. Subramanian, Ramanan Allison, Lloyd Abramson, David Stuckey, Peter J. Garcia de la Banda, Maria Lesk, Arthur M. Front Mol Biosci Molecular Biosciences What is the architectural “basis set” of the observed universe of protein structures? Using information-theoretic inference, we answer this question with a dictionary of 1,493 substructures—called concepts—typically at a subdomain level, based on an unbiased subset of known protein structures. Each concept represents a topologically conserved assembly of helices and strands that make contact. Any protein structure can be dissected into instances of concepts from this dictionary. We dissected the Protein Data Bank and completely inventoried all the concept instances. This yields many insights, including correlations between concepts and catalytic activities or binding sites, useful for rational drug design; local amino-acid sequence–structure correlations, useful for ab initio structure prediction methods; and information supporting the recognition and exploration of evolutionary relationships, useful for structural studies. An interactive site, Proçodic, at http://lcb.infotech.monash.edu.au/prosodic (click), provides access to and navigation of the entire dictionary of concepts and their usages, and all associated information. This report is part of a continuing programme with the goal of elucidating fundamental principles of protein architecture, in the spirit of the work of Cyrus Chothia. Frontiers Media S.A. 2021-04-30 /pmc/articles/PMC8120156/ /pubmed/33996891 http://dx.doi.org/10.3389/fmolb.2020.612920 Text en Copyright © 2021 Konagurthu, Subramanian, Allison, Abramson, Stuckey, Garcia de la Banda and Lesk. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Molecular Biosciences Konagurthu, Arun S. Subramanian, Ramanan Allison, Lloyd Abramson, David Stuckey, Peter J. Garcia de la Banda, Maria Lesk, Arthur M. Universal Architectural Concepts Underlying Protein Folding Patterns |
title | Universal Architectural Concepts Underlying Protein Folding Patterns |
title_full | Universal Architectural Concepts Underlying Protein Folding Patterns |
title_fullStr | Universal Architectural Concepts Underlying Protein Folding Patterns |
title_full_unstemmed | Universal Architectural Concepts Underlying Protein Folding Patterns |
title_short | Universal Architectural Concepts Underlying Protein Folding Patterns |
title_sort | universal architectural concepts underlying protein folding patterns |
topic | Molecular Biosciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8120156/ https://www.ncbi.nlm.nih.gov/pubmed/33996891 http://dx.doi.org/10.3389/fmolb.2020.612920 |
work_keys_str_mv | AT konagurthuaruns universalarchitecturalconceptsunderlyingproteinfoldingpatterns AT subramanianramanan universalarchitecturalconceptsunderlyingproteinfoldingpatterns AT allisonlloyd universalarchitecturalconceptsunderlyingproteinfoldingpatterns AT abramsondavid universalarchitecturalconceptsunderlyingproteinfoldingpatterns AT stuckeypeterj universalarchitecturalconceptsunderlyingproteinfoldingpatterns AT garciadelabandamaria universalarchitecturalconceptsunderlyingproteinfoldingpatterns AT leskarthurm universalarchitecturalconceptsunderlyingproteinfoldingpatterns |