Cargando…

Universal Architectural Concepts Underlying Protein Folding Patterns

What is the architectural “basis set” of the observed universe of protein structures? Using information-theoretic inference, we answer this question with a dictionary of 1,493 substructures—called concepts—typically at a subdomain level, based on an unbiased subset of known protein structures. Each...

Descripción completa

Detalles Bibliográficos
Autores principales: Konagurthu, Arun S., Subramanian, Ramanan, Allison, Lloyd, Abramson, David, Stuckey, Peter J., Garcia de la Banda, Maria, Lesk, Arthur M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8120156/
https://www.ncbi.nlm.nih.gov/pubmed/33996891
http://dx.doi.org/10.3389/fmolb.2020.612920
_version_ 1783691996844523520
author Konagurthu, Arun S.
Subramanian, Ramanan
Allison, Lloyd
Abramson, David
Stuckey, Peter J.
Garcia de la Banda, Maria
Lesk, Arthur M.
author_facet Konagurthu, Arun S.
Subramanian, Ramanan
Allison, Lloyd
Abramson, David
Stuckey, Peter J.
Garcia de la Banda, Maria
Lesk, Arthur M.
author_sort Konagurthu, Arun S.
collection PubMed
description What is the architectural “basis set” of the observed universe of protein structures? Using information-theoretic inference, we answer this question with a dictionary of 1,493 substructures—called concepts—typically at a subdomain level, based on an unbiased subset of known protein structures. Each concept represents a topologically conserved assembly of helices and strands that make contact. Any protein structure can be dissected into instances of concepts from this dictionary. We dissected the Protein Data Bank and completely inventoried all the concept instances. This yields many insights, including correlations between concepts and catalytic activities or binding sites, useful for rational drug design; local amino-acid sequence–structure correlations, useful for ab initio structure prediction methods; and information supporting the recognition and exploration of evolutionary relationships, useful for structural studies. An interactive site, Proçodic, at http://lcb.infotech.monash.edu.au/prosodic (click), provides access to and navigation of the entire dictionary of concepts and their usages, and all associated information. This report is part of a continuing programme with the goal of elucidating fundamental principles of protein architecture, in the spirit of the work of Cyrus Chothia.
format Online
Article
Text
id pubmed-8120156
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-81201562021-05-15 Universal Architectural Concepts Underlying Protein Folding Patterns Konagurthu, Arun S. Subramanian, Ramanan Allison, Lloyd Abramson, David Stuckey, Peter J. Garcia de la Banda, Maria Lesk, Arthur M. Front Mol Biosci Molecular Biosciences What is the architectural “basis set” of the observed universe of protein structures? Using information-theoretic inference, we answer this question with a dictionary of 1,493 substructures—called concepts—typically at a subdomain level, based on an unbiased subset of known protein structures. Each concept represents a topologically conserved assembly of helices and strands that make contact. Any protein structure can be dissected into instances of concepts from this dictionary. We dissected the Protein Data Bank and completely inventoried all the concept instances. This yields many insights, including correlations between concepts and catalytic activities or binding sites, useful for rational drug design; local amino-acid sequence–structure correlations, useful for ab initio structure prediction methods; and information supporting the recognition and exploration of evolutionary relationships, useful for structural studies. An interactive site, Proçodic, at http://lcb.infotech.monash.edu.au/prosodic (click), provides access to and navigation of the entire dictionary of concepts and their usages, and all associated information. This report is part of a continuing programme with the goal of elucidating fundamental principles of protein architecture, in the spirit of the work of Cyrus Chothia. Frontiers Media S.A. 2021-04-30 /pmc/articles/PMC8120156/ /pubmed/33996891 http://dx.doi.org/10.3389/fmolb.2020.612920 Text en Copyright © 2021 Konagurthu, Subramanian, Allison, Abramson, Stuckey, Garcia de la Banda and Lesk. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Molecular Biosciences
Konagurthu, Arun S.
Subramanian, Ramanan
Allison, Lloyd
Abramson, David
Stuckey, Peter J.
Garcia de la Banda, Maria
Lesk, Arthur M.
Universal Architectural Concepts Underlying Protein Folding Patterns
title Universal Architectural Concepts Underlying Protein Folding Patterns
title_full Universal Architectural Concepts Underlying Protein Folding Patterns
title_fullStr Universal Architectural Concepts Underlying Protein Folding Patterns
title_full_unstemmed Universal Architectural Concepts Underlying Protein Folding Patterns
title_short Universal Architectural Concepts Underlying Protein Folding Patterns
title_sort universal architectural concepts underlying protein folding patterns
topic Molecular Biosciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8120156/
https://www.ncbi.nlm.nih.gov/pubmed/33996891
http://dx.doi.org/10.3389/fmolb.2020.612920
work_keys_str_mv AT konagurthuaruns universalarchitecturalconceptsunderlyingproteinfoldingpatterns
AT subramanianramanan universalarchitecturalconceptsunderlyingproteinfoldingpatterns
AT allisonlloyd universalarchitecturalconceptsunderlyingproteinfoldingpatterns
AT abramsondavid universalarchitecturalconceptsunderlyingproteinfoldingpatterns
AT stuckeypeterj universalarchitecturalconceptsunderlyingproteinfoldingpatterns
AT garciadelabandamaria universalarchitecturalconceptsunderlyingproteinfoldingpatterns
AT leskarthurm universalarchitecturalconceptsunderlyingproteinfoldingpatterns