Cargando…

Mapping the distribution of packing topologies within protein interiors shows predominant preference for specific packing motifs

BACKGROUND: Mapping protein primary sequences to their three dimensional folds referred to as the 'second genetic code' remains an unsolved scientific problem. A crucial part of the problem concerns the geometrical specificity in side chain association leading to densely packed protein cor...

Descripción completa

Detalles Bibliográficos
Autores principales: Basu, Sankar, Bhattacharyya, Dhananjay, Banerjee, Rahul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3123238/
https://www.ncbi.nlm.nih.gov/pubmed/21605466
http://dx.doi.org/10.1186/1471-2105-12-195
_version_ 1782206951351910400
author Basu, Sankar
Bhattacharyya, Dhananjay
Banerjee, Rahul
author_facet Basu, Sankar
Bhattacharyya, Dhananjay
Banerjee, Rahul
author_sort Basu, Sankar
collection PubMed
description BACKGROUND: Mapping protein primary sequences to their three dimensional folds referred to as the 'second genetic code' remains an unsolved scientific problem. A crucial part of the problem concerns the geometrical specificity in side chain association leading to densely packed protein cores, a hallmark of correctly folded native structures. Thus, any model of packing within proteins should constitute an indispensable component of protein folding and design. RESULTS: In this study an attempt has been made to find, characterize and classify recurring patterns in the packing of side chain atoms within a protein which sustains its native fold. The interaction of side chain atoms within the protein core has been represented as a contact network based on the surface complementarity and overlap between associating side chain surfaces. Some network topologies definitely appear to be preferred and they have been termed 'packing motifs', analogous to super secondary structures in proteins. Study of the distribution of these motifs reveals the ubiquitous presence of typical smaller graphs, which appear to get linked or coalesce to give larger graphs, reminiscent of the nucleation-condensation model in protein folding. One such frequently occurring motif, also envisaged as the unit of clustering, the three residue clique was invariably found in regions of dense packing. Finally, topological measures based on surface contact networks appeared to be effective in discriminating sequences native to a specific fold amongst a set of decoys. CONCLUSIONS: Out of innumerable topological possibilities, only a finite number of specific packing motifs are actually realized in proteins. This small number of motifs could serve as a basis set in the construction of larger networks. Of these, the triplet clique exhibits distinct preference both in terms of composition and geometry.
format Online
Article
Text
id pubmed-3123238
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31232382011-06-25 Mapping the distribution of packing topologies within protein interiors shows predominant preference for specific packing motifs Basu, Sankar Bhattacharyya, Dhananjay Banerjee, Rahul BMC Bioinformatics Research Article BACKGROUND: Mapping protein primary sequences to their three dimensional folds referred to as the 'second genetic code' remains an unsolved scientific problem. A crucial part of the problem concerns the geometrical specificity in side chain association leading to densely packed protein cores, a hallmark of correctly folded native structures. Thus, any model of packing within proteins should constitute an indispensable component of protein folding and design. RESULTS: In this study an attempt has been made to find, characterize and classify recurring patterns in the packing of side chain atoms within a protein which sustains its native fold. The interaction of side chain atoms within the protein core has been represented as a contact network based on the surface complementarity and overlap between associating side chain surfaces. Some network topologies definitely appear to be preferred and they have been termed 'packing motifs', analogous to super secondary structures in proteins. Study of the distribution of these motifs reveals the ubiquitous presence of typical smaller graphs, which appear to get linked or coalesce to give larger graphs, reminiscent of the nucleation-condensation model in protein folding. One such frequently occurring motif, also envisaged as the unit of clustering, the three residue clique was invariably found in regions of dense packing. Finally, topological measures based on surface contact networks appeared to be effective in discriminating sequences native to a specific fold amongst a set of decoys. CONCLUSIONS: Out of innumerable topological possibilities, only a finite number of specific packing motifs are actually realized in proteins. This small number of motifs could serve as a basis set in the construction of larger networks. Of these, the triplet clique exhibits distinct preference both in terms of composition and geometry. BioMed Central 2011-05-24 /pmc/articles/PMC3123238/ /pubmed/21605466 http://dx.doi.org/10.1186/1471-2105-12-195 Text en Copyright ©2011 Basu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Basu, Sankar
Bhattacharyya, Dhananjay
Banerjee, Rahul
Mapping the distribution of packing topologies within protein interiors shows predominant preference for specific packing motifs
title Mapping the distribution of packing topologies within protein interiors shows predominant preference for specific packing motifs
title_full Mapping the distribution of packing topologies within protein interiors shows predominant preference for specific packing motifs
title_fullStr Mapping the distribution of packing topologies within protein interiors shows predominant preference for specific packing motifs
title_full_unstemmed Mapping the distribution of packing topologies within protein interiors shows predominant preference for specific packing motifs
title_short Mapping the distribution of packing topologies within protein interiors shows predominant preference for specific packing motifs
title_sort mapping the distribution of packing topologies within protein interiors shows predominant preference for specific packing motifs
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3123238/
https://www.ncbi.nlm.nih.gov/pubmed/21605466
http://dx.doi.org/10.1186/1471-2105-12-195
work_keys_str_mv AT basusankar mappingthedistributionofpackingtopologieswithinproteininteriorsshowspredominantpreferenceforspecificpackingmotifs
AT bhattacharyyadhananjay mappingthedistributionofpackingtopologieswithinproteininteriorsshowspredominantpreferenceforspecificpackingmotifs
AT banerjeerahul mappingthedistributionofpackingtopologieswithinproteininteriorsshowspredominantpreferenceforspecificpackingmotifs