Cargando…

Clustering and percolation in protein loop structures

BACKGROUND: High precision protein loop modelling remains a challenge, both in template based and template independent approaches to protein structure prediction. METHOD: We introduce the concepts of protein loop clustering and percolation, to develop a quantitative approach to systematically classi...

Descripción completa

Detalles Bibliográficos
Autores principales: Peng, Xubiao, He, Jianfeng, Niemi, Antti J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4625449/
https://www.ncbi.nlm.nih.gov/pubmed/26510704
http://dx.doi.org/10.1186/s12900-015-0049-x
_version_ 1782397977173688320
author Peng, Xubiao
He, Jianfeng
Niemi, Antti J.
author_facet Peng, Xubiao
He, Jianfeng
Niemi, Antti J.
author_sort Peng, Xubiao
collection PubMed
description BACKGROUND: High precision protein loop modelling remains a challenge, both in template based and template independent approaches to protein structure prediction. METHOD: We introduce the concepts of protein loop clustering and percolation, to develop a quantitative approach to systematically classify the modular building blocks of loops in crystallographic folded proteins. These fragments are all different parameterisations of a unique kink solution to a generalised discrete nonlinear Schrödinger (DNLS) equation. Accordingly, the fragments are also local energy minima of the ensuing energy function. RESULTS: We show how the loop fragments cover practically all ultrahigh resolution crystallographic protein structures in Protein Data Bank (PDB), with a 0.2 Ångström root-mean-square (RMS) precision. We find that no more than 12 different loop fragments are needed, to describe around 38 % of ultrahigh resolution loops in PDB. But there is also a large number of loop fragments that are either unique, or very rare, and examples of unique fragments are found even in the structure of a myoglobin. CONCLUSIONS: Protein loops are built in a modular fashion. The loops are composed of fragments that can be modelled by the kink of the DNLS equation. The majority of loop fragments are also common, which are shared by many proteins. These common fragments are probably important for supporting the overall protein conformation. But there are also several fragments that are either unique to a given protein, or very rare. Such fragments are probably related to the function of the protein. Furthermore, we have found that the amino acid sequence does not determine the structure in a unique fashion. There are many examples of loop fragments with an identical amino acid sequence, but with a very different structure. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12900-015-0049-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4625449
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46254492015-10-30 Clustering and percolation in protein loop structures Peng, Xubiao He, Jianfeng Niemi, Antti J. BMC Struct Biol Research Article BACKGROUND: High precision protein loop modelling remains a challenge, both in template based and template independent approaches to protein structure prediction. METHOD: We introduce the concepts of protein loop clustering and percolation, to develop a quantitative approach to systematically classify the modular building blocks of loops in crystallographic folded proteins. These fragments are all different parameterisations of a unique kink solution to a generalised discrete nonlinear Schrödinger (DNLS) equation. Accordingly, the fragments are also local energy minima of the ensuing energy function. RESULTS: We show how the loop fragments cover practically all ultrahigh resolution crystallographic protein structures in Protein Data Bank (PDB), with a 0.2 Ångström root-mean-square (RMS) precision. We find that no more than 12 different loop fragments are needed, to describe around 38 % of ultrahigh resolution loops in PDB. But there is also a large number of loop fragments that are either unique, or very rare, and examples of unique fragments are found even in the structure of a myoglobin. CONCLUSIONS: Protein loops are built in a modular fashion. The loops are composed of fragments that can be modelled by the kink of the DNLS equation. The majority of loop fragments are also common, which are shared by many proteins. These common fragments are probably important for supporting the overall protein conformation. But there are also several fragments that are either unique to a given protein, or very rare. Such fragments are probably related to the function of the protein. Furthermore, we have found that the amino acid sequence does not determine the structure in a unique fashion. There are many examples of loop fragments with an identical amino acid sequence, but with a very different structure. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12900-015-0049-x) contains supplementary material, which is available to authorized users. BioMed Central 2015-10-29 /pmc/articles/PMC4625449/ /pubmed/26510704 http://dx.doi.org/10.1186/s12900-015-0049-x Text en © Peng et al. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Peng, Xubiao
He, Jianfeng
Niemi, Antti J.
Clustering and percolation in protein loop structures
title Clustering and percolation in protein loop structures
title_full Clustering and percolation in protein loop structures
title_fullStr Clustering and percolation in protein loop structures
title_full_unstemmed Clustering and percolation in protein loop structures
title_short Clustering and percolation in protein loop structures
title_sort clustering and percolation in protein loop structures
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4625449/
https://www.ncbi.nlm.nih.gov/pubmed/26510704
http://dx.doi.org/10.1186/s12900-015-0049-x
work_keys_str_mv AT pengxubiao clusteringandpercolationinproteinloopstructures
AT hejianfeng clusteringandpercolationinproteinloopstructures
AT niemianttij clusteringandpercolationinproteinloopstructures