Cargando…

Bayesian inference of protein structure from chemical shift data

Protein chemical shifts are routinely used to augment molecular mechanics force fields in protein structure simulations, with weights of the chemical shift restraints determined empirically. These weights, however, might not be an optimal descriptor of a given protein structure and predictive model,...

Descripción completa

Detalles Bibliográficos
Autores principales: Bratholm, Lars A., Christensen, Anders S., Hamelryck, Thomas, Jensen, Jan H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4375973/
https://www.ncbi.nlm.nih.gov/pubmed/25825683
http://dx.doi.org/10.7717/peerj.861
_version_ 1782363658569908224
author Bratholm, Lars A.
Christensen, Anders S.
Hamelryck, Thomas
Jensen, Jan H.
author_facet Bratholm, Lars A.
Christensen, Anders S.
Hamelryck, Thomas
Jensen, Jan H.
author_sort Bratholm, Lars A.
collection PubMed
description Protein chemical shifts are routinely used to augment molecular mechanics force fields in protein structure simulations, with weights of the chemical shift restraints determined empirically. These weights, however, might not be an optimal descriptor of a given protein structure and predictive model, and a bias is introduced which might result in incorrect structures. In the inferential structure determination framework, both the unknown structure and the disagreement between experimental and back-calculated data are formulated as a joint probability distribution, thus utilizing the full information content of the data. Here, we present the formulation of such a probability distribution where the error in chemical shift prediction is described by either a Gaussian or Cauchy distribution. The methodology is demonstrated and compared to a set of empirically weighted potentials through Markov chain Monte Carlo simulations of three small proteins (ENHD, Protein G and the SMN Tudor Domain) using the PROFASI force field and the chemical shift predictor CamShift. Using a clustering-criterion for identifying the best structure, together with the addition of a solvent exposure scoring term, the simulations suggests that sampling both the structure and the uncertainties in chemical shift prediction leads more accurate structures compared to conventional methods using empirical determined weights. The Cauchy distribution, using either sampled uncertainties or predetermined weights, did, however, result in overall better convergence to the native fold, suggesting that both types of distribution might be useful in different aspects of the protein structure prediction.
format Online
Article
Text
id pubmed-4375973
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-43759732015-03-30 Bayesian inference of protein structure from chemical shift data Bratholm, Lars A. Christensen, Anders S. Hamelryck, Thomas Jensen, Jan H. PeerJ Biochemistry Protein chemical shifts are routinely used to augment molecular mechanics force fields in protein structure simulations, with weights of the chemical shift restraints determined empirically. These weights, however, might not be an optimal descriptor of a given protein structure and predictive model, and a bias is introduced which might result in incorrect structures. In the inferential structure determination framework, both the unknown structure and the disagreement between experimental and back-calculated data are formulated as a joint probability distribution, thus utilizing the full information content of the data. Here, we present the formulation of such a probability distribution where the error in chemical shift prediction is described by either a Gaussian or Cauchy distribution. The methodology is demonstrated and compared to a set of empirically weighted potentials through Markov chain Monte Carlo simulations of three small proteins (ENHD, Protein G and the SMN Tudor Domain) using the PROFASI force field and the chemical shift predictor CamShift. Using a clustering-criterion for identifying the best structure, together with the addition of a solvent exposure scoring term, the simulations suggests that sampling both the structure and the uncertainties in chemical shift prediction leads more accurate structures compared to conventional methods using empirical determined weights. The Cauchy distribution, using either sampled uncertainties or predetermined weights, did, however, result in overall better convergence to the native fold, suggesting that both types of distribution might be useful in different aspects of the protein structure prediction. PeerJ Inc. 2015-03-24 /pmc/articles/PMC4375973/ /pubmed/25825683 http://dx.doi.org/10.7717/peerj.861 Text en © 2015 Bratholm et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Biochemistry
Bratholm, Lars A.
Christensen, Anders S.
Hamelryck, Thomas
Jensen, Jan H.
Bayesian inference of protein structure from chemical shift data
title Bayesian inference of protein structure from chemical shift data
title_full Bayesian inference of protein structure from chemical shift data
title_fullStr Bayesian inference of protein structure from chemical shift data
title_full_unstemmed Bayesian inference of protein structure from chemical shift data
title_short Bayesian inference of protein structure from chemical shift data
title_sort bayesian inference of protein structure from chemical shift data
topic Biochemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4375973/
https://www.ncbi.nlm.nih.gov/pubmed/25825683
http://dx.doi.org/10.7717/peerj.861
work_keys_str_mv AT bratholmlarsa bayesianinferenceofproteinstructurefromchemicalshiftdata
AT christensenanderss bayesianinferenceofproteinstructurefromchemicalshiftdata
AT hamelryckthomas bayesianinferenceofproteinstructurefromchemicalshiftdata
AT jensenjanh bayesianinferenceofproteinstructurefromchemicalshiftdata