Cargando…

Bayesian inference of protein conformational ensembles from limited structural data

Many proteins consist of folded domains connected by regions with higher flexibility. The details of the resulting conformational ensemble play a central role in controlling interactions between domains and with binding partners. Small-Angle Scattering (SAS) is well-suited to study the conformationa...

Descripción completa

Detalles Bibliográficos
Autores principales: Potrzebowski, Wojciech, Trewhella, Jill, Andre, Ingemar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6312354/
https://www.ncbi.nlm.nih.gov/pubmed/30557358
http://dx.doi.org/10.1371/journal.pcbi.1006641
_version_ 1783383768900304896
author Potrzebowski, Wojciech
Trewhella, Jill
Andre, Ingemar
author_facet Potrzebowski, Wojciech
Trewhella, Jill
Andre, Ingemar
author_sort Potrzebowski, Wojciech
collection PubMed
description Many proteins consist of folded domains connected by regions with higher flexibility. The details of the resulting conformational ensemble play a central role in controlling interactions between domains and with binding partners. Small-Angle Scattering (SAS) is well-suited to study the conformational states adopted by proteins in solution. However, analysis is complicated by the limited information content in SAS data and care must be taken to avoid constructing overly complex ensemble models and fitting to noise in the experimental data. To address these challenges, we developed a method based on Bayesian statistics that infers conformational ensembles from a structural library generated by all-atom Monte Carlo simulations. The first stage of the method involves a fast model selection based on variational Bayesian inference that maximizes the model evidence of the selected ensemble. This is followed by a complete Bayesian inference of population weights in the selected ensemble. Experiments with simulated ensembles demonstrate that model evidence is capable of identifying the correct ensemble and that correct number of ensemble members can be recovered up to high level of noise. Using experimental data, we demonstrate how the method can be extended to include data from Nuclear Magnetic Resonance (NMR) and structural energies of conformers extracted from the all-atom energy functions. We show that the data from SAXS, NMR chemical shifts and energies calculated from conformers can work synergistically to improve the definition of the conformational ensemble.
format Online
Article
Text
id pubmed-6312354
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-63123542019-01-08 Bayesian inference of protein conformational ensembles from limited structural data Potrzebowski, Wojciech Trewhella, Jill Andre, Ingemar PLoS Comput Biol Research Article Many proteins consist of folded domains connected by regions with higher flexibility. The details of the resulting conformational ensemble play a central role in controlling interactions between domains and with binding partners. Small-Angle Scattering (SAS) is well-suited to study the conformational states adopted by proteins in solution. However, analysis is complicated by the limited information content in SAS data and care must be taken to avoid constructing overly complex ensemble models and fitting to noise in the experimental data. To address these challenges, we developed a method based on Bayesian statistics that infers conformational ensembles from a structural library generated by all-atom Monte Carlo simulations. The first stage of the method involves a fast model selection based on variational Bayesian inference that maximizes the model evidence of the selected ensemble. This is followed by a complete Bayesian inference of population weights in the selected ensemble. Experiments with simulated ensembles demonstrate that model evidence is capable of identifying the correct ensemble and that correct number of ensemble members can be recovered up to high level of noise. Using experimental data, we demonstrate how the method can be extended to include data from Nuclear Magnetic Resonance (NMR) and structural energies of conformers extracted from the all-atom energy functions. We show that the data from SAXS, NMR chemical shifts and energies calculated from conformers can work synergistically to improve the definition of the conformational ensemble. Public Library of Science 2018-12-17 /pmc/articles/PMC6312354/ /pubmed/30557358 http://dx.doi.org/10.1371/journal.pcbi.1006641 Text en © 2018 Potrzebowski et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Potrzebowski, Wojciech
Trewhella, Jill
Andre, Ingemar
Bayesian inference of protein conformational ensembles from limited structural data
title Bayesian inference of protein conformational ensembles from limited structural data
title_full Bayesian inference of protein conformational ensembles from limited structural data
title_fullStr Bayesian inference of protein conformational ensembles from limited structural data
title_full_unstemmed Bayesian inference of protein conformational ensembles from limited structural data
title_short Bayesian inference of protein conformational ensembles from limited structural data
title_sort bayesian inference of protein conformational ensembles from limited structural data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6312354/
https://www.ncbi.nlm.nih.gov/pubmed/30557358
http://dx.doi.org/10.1371/journal.pcbi.1006641
work_keys_str_mv AT potrzebowskiwojciech bayesianinferenceofproteinconformationalensemblesfromlimitedstructuraldata
AT trewhellajill bayesianinferenceofproteinconformationalensemblesfromlimitedstructuraldata
AT andreingemar bayesianinferenceofproteinconformationalensemblesfromlimitedstructuraldata