Cargando…
Bayesian inference of protein conformational ensembles from limited structural data
Many proteins consist of folded domains connected by regions with higher flexibility. The details of the resulting conformational ensemble play a central role in controlling interactions between domains and with binding partners. Small-Angle Scattering (SAS) is well-suited to study the conformationa...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6312354/ https://www.ncbi.nlm.nih.gov/pubmed/30557358 http://dx.doi.org/10.1371/journal.pcbi.1006641 |
_version_ | 1783383768900304896 |
---|---|
author | Potrzebowski, Wojciech Trewhella, Jill Andre, Ingemar |
author_facet | Potrzebowski, Wojciech Trewhella, Jill Andre, Ingemar |
author_sort | Potrzebowski, Wojciech |
collection | PubMed |
description | Many proteins consist of folded domains connected by regions with higher flexibility. The details of the resulting conformational ensemble play a central role in controlling interactions between domains and with binding partners. Small-Angle Scattering (SAS) is well-suited to study the conformational states adopted by proteins in solution. However, analysis is complicated by the limited information content in SAS data and care must be taken to avoid constructing overly complex ensemble models and fitting to noise in the experimental data. To address these challenges, we developed a method based on Bayesian statistics that infers conformational ensembles from a structural library generated by all-atom Monte Carlo simulations. The first stage of the method involves a fast model selection based on variational Bayesian inference that maximizes the model evidence of the selected ensemble. This is followed by a complete Bayesian inference of population weights in the selected ensemble. Experiments with simulated ensembles demonstrate that model evidence is capable of identifying the correct ensemble and that correct number of ensemble members can be recovered up to high level of noise. Using experimental data, we demonstrate how the method can be extended to include data from Nuclear Magnetic Resonance (NMR) and structural energies of conformers extracted from the all-atom energy functions. We show that the data from SAXS, NMR chemical shifts and energies calculated from conformers can work synergistically to improve the definition of the conformational ensemble. |
format | Online Article Text |
id | pubmed-6312354 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-63123542019-01-08 Bayesian inference of protein conformational ensembles from limited structural data Potrzebowski, Wojciech Trewhella, Jill Andre, Ingemar PLoS Comput Biol Research Article Many proteins consist of folded domains connected by regions with higher flexibility. The details of the resulting conformational ensemble play a central role in controlling interactions between domains and with binding partners. Small-Angle Scattering (SAS) is well-suited to study the conformational states adopted by proteins in solution. However, analysis is complicated by the limited information content in SAS data and care must be taken to avoid constructing overly complex ensemble models and fitting to noise in the experimental data. To address these challenges, we developed a method based on Bayesian statistics that infers conformational ensembles from a structural library generated by all-atom Monte Carlo simulations. The first stage of the method involves a fast model selection based on variational Bayesian inference that maximizes the model evidence of the selected ensemble. This is followed by a complete Bayesian inference of population weights in the selected ensemble. Experiments with simulated ensembles demonstrate that model evidence is capable of identifying the correct ensemble and that correct number of ensemble members can be recovered up to high level of noise. Using experimental data, we demonstrate how the method can be extended to include data from Nuclear Magnetic Resonance (NMR) and structural energies of conformers extracted from the all-atom energy functions. We show that the data from SAXS, NMR chemical shifts and energies calculated from conformers can work synergistically to improve the definition of the conformational ensemble. Public Library of Science 2018-12-17 /pmc/articles/PMC6312354/ /pubmed/30557358 http://dx.doi.org/10.1371/journal.pcbi.1006641 Text en © 2018 Potrzebowski et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Potrzebowski, Wojciech Trewhella, Jill Andre, Ingemar Bayesian inference of protein conformational ensembles from limited structural data |
title | Bayesian inference of protein conformational ensembles from limited structural data |
title_full | Bayesian inference of protein conformational ensembles from limited structural data |
title_fullStr | Bayesian inference of protein conformational ensembles from limited structural data |
title_full_unstemmed | Bayesian inference of protein conformational ensembles from limited structural data |
title_short | Bayesian inference of protein conformational ensembles from limited structural data |
title_sort | bayesian inference of protein conformational ensembles from limited structural data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6312354/ https://www.ncbi.nlm.nih.gov/pubmed/30557358 http://dx.doi.org/10.1371/journal.pcbi.1006641 |
work_keys_str_mv | AT potrzebowskiwojciech bayesianinferenceofproteinconformationalensemblesfromlimitedstructuraldata AT trewhellajill bayesianinferenceofproteinconformationalensemblesfromlimitedstructuraldata AT andreingemar bayesianinferenceofproteinconformationalensemblesfromlimitedstructuraldata |