Cargando…

Trajectory-based training enables protein simulations with accurate folding and Boltzmann ensembles in cpu-hours

An ongoing challenge in protein chemistry is to identify the underlying interaction energies that capture protein dynamics. The traditional trade-off in biomolecular simulation between accuracy and computational efficiency is predicated on the assumption that detailed force fields are typically well...

Descripción completa

Detalles Bibliográficos
Autores principales: Jumper, John M., Faruk, Nabil F., Freed, Karl F., Sosnick, Tobin R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6307714/
https://www.ncbi.nlm.nih.gov/pubmed/30589834
http://dx.doi.org/10.1371/journal.pcbi.1006578
_version_ 1783383051175198720
author Jumper, John M.
Faruk, Nabil F.
Freed, Karl F.
Sosnick, Tobin R.
author_facet Jumper, John M.
Faruk, Nabil F.
Freed, Karl F.
Sosnick, Tobin R.
author_sort Jumper, John M.
collection PubMed
description An ongoing challenge in protein chemistry is to identify the underlying interaction energies that capture protein dynamics. The traditional trade-off in biomolecular simulation between accuracy and computational efficiency is predicated on the assumption that detailed force fields are typically well-parameterized, obtaining a significant fraction of possible accuracy. We re-examine this trade-off in the more realistic regime in which parameterization is a greater source of error than the level of detail in the force field. To address parameterization of coarse-grained force fields, we use the contrastive divergence technique from machine learning to train from simulations of 450 proteins. In our procedure, the computational efficiency of the model enables high accuracy through the precise tuning of the Boltzmann ensemble. This method is applied to our recently developed Upside model, where the free energy for side chains is rapidly calculated at every time-step, allowing for a smooth energy landscape without steric rattling of the side chains. After this contrastive divergence training, the model is able to de novo fold proteins up to 100 residues on a single core in days. This improved Upside model provides a starting point both for investigation of folding dynamics and as an inexpensive Bayesian prior for protein physics that can be integrated with additional experimental or bioinformatic data.
format Online
Article
Text
id pubmed-6307714
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-63077142019-01-08 Trajectory-based training enables protein simulations with accurate folding and Boltzmann ensembles in cpu-hours Jumper, John M. Faruk, Nabil F. Freed, Karl F. Sosnick, Tobin R. PLoS Comput Biol Research Article An ongoing challenge in protein chemistry is to identify the underlying interaction energies that capture protein dynamics. The traditional trade-off in biomolecular simulation between accuracy and computational efficiency is predicated on the assumption that detailed force fields are typically well-parameterized, obtaining a significant fraction of possible accuracy. We re-examine this trade-off in the more realistic regime in which parameterization is a greater source of error than the level of detail in the force field. To address parameterization of coarse-grained force fields, we use the contrastive divergence technique from machine learning to train from simulations of 450 proteins. In our procedure, the computational efficiency of the model enables high accuracy through the precise tuning of the Boltzmann ensemble. This method is applied to our recently developed Upside model, where the free energy for side chains is rapidly calculated at every time-step, allowing for a smooth energy landscape without steric rattling of the side chains. After this contrastive divergence training, the model is able to de novo fold proteins up to 100 residues on a single core in days. This improved Upside model provides a starting point both for investigation of folding dynamics and as an inexpensive Bayesian prior for protein physics that can be integrated with additional experimental or bioinformatic data. Public Library of Science 2018-12-27 /pmc/articles/PMC6307714/ /pubmed/30589834 http://dx.doi.org/10.1371/journal.pcbi.1006578 Text en © 2018 Jumper et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Jumper, John M.
Faruk, Nabil F.
Freed, Karl F.
Sosnick, Tobin R.
Trajectory-based training enables protein simulations with accurate folding and Boltzmann ensembles in cpu-hours
title Trajectory-based training enables protein simulations with accurate folding and Boltzmann ensembles in cpu-hours
title_full Trajectory-based training enables protein simulations with accurate folding and Boltzmann ensembles in cpu-hours
title_fullStr Trajectory-based training enables protein simulations with accurate folding and Boltzmann ensembles in cpu-hours
title_full_unstemmed Trajectory-based training enables protein simulations with accurate folding and Boltzmann ensembles in cpu-hours
title_short Trajectory-based training enables protein simulations with accurate folding and Boltzmann ensembles in cpu-hours
title_sort trajectory-based training enables protein simulations with accurate folding and boltzmann ensembles in cpu-hours
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6307714/
https://www.ncbi.nlm.nih.gov/pubmed/30589834
http://dx.doi.org/10.1371/journal.pcbi.1006578
work_keys_str_mv AT jumperjohnm trajectorybasedtrainingenablesproteinsimulationswithaccuratefoldingandboltzmannensemblesincpuhours
AT faruknabilf trajectorybasedtrainingenablesproteinsimulationswithaccuratefoldingandboltzmannensemblesincpuhours
AT freedkarlf trajectorybasedtrainingenablesproteinsimulationswithaccuratefoldingandboltzmannensemblesincpuhours
AT sosnicktobinr trajectorybasedtrainingenablesproteinsimulationswithaccuratefoldingandboltzmannensemblesincpuhours