Cargando…
Accelerating Protein Folding Molecular Dynamics Using Inter-Residue Distances from Machine Learning Servers
[Image: see text] Recently, predicting the native structures of proteins has become possible using computational molecular physics (CMP)—physics-based force fields sampled with proper statistics—but only for small proteins. Algorithms with better scaling are needed. We describe ML x MELD x MD, a mol...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2022
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9281603/ https://www.ncbi.nlm.nih.gov/pubmed/35133832 http://dx.doi.org/10.1021/acs.jctc.1c00916 |
_version_ | 1784746917140365312 |
---|---|
author | Nassar, Roy Brini, Emiliano Parui, Sridip Liu, Cong Dignon, Gregory L. Dill, Ken A. |
author_facet | Nassar, Roy Brini, Emiliano Parui, Sridip Liu, Cong Dignon, Gregory L. Dill, Ken A. |
author_sort | Nassar, Roy |
collection | PubMed |
description | [Image: see text] Recently, predicting the native structures of proteins has become possible using computational molecular physics (CMP)—physics-based force fields sampled with proper statistics—but only for small proteins. Algorithms with better scaling are needed. We describe ML x MELD x MD, a molecular dynamics (MD) method that inputs residue contacts derived from machine learning (ML) servers into MELD, a Bayesian accelerator that preserves detailed-balance statistics. Contacts are derived from trRosetta-predicted distance histograms (distograms) and are integrated into MELD’s atomistic MD as spatial restraints through parametrized potential functions. In the CASP14 blind prediction event, ML x MELD x MD predicted 13 native structures to better than 4.5 Å error, including for 10 proteins in the range of 115–250 amino acids long. Also, the scaling of simulation time vs protein length is much better than unguided MD: t(sim) ∼ e(0.023N) for ML x MELD x MD vs t(sim) ∼ e(0.168N) for MD alone. This shows how machine learning information can be leveraged to advance physics-based modeling of proteins. |
format | Online Article Text |
id | pubmed-9281603 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-92816032023-02-08 Accelerating Protein Folding Molecular Dynamics Using Inter-Residue Distances from Machine Learning Servers Nassar, Roy Brini, Emiliano Parui, Sridip Liu, Cong Dignon, Gregory L. Dill, Ken A. J Chem Theory Comput [Image: see text] Recently, predicting the native structures of proteins has become possible using computational molecular physics (CMP)—physics-based force fields sampled with proper statistics—but only for small proteins. Algorithms with better scaling are needed. We describe ML x MELD x MD, a molecular dynamics (MD) method that inputs residue contacts derived from machine learning (ML) servers into MELD, a Bayesian accelerator that preserves detailed-balance statistics. Contacts are derived from trRosetta-predicted distance histograms (distograms) and are integrated into MELD’s atomistic MD as spatial restraints through parametrized potential functions. In the CASP14 blind prediction event, ML x MELD x MD predicted 13 native structures to better than 4.5 Å error, including for 10 proteins in the range of 115–250 amino acids long. Also, the scaling of simulation time vs protein length is much better than unguided MD: t(sim) ∼ e(0.023N) for ML x MELD x MD vs t(sim) ∼ e(0.168N) for MD alone. This shows how machine learning information can be leveraged to advance physics-based modeling of proteins. American Chemical Society 2022-02-08 2022-03-08 /pmc/articles/PMC9281603/ /pubmed/35133832 http://dx.doi.org/10.1021/acs.jctc.1c00916 Text en © 2022 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Nassar, Roy Brini, Emiliano Parui, Sridip Liu, Cong Dignon, Gregory L. Dill, Ken A. Accelerating Protein Folding Molecular Dynamics Using Inter-Residue Distances from Machine Learning Servers |
title | Accelerating Protein Folding Molecular Dynamics Using
Inter-Residue Distances from Machine Learning Servers |
title_full | Accelerating Protein Folding Molecular Dynamics Using
Inter-Residue Distances from Machine Learning Servers |
title_fullStr | Accelerating Protein Folding Molecular Dynamics Using
Inter-Residue Distances from Machine Learning Servers |
title_full_unstemmed | Accelerating Protein Folding Molecular Dynamics Using
Inter-Residue Distances from Machine Learning Servers |
title_short | Accelerating Protein Folding Molecular Dynamics Using
Inter-Residue Distances from Machine Learning Servers |
title_sort | accelerating protein folding molecular dynamics using
inter-residue distances from machine learning servers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9281603/ https://www.ncbi.nlm.nih.gov/pubmed/35133832 http://dx.doi.org/10.1021/acs.jctc.1c00916 |
work_keys_str_mv | AT nassarroy acceleratingproteinfoldingmoleculardynamicsusinginterresiduedistancesfrommachinelearningservers AT briniemiliano acceleratingproteinfoldingmoleculardynamicsusinginterresiduedistancesfrommachinelearningservers AT paruisridip acceleratingproteinfoldingmoleculardynamicsusinginterresiduedistancesfrommachinelearningservers AT liucong acceleratingproteinfoldingmoleculardynamicsusinginterresiduedistancesfrommachinelearningservers AT dignongregoryl acceleratingproteinfoldingmoleculardynamicsusinginterresiduedistancesfrommachinelearningservers AT dillkena acceleratingproteinfoldingmoleculardynamicsusinginterresiduedistancesfrommachinelearningservers |