Cargando…

Accelerating Protein Folding Molecular Dynamics Using Inter-Residue Distances from Machine Learning Servers

[Image: see text] Recently, predicting the native structures of proteins has become possible using computational molecular physics (CMP)—physics-based force fields sampled with proper statistics—but only for small proteins. Algorithms with better scaling are needed. We describe ML x MELD x MD, a mol...

Descripción completa

Detalles Bibliográficos
Autores principales: Nassar, Roy, Brini, Emiliano, Parui, Sridip, Liu, Cong, Dignon, Gregory L., Dill, Ken A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2022
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9281603/
https://www.ncbi.nlm.nih.gov/pubmed/35133832
http://dx.doi.org/10.1021/acs.jctc.1c00916
_version_ 1784746917140365312
author Nassar, Roy
Brini, Emiliano
Parui, Sridip
Liu, Cong
Dignon, Gregory L.
Dill, Ken A.
author_facet Nassar, Roy
Brini, Emiliano
Parui, Sridip
Liu, Cong
Dignon, Gregory L.
Dill, Ken A.
author_sort Nassar, Roy
collection PubMed
description [Image: see text] Recently, predicting the native structures of proteins has become possible using computational molecular physics (CMP)—physics-based force fields sampled with proper statistics—but only for small proteins. Algorithms with better scaling are needed. We describe ML x MELD x MD, a molecular dynamics (MD) method that inputs residue contacts derived from machine learning (ML) servers into MELD, a Bayesian accelerator that preserves detailed-balance statistics. Contacts are derived from trRosetta-predicted distance histograms (distograms) and are integrated into MELD’s atomistic MD as spatial restraints through parametrized potential functions. In the CASP14 blind prediction event, ML x MELD x MD predicted 13 native structures to better than 4.5 Å error, including for 10 proteins in the range of 115–250 amino acids long. Also, the scaling of simulation time vs protein length is much better than unguided MD: t(sim) ∼ e(0.023N) for ML x MELD x MD vs t(sim) ∼ e(0.168N) for MD alone. This shows how machine learning information can be leveraged to advance physics-based modeling of proteins.
format Online
Article
Text
id pubmed-9281603
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-92816032023-02-08 Accelerating Protein Folding Molecular Dynamics Using Inter-Residue Distances from Machine Learning Servers Nassar, Roy Brini, Emiliano Parui, Sridip Liu, Cong Dignon, Gregory L. Dill, Ken A. J Chem Theory Comput [Image: see text] Recently, predicting the native structures of proteins has become possible using computational molecular physics (CMP)—physics-based force fields sampled with proper statistics—but only for small proteins. Algorithms with better scaling are needed. We describe ML x MELD x MD, a molecular dynamics (MD) method that inputs residue contacts derived from machine learning (ML) servers into MELD, a Bayesian accelerator that preserves detailed-balance statistics. Contacts are derived from trRosetta-predicted distance histograms (distograms) and are integrated into MELD’s atomistic MD as spatial restraints through parametrized potential functions. In the CASP14 blind prediction event, ML x MELD x MD predicted 13 native structures to better than 4.5 Å error, including for 10 proteins in the range of 115–250 amino acids long. Also, the scaling of simulation time vs protein length is much better than unguided MD: t(sim) ∼ e(0.023N) for ML x MELD x MD vs t(sim) ∼ e(0.168N) for MD alone. This shows how machine learning information can be leveraged to advance physics-based modeling of proteins. American Chemical Society 2022-02-08 2022-03-08 /pmc/articles/PMC9281603/ /pubmed/35133832 http://dx.doi.org/10.1021/acs.jctc.1c00916 Text en © 2022 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Nassar, Roy
Brini, Emiliano
Parui, Sridip
Liu, Cong
Dignon, Gregory L.
Dill, Ken A.
Accelerating Protein Folding Molecular Dynamics Using Inter-Residue Distances from Machine Learning Servers
title Accelerating Protein Folding Molecular Dynamics Using Inter-Residue Distances from Machine Learning Servers
title_full Accelerating Protein Folding Molecular Dynamics Using Inter-Residue Distances from Machine Learning Servers
title_fullStr Accelerating Protein Folding Molecular Dynamics Using Inter-Residue Distances from Machine Learning Servers
title_full_unstemmed Accelerating Protein Folding Molecular Dynamics Using Inter-Residue Distances from Machine Learning Servers
title_short Accelerating Protein Folding Molecular Dynamics Using Inter-Residue Distances from Machine Learning Servers
title_sort accelerating protein folding molecular dynamics using inter-residue distances from machine learning servers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9281603/
https://www.ncbi.nlm.nih.gov/pubmed/35133832
http://dx.doi.org/10.1021/acs.jctc.1c00916
work_keys_str_mv AT nassarroy acceleratingproteinfoldingmoleculardynamicsusinginterresiduedistancesfrommachinelearningservers
AT briniemiliano acceleratingproteinfoldingmoleculardynamicsusinginterresiduedistancesfrommachinelearningservers
AT paruisridip acceleratingproteinfoldingmoleculardynamicsusinginterresiduedistancesfrommachinelearningservers
AT liucong acceleratingproteinfoldingmoleculardynamicsusinginterresiduedistancesfrommachinelearningservers
AT dignongregoryl acceleratingproteinfoldingmoleculardynamicsusinginterresiduedistancesfrommachinelearningservers
AT dillkena acceleratingproteinfoldingmoleculardynamicsusinginterresiduedistancesfrommachinelearningservers