Cargando…
Multiscale Reweighted Stochastic Embedding: Deep Learning of Collective Variables for Enhanced Sampling
[Image: see text] Machine learning methods provide a general framework for automatically finding and representing the essential characteristics of simulation data. This task is particularly crucial in enhanced sampling simulations. There we seek a few generalized degrees of freedom, referred to as c...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical
Society
2021
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8389995/ https://www.ncbi.nlm.nih.gov/pubmed/34213915 http://dx.doi.org/10.1021/acs.jpca.1c02869 |
_version_ | 1783742994826919936 |
---|---|
author | Rydzewski, Jakub Valsson, Omar |
author_facet | Rydzewski, Jakub Valsson, Omar |
author_sort | Rydzewski, Jakub |
collection | PubMed |
description | [Image: see text] Machine learning methods provide a general framework for automatically finding and representing the essential characteristics of simulation data. This task is particularly crucial in enhanced sampling simulations. There we seek a few generalized degrees of freedom, referred to as collective variables (CVs), to represent and drive the sampling of the free energy landscape. In theory, these CVs should separate different metastable states and correspond to the slow degrees of freedom of the studied physical process. To this aim, we propose a new method that we call multiscale reweighted stochastic embedding (MRSE). Our work builds upon a parametric version of stochastic neighbor embedding. The technique automatically learns CVs that map a high-dimensional feature space to a low-dimensional latent space via a deep neural network. We introduce several new advancements to stochastic neighbor embedding methods that make MRSE especially suitable for enhanced sampling simulations: (1) weight-tempered random sampling as a landmark selection scheme to obtain training data sets that strike a balance between equilibrium representation and capturing important metastable states lying higher in free energy; (2) a multiscale representation of the high-dimensional feature space via a Gaussian mixture probability model; and (3) a reweighting procedure to account for training data from a biased probability distribution. We show that MRSE constructs low-dimensional CVs that can correctly characterize the different metastable states in three model systems: the Müller-Brown potential, alanine dipeptide, and alanine tetrapeptide. |
format | Online Article Text |
id | pubmed-8389995 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | American Chemical
Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-83899952021-08-31 Multiscale Reweighted Stochastic Embedding: Deep Learning of Collective Variables for Enhanced Sampling Rydzewski, Jakub Valsson, Omar J Phys Chem A [Image: see text] Machine learning methods provide a general framework for automatically finding and representing the essential characteristics of simulation data. This task is particularly crucial in enhanced sampling simulations. There we seek a few generalized degrees of freedom, referred to as collective variables (CVs), to represent and drive the sampling of the free energy landscape. In theory, these CVs should separate different metastable states and correspond to the slow degrees of freedom of the studied physical process. To this aim, we propose a new method that we call multiscale reweighted stochastic embedding (MRSE). Our work builds upon a parametric version of stochastic neighbor embedding. The technique automatically learns CVs that map a high-dimensional feature space to a low-dimensional latent space via a deep neural network. We introduce several new advancements to stochastic neighbor embedding methods that make MRSE especially suitable for enhanced sampling simulations: (1) weight-tempered random sampling as a landmark selection scheme to obtain training data sets that strike a balance between equilibrium representation and capturing important metastable states lying higher in free energy; (2) a multiscale representation of the high-dimensional feature space via a Gaussian mixture probability model; and (3) a reweighting procedure to account for training data from a biased probability distribution. We show that MRSE constructs low-dimensional CVs that can correctly characterize the different metastable states in three model systems: the Müller-Brown potential, alanine dipeptide, and alanine tetrapeptide. American Chemical Society 2021-07-02 2021-07-22 /pmc/articles/PMC8389995/ /pubmed/34213915 http://dx.doi.org/10.1021/acs.jpca.1c02869 Text en © 2021 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Rydzewski, Jakub Valsson, Omar Multiscale Reweighted Stochastic Embedding: Deep Learning of Collective Variables for Enhanced Sampling |
title | Multiscale Reweighted Stochastic Embedding: Deep Learning
of Collective Variables for Enhanced Sampling |
title_full | Multiscale Reweighted Stochastic Embedding: Deep Learning
of Collective Variables for Enhanced Sampling |
title_fullStr | Multiscale Reweighted Stochastic Embedding: Deep Learning
of Collective Variables for Enhanced Sampling |
title_full_unstemmed | Multiscale Reweighted Stochastic Embedding: Deep Learning
of Collective Variables for Enhanced Sampling |
title_short | Multiscale Reweighted Stochastic Embedding: Deep Learning
of Collective Variables for Enhanced Sampling |
title_sort | multiscale reweighted stochastic embedding: deep learning
of collective variables for enhanced sampling |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8389995/ https://www.ncbi.nlm.nih.gov/pubmed/34213915 http://dx.doi.org/10.1021/acs.jpca.1c02869 |
work_keys_str_mv | AT rydzewskijakub multiscalereweightedstochasticembeddingdeeplearningofcollectivevariablesforenhancedsampling AT valssonomar multiscalereweightedstochasticembeddingdeeplearningofcollectivevariablesforenhancedsampling |