Cargando…

Multiscale Reweighted Stochastic Embedding: Deep Learning of Collective Variables for Enhanced Sampling

[Image: see text] Machine learning methods provide a general framework for automatically finding and representing the essential characteristics of simulation data. This task is particularly crucial in enhanced sampling simulations. There we seek a few generalized degrees of freedom, referred to as c...

Descripción completa

Detalles Bibliográficos
Autores principales: Rydzewski, Jakub, Valsson, Omar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2021
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8389995/
https://www.ncbi.nlm.nih.gov/pubmed/34213915
http://dx.doi.org/10.1021/acs.jpca.1c02869
_version_ 1783742994826919936
author Rydzewski, Jakub
Valsson, Omar
author_facet Rydzewski, Jakub
Valsson, Omar
author_sort Rydzewski, Jakub
collection PubMed
description [Image: see text] Machine learning methods provide a general framework for automatically finding and representing the essential characteristics of simulation data. This task is particularly crucial in enhanced sampling simulations. There we seek a few generalized degrees of freedom, referred to as collective variables (CVs), to represent and drive the sampling of the free energy landscape. In theory, these CVs should separate different metastable states and correspond to the slow degrees of freedom of the studied physical process. To this aim, we propose a new method that we call multiscale reweighted stochastic embedding (MRSE). Our work builds upon a parametric version of stochastic neighbor embedding. The technique automatically learns CVs that map a high-dimensional feature space to a low-dimensional latent space via a deep neural network. We introduce several new advancements to stochastic neighbor embedding methods that make MRSE especially suitable for enhanced sampling simulations: (1) weight-tempered random sampling as a landmark selection scheme to obtain training data sets that strike a balance between equilibrium representation and capturing important metastable states lying higher in free energy; (2) a multiscale representation of the high-dimensional feature space via a Gaussian mixture probability model; and (3) a reweighting procedure to account for training data from a biased probability distribution. We show that MRSE constructs low-dimensional CVs that can correctly characterize the different metastable states in three model systems: the Müller-Brown potential, alanine dipeptide, and alanine tetrapeptide.
format Online
Article
Text
id pubmed-8389995
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-83899952021-08-31 Multiscale Reweighted Stochastic Embedding: Deep Learning of Collective Variables for Enhanced Sampling Rydzewski, Jakub Valsson, Omar J Phys Chem A [Image: see text] Machine learning methods provide a general framework for automatically finding and representing the essential characteristics of simulation data. This task is particularly crucial in enhanced sampling simulations. There we seek a few generalized degrees of freedom, referred to as collective variables (CVs), to represent and drive the sampling of the free energy landscape. In theory, these CVs should separate different metastable states and correspond to the slow degrees of freedom of the studied physical process. To this aim, we propose a new method that we call multiscale reweighted stochastic embedding (MRSE). Our work builds upon a parametric version of stochastic neighbor embedding. The technique automatically learns CVs that map a high-dimensional feature space to a low-dimensional latent space via a deep neural network. We introduce several new advancements to stochastic neighbor embedding methods that make MRSE especially suitable for enhanced sampling simulations: (1) weight-tempered random sampling as a landmark selection scheme to obtain training data sets that strike a balance between equilibrium representation and capturing important metastable states lying higher in free energy; (2) a multiscale representation of the high-dimensional feature space via a Gaussian mixture probability model; and (3) a reweighting procedure to account for training data from a biased probability distribution. We show that MRSE constructs low-dimensional CVs that can correctly characterize the different metastable states in three model systems: the Müller-Brown potential, alanine dipeptide, and alanine tetrapeptide. American Chemical Society 2021-07-02 2021-07-22 /pmc/articles/PMC8389995/ /pubmed/34213915 http://dx.doi.org/10.1021/acs.jpca.1c02869 Text en © 2021 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Rydzewski, Jakub
Valsson, Omar
Multiscale Reweighted Stochastic Embedding: Deep Learning of Collective Variables for Enhanced Sampling
title Multiscale Reweighted Stochastic Embedding: Deep Learning of Collective Variables for Enhanced Sampling
title_full Multiscale Reweighted Stochastic Embedding: Deep Learning of Collective Variables for Enhanced Sampling
title_fullStr Multiscale Reweighted Stochastic Embedding: Deep Learning of Collective Variables for Enhanced Sampling
title_full_unstemmed Multiscale Reweighted Stochastic Embedding: Deep Learning of Collective Variables for Enhanced Sampling
title_short Multiscale Reweighted Stochastic Embedding: Deep Learning of Collective Variables for Enhanced Sampling
title_sort multiscale reweighted stochastic embedding: deep learning of collective variables for enhanced sampling
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8389995/
https://www.ncbi.nlm.nih.gov/pubmed/34213915
http://dx.doi.org/10.1021/acs.jpca.1c02869
work_keys_str_mv AT rydzewskijakub multiscalereweightedstochasticembeddingdeeplearningofcollectivevariablesforenhancedsampling
AT valssonomar multiscalereweightedstochasticembeddingdeeplearningofcollectivevariablesforenhancedsampling