Cargando…

Discovering Collective Variables of Molecular Transitions via Genetic Algorithms and Neural Networks

[Image: see text] With the continual improvement of computing hardware and algorithms, simulations have become a powerful tool for understanding all sorts of (bio)molecular processes. To handle the large simulation data sets and to accelerate slow, activated transitions, a condensed set of descripto...

Descripción completa

Detalles Bibliográficos
Autores principales:	Hooft, Ferry, Pérez de Alba Ortíz, Alberto, Ensing, Bernd
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	American Chemical Society 2021
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8047796/ https://www.ncbi.nlm.nih.gov/pubmed/33662202 http://dx.doi.org/10.1021/acs.jctc.0c00981

_version_	1783679112455389184
author	Hooft, Ferry Pérez de Alba Ortíz, Alberto Ensing, Bernd
author_facet	Hooft, Ferry Pérez de Alba Ortíz, Alberto Ensing, Bernd
author_sort	Hooft, Ferry
collection	PubMed
description	[Image: see text] With the continual improvement of computing hardware and algorithms, simulations have become a powerful tool for understanding all sorts of (bio)molecular processes. To handle the large simulation data sets and to accelerate slow, activated transitions, a condensed set of descriptors, or collective variables (CVs), is needed to discern the relevant dynamics that describes the molecular process of interest. However, proposing an adequate set of CVs that can capture the intrinsic reaction coordinate of the molecular transition is often extremely difficult. Here, we present a framework to find an optimal set of CVs from a pool of candidates using a combination of artificial neural networks and genetic algorithms. The approach effectively replaces the encoder of an autoencoder network with genes to represent the latent space, i.e., the CVs. Given a selection of CVs as input, the network is trained to recover the atom coordinates underlying the CV values at points along the transition. The network performance is used as an estimator of the fitness of the input CVs. Two genetic algorithms optimize the CV selection and the neural network architecture. The successful retrieval of optimal CVs by this framework is illustrated at the hand of two case studies: the well-known conformational change in the alanine dipeptide molecule and the more intricate transition of a base pair in B-DNA from the classic Watson–Crick pairing to the alternative Hoogsteen pairing. Key advantages of our framework include the following: optimal interpretable CVs, avoiding costly calculation of committor or time-correlation functions, and automatic hyperparameter optimization. In addition, we show that applying a time-delay between the network input and output allows for enhanced selection of slow variables. Moreover, the network can also be used to generate molecular configurations of unexplored microstates, for example, for augmentation of the simulation data.
format	Online Article Text
id	pubmed-8047796
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	American Chemical Society
record_format	MEDLINE/PubMed
spelling	pubmed-80477962021-04-16 Discovering Collective Variables of Molecular Transitions via Genetic Algorithms and Neural Networks Hooft, Ferry Pérez de Alba Ortíz, Alberto Ensing, Bernd J Chem Theory Comput [Image: see text] With the continual improvement of computing hardware and algorithms, simulations have become a powerful tool for understanding all sorts of (bio)molecular processes. To handle the large simulation data sets and to accelerate slow, activated transitions, a condensed set of descriptors, or collective variables (CVs), is needed to discern the relevant dynamics that describes the molecular process of interest. However, proposing an adequate set of CVs that can capture the intrinsic reaction coordinate of the molecular transition is often extremely difficult. Here, we present a framework to find an optimal set of CVs from a pool of candidates using a combination of artificial neural networks and genetic algorithms. The approach effectively replaces the encoder of an autoencoder network with genes to represent the latent space, i.e., the CVs. Given a selection of CVs as input, the network is trained to recover the atom coordinates underlying the CV values at points along the transition. The network performance is used as an estimator of the fitness of the input CVs. Two genetic algorithms optimize the CV selection and the neural network architecture. The successful retrieval of optimal CVs by this framework is illustrated at the hand of two case studies: the well-known conformational change in the alanine dipeptide molecule and the more intricate transition of a base pair in B-DNA from the classic Watson–Crick pairing to the alternative Hoogsteen pairing. Key advantages of our framework include the following: optimal interpretable CVs, avoiding costly calculation of committor or time-correlation functions, and automatic hyperparameter optimization. In addition, we show that applying a time-delay between the network input and output allows for enhanced selection of slow variables. Moreover, the network can also be used to generate molecular configurations of unexplored microstates, for example, for augmentation of the simulation data. American Chemical Society 2021-03-04 2021-04-13 /pmc/articles/PMC8047796/ /pubmed/33662202 http://dx.doi.org/10.1021/acs.jctc.0c00981 Text en © 2021 The Authors. Published by American Chemical Society Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle	Hooft, Ferry Pérez de Alba Ortíz, Alberto Ensing, Bernd Discovering Collective Variables of Molecular Transitions via Genetic Algorithms and Neural Networks
title	Discovering Collective Variables of Molecular Transitions via Genetic Algorithms and Neural Networks
title_full	Discovering Collective Variables of Molecular Transitions via Genetic Algorithms and Neural Networks
title_fullStr	Discovering Collective Variables of Molecular Transitions via Genetic Algorithms and Neural Networks
title_full_unstemmed	Discovering Collective Variables of Molecular Transitions via Genetic Algorithms and Neural Networks
title_short	Discovering Collective Variables of Molecular Transitions via Genetic Algorithms and Neural Networks
title_sort	discovering collective variables of molecular transitions via genetic algorithms and neural networks
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8047796/ https://www.ncbi.nlm.nih.gov/pubmed/33662202 http://dx.doi.org/10.1021/acs.jctc.0c00981
work_keys_str_mv	AT hooftferry discoveringcollectivevariablesofmoleculartransitionsviageneticalgorithmsandneuralnetworks AT perezdealbaortizalberto discoveringcollectivevariablesofmoleculartransitionsviageneticalgorithmsandneuralnetworks AT ensingbernd discoveringcollectivevariablesofmoleculartransitionsviageneticalgorithmsandneuralnetworks

Discovering Collective Variables of Molecular Transitions via Genetic Algorithms and Neural Networks

Ejemplares similares