Cargando…

Parallel tempered genetic algorithm guided by deep neural networks for inverse molecular design

Inverse molecular design involves algorithms that sample molecules with specific target properties from a multitude of candidates and can be posed as an optimization problem. High-dimensional optimization tasks in the natural sciences are commonly tackled via population-based metaheuristic optimizat...

Descripción completa

Detalles Bibliográficos
Autores principales: Nigam, AkshatKumar, Pollice, Robert, Aspuru-Guzik, Alán
Formato: Online Artículo Texto
Lenguaje:English
Publicado: RSC 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9358752/
https://www.ncbi.nlm.nih.gov/pubmed/36091415
http://dx.doi.org/10.1039/d2dd00003b
_version_ 1784763999595790336
author Nigam, AkshatKumar
Pollice, Robert
Aspuru-Guzik, Alán
author_facet Nigam, AkshatKumar
Pollice, Robert
Aspuru-Guzik, Alán
author_sort Nigam, AkshatKumar
collection PubMed
description Inverse molecular design involves algorithms that sample molecules with specific target properties from a multitude of candidates and can be posed as an optimization problem. High-dimensional optimization tasks in the natural sciences are commonly tackled via population-based metaheuristic optimization algorithms such as evolutionary algorithms. However, often unavoidable expensive property evaluation can limit the widespread use of such approaches as the associated cost can become prohibitive. Herein, we present JANUS, a genetic algorithm inspired by parallel tempering. It propagates two populations, one for exploration and another for exploitation, improving optimization by reducing property evaluations. JANUS is augmented by a deep neural network that approximates molecular properties and relies on active learning for enhanced molecular sampling. It uses the SELFIES representation and the STONED algorithm for the efficient generation of structures, and outperforms other generative models in common inverse molecular design tasks achieving state-of-the-art target metrics across multiple benchmarks. As neither most of the benchmarks nor the structure generator in JANUS account for synthesizability, a significant fraction of the proposed molecules is synthetically infeasible demonstrating that this aspect needs to be considered when evaluating the performance of molecular generative models.
format Online
Article
Text
id pubmed-9358752
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher RSC
record_format MEDLINE/PubMed
spelling pubmed-93587522022-09-08 Parallel tempered genetic algorithm guided by deep neural networks for inverse molecular design Nigam, AkshatKumar Pollice, Robert Aspuru-Guzik, Alán Digit Discov Chemistry Inverse molecular design involves algorithms that sample molecules with specific target properties from a multitude of candidates and can be posed as an optimization problem. High-dimensional optimization tasks in the natural sciences are commonly tackled via population-based metaheuristic optimization algorithms such as evolutionary algorithms. However, often unavoidable expensive property evaluation can limit the widespread use of such approaches as the associated cost can become prohibitive. Herein, we present JANUS, a genetic algorithm inspired by parallel tempering. It propagates two populations, one for exploration and another for exploitation, improving optimization by reducing property evaluations. JANUS is augmented by a deep neural network that approximates molecular properties and relies on active learning for enhanced molecular sampling. It uses the SELFIES representation and the STONED algorithm for the efficient generation of structures, and outperforms other generative models in common inverse molecular design tasks achieving state-of-the-art target metrics across multiple benchmarks. As neither most of the benchmarks nor the structure generator in JANUS account for synthesizability, a significant fraction of the proposed molecules is synthetically infeasible demonstrating that this aspect needs to be considered when evaluating the performance of molecular generative models. RSC 2022-05-03 /pmc/articles/PMC9358752/ /pubmed/36091415 http://dx.doi.org/10.1039/d2dd00003b Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by-nc/3.0/
spellingShingle Chemistry
Nigam, AkshatKumar
Pollice, Robert
Aspuru-Guzik, Alán
Parallel tempered genetic algorithm guided by deep neural networks for inverse molecular design
title Parallel tempered genetic algorithm guided by deep neural networks for inverse molecular design
title_full Parallel tempered genetic algorithm guided by deep neural networks for inverse molecular design
title_fullStr Parallel tempered genetic algorithm guided by deep neural networks for inverse molecular design
title_full_unstemmed Parallel tempered genetic algorithm guided by deep neural networks for inverse molecular design
title_short Parallel tempered genetic algorithm guided by deep neural networks for inverse molecular design
title_sort parallel tempered genetic algorithm guided by deep neural networks for inverse molecular design
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9358752/
https://www.ncbi.nlm.nih.gov/pubmed/36091415
http://dx.doi.org/10.1039/d2dd00003b
work_keys_str_mv AT nigamakshatkumar paralleltemperedgeneticalgorithmguidedbydeepneuralnetworksforinversemoleculardesign
AT pollicerobert paralleltemperedgeneticalgorithmguidedbydeepneuralnetworksforinversemoleculardesign
AT aspuruguzikalan paralleltemperedgeneticalgorithmguidedbydeepneuralnetworksforinversemoleculardesign