Cargando…

Molecular optimization by capturing chemist’s intuition using deep neural networks

A main challenge in drug discovery is finding molecules with a desirable balance of multiple properties. Here, we focus on the task of molecular optimization, where the goal is to optimize a given starting molecule towards desirable properties. This task can be framed as a machine translation proble...

Descripción completa

Detalles Bibliográficos
Autores principales: He, Jiazhen, You, Huifang, Sandström, Emil, Nittinger, Eva, Bjerrum, Esben Jannik, Tyrchan, Christian, Czechtizky, Werngard, Engkvist, Ola
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7980633/
https://www.ncbi.nlm.nih.gov/pubmed/33743817
http://dx.doi.org/10.1186/s13321-021-00497-0
_version_ 1783667467844845568
author He, Jiazhen
You, Huifang
Sandström, Emil
Nittinger, Eva
Bjerrum, Esben Jannik
Tyrchan, Christian
Czechtizky, Werngard
Engkvist, Ola
author_facet He, Jiazhen
You, Huifang
Sandström, Emil
Nittinger, Eva
Bjerrum, Esben Jannik
Tyrchan, Christian
Czechtizky, Werngard
Engkvist, Ola
author_sort He, Jiazhen
collection PubMed
description A main challenge in drug discovery is finding molecules with a desirable balance of multiple properties. Here, we focus on the task of molecular optimization, where the goal is to optimize a given starting molecule towards desirable properties. This task can be framed as a machine translation problem in natural language processing, where in our case, a molecule is translated into a molecule with optimized properties based on the SMILES representation. Typically, chemists would use their intuition to suggest chemical transformations for the starting molecule being optimized. A widely used strategy is the concept of matched molecular pairs where two molecules differ by a single transformation. We seek to capture the chemist’s intuition from matched molecular pairs using machine translation models. Specifically, the sequence-to-sequence model with attention mechanism, and the Transformer model are employed to generate molecules with desirable properties. As a proof of concept, three ADMET properties are optimized simultaneously: logD, solubility, and clearance, which are important properties of a drug. Since desirable properties often vary from project to project, the user-specified desirable property changes are incorporated into the input as an additional condition together with the starting molecules being optimized. Thus, the models can be guided to generate molecules satisfying the desirable properties. Additionally, we compare the two machine translation models based on the SMILES representation, with a graph-to-graph translation model HierG2G, which has shown the state-of-the-art performance in molecular optimization. Our results show that the Transformer can generate more molecules with desirable properties by making small modifications to the given starting molecules, which can be intuitive to chemists. A further enrichment of diverse molecules can be achieved by using an ensemble of models. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-021-00497-0.
format Online
Article
Text
id pubmed-7980633
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-79806332021-03-22 Molecular optimization by capturing chemist’s intuition using deep neural networks He, Jiazhen You, Huifang Sandström, Emil Nittinger, Eva Bjerrum, Esben Jannik Tyrchan, Christian Czechtizky, Werngard Engkvist, Ola J Cheminform Research Article A main challenge in drug discovery is finding molecules with a desirable balance of multiple properties. Here, we focus on the task of molecular optimization, where the goal is to optimize a given starting molecule towards desirable properties. This task can be framed as a machine translation problem in natural language processing, where in our case, a molecule is translated into a molecule with optimized properties based on the SMILES representation. Typically, chemists would use their intuition to suggest chemical transformations for the starting molecule being optimized. A widely used strategy is the concept of matched molecular pairs where two molecules differ by a single transformation. We seek to capture the chemist’s intuition from matched molecular pairs using machine translation models. Specifically, the sequence-to-sequence model with attention mechanism, and the Transformer model are employed to generate molecules with desirable properties. As a proof of concept, three ADMET properties are optimized simultaneously: logD, solubility, and clearance, which are important properties of a drug. Since desirable properties often vary from project to project, the user-specified desirable property changes are incorporated into the input as an additional condition together with the starting molecules being optimized. Thus, the models can be guided to generate molecules satisfying the desirable properties. Additionally, we compare the two machine translation models based on the SMILES representation, with a graph-to-graph translation model HierG2G, which has shown the state-of-the-art performance in molecular optimization. Our results show that the Transformer can generate more molecules with desirable properties by making small modifications to the given starting molecules, which can be intuitive to chemists. A further enrichment of diverse molecules can be achieved by using an ensemble of models. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-021-00497-0. Springer International Publishing 2021-03-20 /pmc/articles/PMC7980633/ /pubmed/33743817 http://dx.doi.org/10.1186/s13321-021-00497-0 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
He, Jiazhen
You, Huifang
Sandström, Emil
Nittinger, Eva
Bjerrum, Esben Jannik
Tyrchan, Christian
Czechtizky, Werngard
Engkvist, Ola
Molecular optimization by capturing chemist’s intuition using deep neural networks
title Molecular optimization by capturing chemist’s intuition using deep neural networks
title_full Molecular optimization by capturing chemist’s intuition using deep neural networks
title_fullStr Molecular optimization by capturing chemist’s intuition using deep neural networks
title_full_unstemmed Molecular optimization by capturing chemist’s intuition using deep neural networks
title_short Molecular optimization by capturing chemist’s intuition using deep neural networks
title_sort molecular optimization by capturing chemist’s intuition using deep neural networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7980633/
https://www.ncbi.nlm.nih.gov/pubmed/33743817
http://dx.doi.org/10.1186/s13321-021-00497-0
work_keys_str_mv AT hejiazhen molecularoptimizationbycapturingchemistsintuitionusingdeepneuralnetworks
AT youhuifang molecularoptimizationbycapturingchemistsintuitionusingdeepneuralnetworks
AT sandstromemil molecularoptimizationbycapturingchemistsintuitionusingdeepneuralnetworks
AT nittingereva molecularoptimizationbycapturingchemistsintuitionusingdeepneuralnetworks
AT bjerrumesbenjannik molecularoptimizationbycapturingchemistsintuitionusingdeepneuralnetworks
AT tyrchanchristian molecularoptimizationbycapturingchemistsintuitionusingdeepneuralnetworks
AT czechtizkywerngard molecularoptimizationbycapturingchemistsintuitionusingdeepneuralnetworks
AT engkvistola molecularoptimizationbycapturingchemistsintuitionusingdeepneuralnetworks