Cargando…

DrugEx v2: de novo design of drug molecules by Pareto-based multi-objective reinforcement learning in polypharmacology

In polypharmacology drugs are required to bind to multiple specific targets, for example to enhance efficacy or to reduce resistance formation. Although deep learning has achieved a breakthrough in de novo design in drug discovery, most of its applications only focus on a single drug target to gener...

Descripción completa

Detalles Bibliográficos
Autores principales:	Liu, Xuhan, Ye, Kai, van Vlijmen, Herman W. T., Emmerich, Michael T. M., IJzerman, Adriaan P., van Westen, Gerard J. P.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer International Publishing 2021
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8588612/ https://www.ncbi.nlm.nih.gov/pubmed/34772471 http://dx.doi.org/10.1186/s13321-021-00561-9

_version_	1784598511294087168
author	Liu, Xuhan Ye, Kai van Vlijmen, Herman W. T. Emmerich, Michael T. M. IJzerman, Adriaan P. van Westen, Gerard J. P.
author_facet	Liu, Xuhan Ye, Kai van Vlijmen, Herman W. T. Emmerich, Michael T. M. IJzerman, Adriaan P. van Westen, Gerard J. P.
author_sort	Liu, Xuhan
collection	PubMed
description	In polypharmacology drugs are required to bind to multiple specific targets, for example to enhance efficacy or to reduce resistance formation. Although deep learning has achieved a breakthrough in de novo design in drug discovery, most of its applications only focus on a single drug target to generate drug-like active molecules. However, in reality drug molecules often interact with more than one target which can have desired (polypharmacology) or undesired (toxicity) effects. In a previous study we proposed a new method named DrugEx that integrates an exploration strategy into RNN-based reinforcement learning to improve the diversity of the generated molecules. Here, we extended our DrugEx algorithm with multi-objective optimization to generate drug-like molecules towards multiple targets or one specific target while avoiding off-targets (the two adenosine receptors, A(1)AR and A(2A)AR, and the potassium ion channel hERG in this study). In our model, we applied an RNN as the agent and machine learning predictors as the environment. Both the agent and the environment were pre-trained in advance and then interplayed under a reinforcement learning framework. The concept of evolutionary algorithms was merged into our method such that crossover and mutation operations were implemented by the same deep learning model as the agent. During the training loop, the agent generates a batch of SMILES-based molecules. Subsequently scores for all objectives provided by the environment are used to construct Pareto ranks of the generated molecules. For this ranking a non-dominated sorting algorithm and a Tanimoto-based crowding distance algorithm using chemical fingerprints are applied. Here, we adopted GPU acceleration to speed up the process of Pareto optimization. The final reward of each molecule is calculated based on the Pareto ranking with the ranking selection algorithm. The agent is trained under the guidance of the reward to make sure it can generate desired molecules after convergence of the training process. All in all we demonstrate generation of compounds with a diverse predicted selectivity profile towards multiple targets, offering the potential of high efficacy and low toxicity. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-021-00561-9.
format	Online Article Text
id	pubmed-8588612
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Springer International Publishing
record_format	MEDLINE/PubMed
spelling	pubmed-85886122021-11-15 DrugEx v2: de novo design of drug molecules by Pareto-based multi-objective reinforcement learning in polypharmacology Liu, Xuhan Ye, Kai van Vlijmen, Herman W. T. Emmerich, Michael T. M. IJzerman, Adriaan P. van Westen, Gerard J. P. J Cheminform Research Article In polypharmacology drugs are required to bind to multiple specific targets, for example to enhance efficacy or to reduce resistance formation. Although deep learning has achieved a breakthrough in de novo design in drug discovery, most of its applications only focus on a single drug target to generate drug-like active molecules. However, in reality drug molecules often interact with more than one target which can have desired (polypharmacology) or undesired (toxicity) effects. In a previous study we proposed a new method named DrugEx that integrates an exploration strategy into RNN-based reinforcement learning to improve the diversity of the generated molecules. Here, we extended our DrugEx algorithm with multi-objective optimization to generate drug-like molecules towards multiple targets or one specific target while avoiding off-targets (the two adenosine receptors, A(1)AR and A(2A)AR, and the potassium ion channel hERG in this study). In our model, we applied an RNN as the agent and machine learning predictors as the environment. Both the agent and the environment were pre-trained in advance and then interplayed under a reinforcement learning framework. The concept of evolutionary algorithms was merged into our method such that crossover and mutation operations were implemented by the same deep learning model as the agent. During the training loop, the agent generates a batch of SMILES-based molecules. Subsequently scores for all objectives provided by the environment are used to construct Pareto ranks of the generated molecules. For this ranking a non-dominated sorting algorithm and a Tanimoto-based crowding distance algorithm using chemical fingerprints are applied. Here, we adopted GPU acceleration to speed up the process of Pareto optimization. The final reward of each molecule is calculated based on the Pareto ranking with the ranking selection algorithm. The agent is trained under the guidance of the reward to make sure it can generate desired molecules after convergence of the training process. All in all we demonstrate generation of compounds with a diverse predicted selectivity profile towards multiple targets, offering the potential of high efficacy and low toxicity. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-021-00561-9. Springer International Publishing 2021-11-12 /pmc/articles/PMC8588612/ /pubmed/34772471 http://dx.doi.org/10.1186/s13321-021-00561-9 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Research Article Liu, Xuhan Ye, Kai van Vlijmen, Herman W. T. Emmerich, Michael T. M. IJzerman, Adriaan P. van Westen, Gerard J. P. DrugEx v2: de novo design of drug molecules by Pareto-based multi-objective reinforcement learning in polypharmacology
title	DrugEx v2: de novo design of drug molecules by Pareto-based multi-objective reinforcement learning in polypharmacology
title_full	DrugEx v2: de novo design of drug molecules by Pareto-based multi-objective reinforcement learning in polypharmacology
title_fullStr	DrugEx v2: de novo design of drug molecules by Pareto-based multi-objective reinforcement learning in polypharmacology
title_full_unstemmed	DrugEx v2: de novo design of drug molecules by Pareto-based multi-objective reinforcement learning in polypharmacology
title_short	DrugEx v2: de novo design of drug molecules by Pareto-based multi-objective reinforcement learning in polypharmacology
title_sort	drugex v2: de novo design of drug molecules by pareto-based multi-objective reinforcement learning in polypharmacology
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8588612/ https://www.ncbi.nlm.nih.gov/pubmed/34772471 http://dx.doi.org/10.1186/s13321-021-00561-9
work_keys_str_mv	AT liuxuhan drugexv2denovodesignofdrugmoleculesbyparetobasedmultiobjectivereinforcementlearninginpolypharmacology AT yekai drugexv2denovodesignofdrugmoleculesbyparetobasedmultiobjectivereinforcementlearninginpolypharmacology AT vanvlijmenhermanwt drugexv2denovodesignofdrugmoleculesbyparetobasedmultiobjectivereinforcementlearninginpolypharmacology AT emmerichmichaeltm drugexv2denovodesignofdrugmoleculesbyparetobasedmultiobjectivereinforcementlearninginpolypharmacology AT ijzermanadriaanp drugexv2denovodesignofdrugmoleculesbyparetobasedmultiobjectivereinforcementlearninginpolypharmacology AT vanwestengerardjp drugexv2denovodesignofdrugmoleculesbyparetobasedmultiobjectivereinforcementlearninginpolypharmacology

DrugEx v2: de novo design of drug molecules by Pareto-based multi-objective reinforcement learning in polypharmacology

Ejemplares similares