Cargando…
Integrating synthetic accessibility with AI-based generative drug design
Generative models are frequently used for de novo design in drug discovery projects to propose new molecules. However, the question of whether or not the generated molecules can be synthesized is not systematically taken into account during generation, even though being able to synthesize the genera...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10507964/ https://www.ncbi.nlm.nih.gov/pubmed/37726842 http://dx.doi.org/10.1186/s13321-023-00742-8 |
_version_ | 1785107427568386048 |
---|---|
author | Parrot, Maud Tajmouati, Hamza da Silva, Vinicius Barros Ribeiro Atwood, Brian Ross Fourcade, Robin Gaston-Mathé, Yann Do Huu, Nicolas Perron, Quentin |
author_facet | Parrot, Maud Tajmouati, Hamza da Silva, Vinicius Barros Ribeiro Atwood, Brian Ross Fourcade, Robin Gaston-Mathé, Yann Do Huu, Nicolas Perron, Quentin |
author_sort | Parrot, Maud |
collection | PubMed |
description | Generative models are frequently used for de novo design in drug discovery projects to propose new molecules. However, the question of whether or not the generated molecules can be synthesized is not systematically taken into account during generation, even though being able to synthesize the generated molecules is a fundamental requirement for such methods to be useful in practice. Methods have been developed to estimate molecule “synthesizability”, but, so far, there is no consensus on whether or not a molecule is synthesizable. In this paper we introduce the Retro-Score (RScore), which computes a synthetic accessibility score of molecules by performing a full retrosynthetic analysis through our data-driven synthetic planning software Spaya, and its dedicated API: Spaya-API (https://spaya.ai). We start by comparing several synthetic accessibility scores to a binary “chemist score” as estimated by chemists on a bench of generated molecules, as a first experimental validation that the RScore is a reliable synthetic accessibility score. We then describe a pipeline to generate molecules that validate a list of targets while still being easy to synthesize. We further this idea by performing experiments comparing molecular generator outputs across a range of constraints and conditions. We show that the RScore can be learned by a Neural Network, which leads to a new score: RSPred. We demonstrate that using the RScore or RSPred as a constraint during molecular generation enables our molecular generators to produce more synthesizable solutions, with higher diversity. The open-source Python code containing all the scores and the experiments can be found on (https://github.com/iktos/generation-under-synthetic-constraint). GRAPHIC ABSTRACT: [Image: see text] |
format | Online Article Text |
id | pubmed-10507964 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-105079642023-09-20 Integrating synthetic accessibility with AI-based generative drug design Parrot, Maud Tajmouati, Hamza da Silva, Vinicius Barros Ribeiro Atwood, Brian Ross Fourcade, Robin Gaston-Mathé, Yann Do Huu, Nicolas Perron, Quentin J Cheminform Research Generative models are frequently used for de novo design in drug discovery projects to propose new molecules. However, the question of whether or not the generated molecules can be synthesized is not systematically taken into account during generation, even though being able to synthesize the generated molecules is a fundamental requirement for such methods to be useful in practice. Methods have been developed to estimate molecule “synthesizability”, but, so far, there is no consensus on whether or not a molecule is synthesizable. In this paper we introduce the Retro-Score (RScore), which computes a synthetic accessibility score of molecules by performing a full retrosynthetic analysis through our data-driven synthetic planning software Spaya, and its dedicated API: Spaya-API (https://spaya.ai). We start by comparing several synthetic accessibility scores to a binary “chemist score” as estimated by chemists on a bench of generated molecules, as a first experimental validation that the RScore is a reliable synthetic accessibility score. We then describe a pipeline to generate molecules that validate a list of targets while still being easy to synthesize. We further this idea by performing experiments comparing molecular generator outputs across a range of constraints and conditions. We show that the RScore can be learned by a Neural Network, which leads to a new score: RSPred. We demonstrate that using the RScore or RSPred as a constraint during molecular generation enables our molecular generators to produce more synthesizable solutions, with higher diversity. The open-source Python code containing all the scores and the experiments can be found on (https://github.com/iktos/generation-under-synthetic-constraint). GRAPHIC ABSTRACT: [Image: see text] Springer International Publishing 2023-09-19 /pmc/articles/PMC10507964/ /pubmed/37726842 http://dx.doi.org/10.1186/s13321-023-00742-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Parrot, Maud Tajmouati, Hamza da Silva, Vinicius Barros Ribeiro Atwood, Brian Ross Fourcade, Robin Gaston-Mathé, Yann Do Huu, Nicolas Perron, Quentin Integrating synthetic accessibility with AI-based generative drug design |
title | Integrating synthetic accessibility with AI-based generative drug design |
title_full | Integrating synthetic accessibility with AI-based generative drug design |
title_fullStr | Integrating synthetic accessibility with AI-based generative drug design |
title_full_unstemmed | Integrating synthetic accessibility with AI-based generative drug design |
title_short | Integrating synthetic accessibility with AI-based generative drug design |
title_sort | integrating synthetic accessibility with ai-based generative drug design |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10507964/ https://www.ncbi.nlm.nih.gov/pubmed/37726842 http://dx.doi.org/10.1186/s13321-023-00742-8 |
work_keys_str_mv | AT parrotmaud integratingsyntheticaccessibilitywithaibasedgenerativedrugdesign AT tajmouatihamza integratingsyntheticaccessibilitywithaibasedgenerativedrugdesign AT dasilvaviniciusbarrosribeiro integratingsyntheticaccessibilitywithaibasedgenerativedrugdesign AT atwoodbrianross integratingsyntheticaccessibilitywithaibasedgenerativedrugdesign AT fourcaderobin integratingsyntheticaccessibilitywithaibasedgenerativedrugdesign AT gastonmatheyann integratingsyntheticaccessibilitywithaibasedgenerativedrugdesign AT dohuunicolas integratingsyntheticaccessibilitywithaibasedgenerativedrugdesign AT perronquentin integratingsyntheticaccessibilitywithaibasedgenerativedrugdesign |