Cargando…

Integrating synthetic accessibility with AI-based generative drug design

Generative models are frequently used for de novo design in drug discovery projects to propose new molecules. However, the question of whether or not the generated molecules can be synthesized is not systematically taken into account during generation, even though being able to synthesize the genera...

Descripción completa

Detalles Bibliográficos
Autores principales: Parrot, Maud, Tajmouati, Hamza, da Silva, Vinicius Barros Ribeiro, Atwood, Brian Ross, Fourcade, Robin, Gaston-Mathé, Yann, Do Huu, Nicolas, Perron, Quentin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10507964/
https://www.ncbi.nlm.nih.gov/pubmed/37726842
http://dx.doi.org/10.1186/s13321-023-00742-8
_version_ 1785107427568386048
author Parrot, Maud
Tajmouati, Hamza
da Silva, Vinicius Barros Ribeiro
Atwood, Brian Ross
Fourcade, Robin
Gaston-Mathé, Yann
Do Huu, Nicolas
Perron, Quentin
author_facet Parrot, Maud
Tajmouati, Hamza
da Silva, Vinicius Barros Ribeiro
Atwood, Brian Ross
Fourcade, Robin
Gaston-Mathé, Yann
Do Huu, Nicolas
Perron, Quentin
author_sort Parrot, Maud
collection PubMed
description Generative models are frequently used for de novo design in drug discovery projects to propose new molecules. However, the question of whether or not the generated molecules can be synthesized is not systematically taken into account during generation, even though being able to synthesize the generated molecules is a fundamental requirement for such methods to be useful in practice. Methods have been developed to estimate molecule “synthesizability”, but, so far, there is no consensus on whether or not a molecule is synthesizable. In this paper we introduce the Retro-Score (RScore), which computes a synthetic accessibility score of molecules by performing a full retrosynthetic analysis through our data-driven synthetic planning software Spaya, and its dedicated API: Spaya-API (https://spaya.ai). We start by comparing several synthetic accessibility scores to a binary “chemist score” as estimated by chemists on a bench of generated molecules, as a first experimental validation that the RScore is a reliable synthetic accessibility score. We then describe a pipeline to generate molecules that validate a list of targets while still being easy to synthesize. We further this idea by performing experiments comparing molecular generator outputs across a range of constraints and conditions. We show that the RScore can be learned by a Neural Network, which leads to a new score: RSPred. We demonstrate that using the RScore or RSPred as a constraint during molecular generation enables our molecular generators to produce more synthesizable solutions, with higher diversity. The open-source Python code containing all the scores and the experiments can be found on (https://github.com/iktos/generation-under-synthetic-constraint). GRAPHIC ABSTRACT: [Image: see text]
format Online
Article
Text
id pubmed-10507964
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-105079642023-09-20 Integrating synthetic accessibility with AI-based generative drug design Parrot, Maud Tajmouati, Hamza da Silva, Vinicius Barros Ribeiro Atwood, Brian Ross Fourcade, Robin Gaston-Mathé, Yann Do Huu, Nicolas Perron, Quentin J Cheminform Research Generative models are frequently used for de novo design in drug discovery projects to propose new molecules. However, the question of whether or not the generated molecules can be synthesized is not systematically taken into account during generation, even though being able to synthesize the generated molecules is a fundamental requirement for such methods to be useful in practice. Methods have been developed to estimate molecule “synthesizability”, but, so far, there is no consensus on whether or not a molecule is synthesizable. In this paper we introduce the Retro-Score (RScore), which computes a synthetic accessibility score of molecules by performing a full retrosynthetic analysis through our data-driven synthetic planning software Spaya, and its dedicated API: Spaya-API (https://spaya.ai). We start by comparing several synthetic accessibility scores to a binary “chemist score” as estimated by chemists on a bench of generated molecules, as a first experimental validation that the RScore is a reliable synthetic accessibility score. We then describe a pipeline to generate molecules that validate a list of targets while still being easy to synthesize. We further this idea by performing experiments comparing molecular generator outputs across a range of constraints and conditions. We show that the RScore can be learned by a Neural Network, which leads to a new score: RSPred. We demonstrate that using the RScore or RSPred as a constraint during molecular generation enables our molecular generators to produce more synthesizable solutions, with higher diversity. The open-source Python code containing all the scores and the experiments can be found on (https://github.com/iktos/generation-under-synthetic-constraint). GRAPHIC ABSTRACT: [Image: see text] Springer International Publishing 2023-09-19 /pmc/articles/PMC10507964/ /pubmed/37726842 http://dx.doi.org/10.1186/s13321-023-00742-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Parrot, Maud
Tajmouati, Hamza
da Silva, Vinicius Barros Ribeiro
Atwood, Brian Ross
Fourcade, Robin
Gaston-Mathé, Yann
Do Huu, Nicolas
Perron, Quentin
Integrating synthetic accessibility with AI-based generative drug design
title Integrating synthetic accessibility with AI-based generative drug design
title_full Integrating synthetic accessibility with AI-based generative drug design
title_fullStr Integrating synthetic accessibility with AI-based generative drug design
title_full_unstemmed Integrating synthetic accessibility with AI-based generative drug design
title_short Integrating synthetic accessibility with AI-based generative drug design
title_sort integrating synthetic accessibility with ai-based generative drug design
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10507964/
https://www.ncbi.nlm.nih.gov/pubmed/37726842
http://dx.doi.org/10.1186/s13321-023-00742-8
work_keys_str_mv AT parrotmaud integratingsyntheticaccessibilitywithaibasedgenerativedrugdesign
AT tajmouatihamza integratingsyntheticaccessibilitywithaibasedgenerativedrugdesign
AT dasilvaviniciusbarrosribeiro integratingsyntheticaccessibilitywithaibasedgenerativedrugdesign
AT atwoodbrianross integratingsyntheticaccessibilitywithaibasedgenerativedrugdesign
AT fourcaderobin integratingsyntheticaccessibilitywithaibasedgenerativedrugdesign
AT gastonmatheyann integratingsyntheticaccessibilitywithaibasedgenerativedrugdesign
AT dohuunicolas integratingsyntheticaccessibilitywithaibasedgenerativedrugdesign
AT perronquentin integratingsyntheticaccessibilitywithaibasedgenerativedrugdesign