Cargando…

Retrosynthetic accessibility score (RAscore) – rapid machine learned synthesizability classification from AI driven retrosynthetic planning

Computer aided synthesis planning (CASP) is part of a suite of artificial intelligence (AI) based tools that are able to propose synthesis routes to a wide range of compounds. However, at present they are too slow to be used to screen the synthetic feasibility of millions of generated or enumerated...

Descripción completa

Detalles Bibliográficos
Autores principales: Thakkar, Amol, Chadimová, Veronika, Bjerrum, Esben Jannik, Engkvist, Ola, Reymond, Jean-Louis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society of Chemistry 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8179384/
https://www.ncbi.nlm.nih.gov/pubmed/34164104
http://dx.doi.org/10.1039/d0sc05401a
_version_ 1783703768427134976
author Thakkar, Amol
Chadimová, Veronika
Bjerrum, Esben Jannik
Engkvist, Ola
Reymond, Jean-Louis
author_facet Thakkar, Amol
Chadimová, Veronika
Bjerrum, Esben Jannik
Engkvist, Ola
Reymond, Jean-Louis
author_sort Thakkar, Amol
collection PubMed
description Computer aided synthesis planning (CASP) is part of a suite of artificial intelligence (AI) based tools that are able to propose synthesis routes to a wide range of compounds. However, at present they are too slow to be used to screen the synthetic feasibility of millions of generated or enumerated compounds before identification of potential bioactivity by virtual screening (VS) workflows. Herein we report a machine learning (ML) based method capable of classifying whether a synthetic route can be identified for a particular compound or not by the CASP tool AiZynthFinder. The resulting ML models return a retrosynthetic accessibility score (RAscore) of any molecule of interest, and computes at least 4500 times faster than retrosynthetic analysis performed by the underlying CASP tool. The RAscore should be useful for pre-screening millions of virtual molecules from enumerated databases or generative models for synthetic accessibility and produce higher quality databases for virtual screening of biological activity.
format Online
Article
Text
id pubmed-8179384
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher The Royal Society of Chemistry
record_format MEDLINE/PubMed
spelling pubmed-81793842021-06-22 Retrosynthetic accessibility score (RAscore) – rapid machine learned synthesizability classification from AI driven retrosynthetic planning Thakkar, Amol Chadimová, Veronika Bjerrum, Esben Jannik Engkvist, Ola Reymond, Jean-Louis Chem Sci Chemistry Computer aided synthesis planning (CASP) is part of a suite of artificial intelligence (AI) based tools that are able to propose synthesis routes to a wide range of compounds. However, at present they are too slow to be used to screen the synthetic feasibility of millions of generated or enumerated compounds before identification of potential bioactivity by virtual screening (VS) workflows. Herein we report a machine learning (ML) based method capable of classifying whether a synthetic route can be identified for a particular compound or not by the CASP tool AiZynthFinder. The resulting ML models return a retrosynthetic accessibility score (RAscore) of any molecule of interest, and computes at least 4500 times faster than retrosynthetic analysis performed by the underlying CASP tool. The RAscore should be useful for pre-screening millions of virtual molecules from enumerated databases or generative models for synthetic accessibility and produce higher quality databases for virtual screening of biological activity. The Royal Society of Chemistry 2021-01-22 /pmc/articles/PMC8179384/ /pubmed/34164104 http://dx.doi.org/10.1039/d0sc05401a Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by-nc/3.0/
spellingShingle Chemistry
Thakkar, Amol
Chadimová, Veronika
Bjerrum, Esben Jannik
Engkvist, Ola
Reymond, Jean-Louis
Retrosynthetic accessibility score (RAscore) – rapid machine learned synthesizability classification from AI driven retrosynthetic planning
title Retrosynthetic accessibility score (RAscore) – rapid machine learned synthesizability classification from AI driven retrosynthetic planning
title_full Retrosynthetic accessibility score (RAscore) – rapid machine learned synthesizability classification from AI driven retrosynthetic planning
title_fullStr Retrosynthetic accessibility score (RAscore) – rapid machine learned synthesizability classification from AI driven retrosynthetic planning
title_full_unstemmed Retrosynthetic accessibility score (RAscore) – rapid machine learned synthesizability classification from AI driven retrosynthetic planning
title_short Retrosynthetic accessibility score (RAscore) – rapid machine learned synthesizability classification from AI driven retrosynthetic planning
title_sort retrosynthetic accessibility score (rascore) – rapid machine learned synthesizability classification from ai driven retrosynthetic planning
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8179384/
https://www.ncbi.nlm.nih.gov/pubmed/34164104
http://dx.doi.org/10.1039/d0sc05401a
work_keys_str_mv AT thakkaramol retrosyntheticaccessibilityscorerascorerapidmachinelearnedsynthesizabilityclassificationfromaidrivenretrosyntheticplanning
AT chadimovaveronika retrosyntheticaccessibilityscorerascorerapidmachinelearnedsynthesizabilityclassificationfromaidrivenretrosyntheticplanning
AT bjerrumesbenjannik retrosyntheticaccessibilityscorerascorerapidmachinelearnedsynthesizabilityclassificationfromaidrivenretrosyntheticplanning
AT engkvistola retrosyntheticaccessibilityscorerascorerapidmachinelearnedsynthesizabilityclassificationfromaidrivenretrosyntheticplanning
AT reymondjeanlouis retrosyntheticaccessibilityscorerascorerapidmachinelearnedsynthesizabilityclassificationfromaidrivenretrosyntheticplanning