Cargando…

Critical assessment of synthetic accessibility scores in computer-assisted synthesis planning

Modern computer-assisted synthesis planning tools provide strong support for this problem. However, they are still limited by computational complexity. This limitation may be overcome by scoring the synthetic accessibility as a pre-retrosynthesis heuristic. A wide range of machine learning scoring a...

Descripción completa

Detalles Bibliográficos
Autores principales: Skoraczyński, Grzegorz, Kitlas, Mateusz, Miasojedow, Błażej, Gambin, Anna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9840255/
https://www.ncbi.nlm.nih.gov/pubmed/36641473
http://dx.doi.org/10.1186/s13321-023-00678-z
_version_ 1784869603641393152
author Skoraczyński, Grzegorz
Kitlas, Mateusz
Miasojedow, Błażej
Gambin, Anna
author_facet Skoraczyński, Grzegorz
Kitlas, Mateusz
Miasojedow, Błażej
Gambin, Anna
author_sort Skoraczyński, Grzegorz
collection PubMed
description Modern computer-assisted synthesis planning tools provide strong support for this problem. However, they are still limited by computational complexity. This limitation may be overcome by scoring the synthetic accessibility as a pre-retrosynthesis heuristic. A wide range of machine learning scoring approaches is available, however, their applicability and correctness were studied to a limited extent. Moreover, there is a lack of critical assessment of synthetic accessibility scores with common test conditions.In the present work, we assess if synthetic accessibility scores can reliably predict the outcomes of retrosynthesis planning. Using a specially prepared compounds database, we examine the outcomes of the retrosynthetic tool AiZynthFinder. We test whether synthetic accessibility scores: SAscore, SYBA, SCScore, and RAscore accurately predict the results of retrosynthesis planning. Furthermore, we investigate if synthetic accessibility scores can speed up retrosynthesis planning by better prioritizing explored partial synthetic routes and thus reducing the size of the search space. For that purpose, we analyze the AiZynthFinder partial solutions search trees, their structure, and complexity parameters, such as the number of nodes, or treewidth.We confirm that synthetic accessibility scores in most cases well discriminate feasible molecules from infeasible ones and can be potential boosters of retrosynthesis planning tools. Moreover, we show the current challenges of designing computer-assisted synthesis planning tools. We conclude that hybrid machine learning and human intuition-based synthetic accessibility scores can efficiently boost the effectiveness of computer-assisted retrosynthesis planning, however, they need to be carefully crafted for retrosynthesis planning algorithms.The source code of this work is publicly available at https://github.com/grzsko/ASAP. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-023-00678-z.
format Online
Article
Text
id pubmed-9840255
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-98402552023-01-15 Critical assessment of synthetic accessibility scores in computer-assisted synthesis planning Skoraczyński, Grzegorz Kitlas, Mateusz Miasojedow, Błażej Gambin, Anna J Cheminform Research Modern computer-assisted synthesis planning tools provide strong support for this problem. However, they are still limited by computational complexity. This limitation may be overcome by scoring the synthetic accessibility as a pre-retrosynthesis heuristic. A wide range of machine learning scoring approaches is available, however, their applicability and correctness were studied to a limited extent. Moreover, there is a lack of critical assessment of synthetic accessibility scores with common test conditions.In the present work, we assess if synthetic accessibility scores can reliably predict the outcomes of retrosynthesis planning. Using a specially prepared compounds database, we examine the outcomes of the retrosynthetic tool AiZynthFinder. We test whether synthetic accessibility scores: SAscore, SYBA, SCScore, and RAscore accurately predict the results of retrosynthesis planning. Furthermore, we investigate if synthetic accessibility scores can speed up retrosynthesis planning by better prioritizing explored partial synthetic routes and thus reducing the size of the search space. For that purpose, we analyze the AiZynthFinder partial solutions search trees, their structure, and complexity parameters, such as the number of nodes, or treewidth.We confirm that synthetic accessibility scores in most cases well discriminate feasible molecules from infeasible ones and can be potential boosters of retrosynthesis planning tools. Moreover, we show the current challenges of designing computer-assisted synthesis planning tools. We conclude that hybrid machine learning and human intuition-based synthetic accessibility scores can efficiently boost the effectiveness of computer-assisted retrosynthesis planning, however, they need to be carefully crafted for retrosynthesis planning algorithms.The source code of this work is publicly available at https://github.com/grzsko/ASAP. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-023-00678-z. Springer International Publishing 2023-01-14 /pmc/articles/PMC9840255/ /pubmed/36641473 http://dx.doi.org/10.1186/s13321-023-00678-z Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Skoraczyński, Grzegorz
Kitlas, Mateusz
Miasojedow, Błażej
Gambin, Anna
Critical assessment of synthetic accessibility scores in computer-assisted synthesis planning
title Critical assessment of synthetic accessibility scores in computer-assisted synthesis planning
title_full Critical assessment of synthetic accessibility scores in computer-assisted synthesis planning
title_fullStr Critical assessment of synthetic accessibility scores in computer-assisted synthesis planning
title_full_unstemmed Critical assessment of synthetic accessibility scores in computer-assisted synthesis planning
title_short Critical assessment of synthetic accessibility scores in computer-assisted synthesis planning
title_sort critical assessment of synthetic accessibility scores in computer-assisted synthesis planning
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9840255/
https://www.ncbi.nlm.nih.gov/pubmed/36641473
http://dx.doi.org/10.1186/s13321-023-00678-z
work_keys_str_mv AT skoraczynskigrzegorz criticalassessmentofsyntheticaccessibilityscoresincomputerassistedsynthesisplanning
AT kitlasmateusz criticalassessmentofsyntheticaccessibilityscoresincomputerassistedsynthesisplanning
AT miasojedowbłazej criticalassessmentofsyntheticaccessibilityscoresincomputerassistedsynthesisplanning
AT gambinanna criticalassessmentofsyntheticaccessibilityscoresincomputerassistedsynthesisplanning