Cargando…
Critical assessment of synthetic accessibility scores in computer-assisted synthesis planning
Modern computer-assisted synthesis planning tools provide strong support for this problem. However, they are still limited by computational complexity. This limitation may be overcome by scoring the synthetic accessibility as a pre-retrosynthesis heuristic. A wide range of machine learning scoring a...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9840255/ https://www.ncbi.nlm.nih.gov/pubmed/36641473 http://dx.doi.org/10.1186/s13321-023-00678-z |
_version_ | 1784869603641393152 |
---|---|
author | Skoraczyński, Grzegorz Kitlas, Mateusz Miasojedow, Błażej Gambin, Anna |
author_facet | Skoraczyński, Grzegorz Kitlas, Mateusz Miasojedow, Błażej Gambin, Anna |
author_sort | Skoraczyński, Grzegorz |
collection | PubMed |
description | Modern computer-assisted synthesis planning tools provide strong support for this problem. However, they are still limited by computational complexity. This limitation may be overcome by scoring the synthetic accessibility as a pre-retrosynthesis heuristic. A wide range of machine learning scoring approaches is available, however, their applicability and correctness were studied to a limited extent. Moreover, there is a lack of critical assessment of synthetic accessibility scores with common test conditions.In the present work, we assess if synthetic accessibility scores can reliably predict the outcomes of retrosynthesis planning. Using a specially prepared compounds database, we examine the outcomes of the retrosynthetic tool AiZynthFinder. We test whether synthetic accessibility scores: SAscore, SYBA, SCScore, and RAscore accurately predict the results of retrosynthesis planning. Furthermore, we investigate if synthetic accessibility scores can speed up retrosynthesis planning by better prioritizing explored partial synthetic routes and thus reducing the size of the search space. For that purpose, we analyze the AiZynthFinder partial solutions search trees, their structure, and complexity parameters, such as the number of nodes, or treewidth.We confirm that synthetic accessibility scores in most cases well discriminate feasible molecules from infeasible ones and can be potential boosters of retrosynthesis planning tools. Moreover, we show the current challenges of designing computer-assisted synthesis planning tools. We conclude that hybrid machine learning and human intuition-based synthetic accessibility scores can efficiently boost the effectiveness of computer-assisted retrosynthesis planning, however, they need to be carefully crafted for retrosynthesis planning algorithms.The source code of this work is publicly available at https://github.com/grzsko/ASAP. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-023-00678-z. |
format | Online Article Text |
id | pubmed-9840255 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-98402552023-01-15 Critical assessment of synthetic accessibility scores in computer-assisted synthesis planning Skoraczyński, Grzegorz Kitlas, Mateusz Miasojedow, Błażej Gambin, Anna J Cheminform Research Modern computer-assisted synthesis planning tools provide strong support for this problem. However, they are still limited by computational complexity. This limitation may be overcome by scoring the synthetic accessibility as a pre-retrosynthesis heuristic. A wide range of machine learning scoring approaches is available, however, their applicability and correctness were studied to a limited extent. Moreover, there is a lack of critical assessment of synthetic accessibility scores with common test conditions.In the present work, we assess if synthetic accessibility scores can reliably predict the outcomes of retrosynthesis planning. Using a specially prepared compounds database, we examine the outcomes of the retrosynthetic tool AiZynthFinder. We test whether synthetic accessibility scores: SAscore, SYBA, SCScore, and RAscore accurately predict the results of retrosynthesis planning. Furthermore, we investigate if synthetic accessibility scores can speed up retrosynthesis planning by better prioritizing explored partial synthetic routes and thus reducing the size of the search space. For that purpose, we analyze the AiZynthFinder partial solutions search trees, their structure, and complexity parameters, such as the number of nodes, or treewidth.We confirm that synthetic accessibility scores in most cases well discriminate feasible molecules from infeasible ones and can be potential boosters of retrosynthesis planning tools. Moreover, we show the current challenges of designing computer-assisted synthesis planning tools. We conclude that hybrid machine learning and human intuition-based synthetic accessibility scores can efficiently boost the effectiveness of computer-assisted retrosynthesis planning, however, they need to be carefully crafted for retrosynthesis planning algorithms.The source code of this work is publicly available at https://github.com/grzsko/ASAP. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-023-00678-z. Springer International Publishing 2023-01-14 /pmc/articles/PMC9840255/ /pubmed/36641473 http://dx.doi.org/10.1186/s13321-023-00678-z Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Skoraczyński, Grzegorz Kitlas, Mateusz Miasojedow, Błażej Gambin, Anna Critical assessment of synthetic accessibility scores in computer-assisted synthesis planning |
title | Critical assessment of synthetic accessibility scores in computer-assisted synthesis planning |
title_full | Critical assessment of synthetic accessibility scores in computer-assisted synthesis planning |
title_fullStr | Critical assessment of synthetic accessibility scores in computer-assisted synthesis planning |
title_full_unstemmed | Critical assessment of synthetic accessibility scores in computer-assisted synthesis planning |
title_short | Critical assessment of synthetic accessibility scores in computer-assisted synthesis planning |
title_sort | critical assessment of synthetic accessibility scores in computer-assisted synthesis planning |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9840255/ https://www.ncbi.nlm.nih.gov/pubmed/36641473 http://dx.doi.org/10.1186/s13321-023-00678-z |
work_keys_str_mv | AT skoraczynskigrzegorz criticalassessmentofsyntheticaccessibilityscoresincomputerassistedsynthesisplanning AT kitlasmateusz criticalassessmentofsyntheticaccessibilityscoresincomputerassistedsynthesisplanning AT miasojedowbłazej criticalassessmentofsyntheticaccessibilityscoresincomputerassistedsynthesisplanning AT gambinanna criticalassessmentofsyntheticaccessibilityscoresincomputerassistedsynthesisplanning |