Cargando…

Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions

BACKGROUND: A method to estimate ease of synthesis (synthetic accessibility) of drug-like molecules is needed in many areas of the drug discovery process. The development and validation of such a method that is able to characterize molecule synthetic accessibility as a score between 1 (easy to make)...

Descripción completa

Detalles Bibliográficos
Autores principales: Ertl, Peter, Schuffenhauer, Ansgar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3225829/
https://www.ncbi.nlm.nih.gov/pubmed/20298526
http://dx.doi.org/10.1186/1758-2946-1-8
_version_ 1782217532673884160
author Ertl, Peter
Schuffenhauer, Ansgar
author_facet Ertl, Peter
Schuffenhauer, Ansgar
author_sort Ertl, Peter
collection PubMed
description BACKGROUND: A method to estimate ease of synthesis (synthetic accessibility) of drug-like molecules is needed in many areas of the drug discovery process. The development and validation of such a method that is able to characterize molecule synthetic accessibility as a score between 1 (easy to make) and 10 (very difficult to make) is described in this article. RESULTS: The method for estimation of the synthetic accessibility score (SAscore) described here is based on a combination of fragment contributions and a complexity penalty. Fragment contributions have been calculated based on the analysis of one million representative molecules from PubChem and therefore one can say that they capture historical synthetic knowledge stored in this database. The molecular complexity score takes into account the presence of non-standard structural features, such as large rings, non-standard ring fusions, stereocomplexity and molecule size. The method has been validated by comparing calculated SAscores with ease of synthesis as estimated by experienced medicinal chemists for a set of 40 molecules. The agreement between calculated and manually estimated synthetic accessibility is very good with r(2 )= 0.89. CONCLUSION: A novel method to estimate synthetic accessibility of molecules has been developed. This method uses historical synthetic knowledge obtained by analyzing information from millions of already synthesized chemicals and considers also molecule complexity. The method is sufficiently fast and provides results consistent with estimation of ease of synthesis by experienced medicinal chemists. The calculated SAscore may be used to support various drug discovery processes where a large number of molecules needs to be ranked based on their synthetic accessibility, for example when purchasing samples for screening, selecting hits from high-throughput screening for follow-up, or ranking molecules generated by various de novo design approaches.
format Online
Article
Text
id pubmed-3225829
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Springer
record_format MEDLINE/PubMed
spelling pubmed-32258292011-11-30 Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions Ertl, Peter Schuffenhauer, Ansgar J Cheminform Research Article BACKGROUND: A method to estimate ease of synthesis (synthetic accessibility) of drug-like molecules is needed in many areas of the drug discovery process. The development and validation of such a method that is able to characterize molecule synthetic accessibility as a score between 1 (easy to make) and 10 (very difficult to make) is described in this article. RESULTS: The method for estimation of the synthetic accessibility score (SAscore) described here is based on a combination of fragment contributions and a complexity penalty. Fragment contributions have been calculated based on the analysis of one million representative molecules from PubChem and therefore one can say that they capture historical synthetic knowledge stored in this database. The molecular complexity score takes into account the presence of non-standard structural features, such as large rings, non-standard ring fusions, stereocomplexity and molecule size. The method has been validated by comparing calculated SAscores with ease of synthesis as estimated by experienced medicinal chemists for a set of 40 molecules. The agreement between calculated and manually estimated synthetic accessibility is very good with r(2 )= 0.89. CONCLUSION: A novel method to estimate synthetic accessibility of molecules has been developed. This method uses historical synthetic knowledge obtained by analyzing information from millions of already synthesized chemicals and considers also molecule complexity. The method is sufficiently fast and provides results consistent with estimation of ease of synthesis by experienced medicinal chemists. The calculated SAscore may be used to support various drug discovery processes where a large number of molecules needs to be ranked based on their synthetic accessibility, for example when purchasing samples for screening, selecting hits from high-throughput screening for follow-up, or ranking molecules generated by various de novo design approaches. Springer 2009-06-10 /pmc/articles/PMC3225829/ /pubmed/20298526 http://dx.doi.org/10.1186/1758-2946-1-8 Text en Copyright © 2009 Ertl and Schuffenhauer; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Ertl, Peter
Schuffenhauer, Ansgar
Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions
title Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions
title_full Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions
title_fullStr Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions
title_full_unstemmed Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions
title_short Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions
title_sort estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3225829/
https://www.ncbi.nlm.nih.gov/pubmed/20298526
http://dx.doi.org/10.1186/1758-2946-1-8
work_keys_str_mv AT ertlpeter estimationofsyntheticaccessibilityscoreofdruglikemoleculesbasedonmolecularcomplexityandfragmentcontributions
AT schuffenhaueransgar estimationofsyntheticaccessibilityscoreofdruglikemoleculesbasedonmolecularcomplexityandfragmentcontributions