Cargando…

Systematic Extraction of Analogue Series from Large Compound Collections Using a New Computational Compound–Core Relationship Method

[Image: see text] Chemical optimization of organic compounds produces a series of analogues. In addition to considering an analogue series (AS) or multiple series on a case-by-case basis, which is often done in the practice of chemistry, the extraction of analogues from compound repositories is of h...

Descripción completa

Detalles Bibliográficos
Autores principales: Naveja, J. Jesús, Vogt, Martin, Stumpfe, Dagmar, Medina-Franco, José L., Bajorath, Jürgen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2019
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6648924/
https://www.ncbi.nlm.nih.gov/pubmed/31459378
http://dx.doi.org/10.1021/acsomega.8b03390
_version_ 1783437974901358592
author Naveja, J. Jesús
Vogt, Martin
Stumpfe, Dagmar
Medina-Franco, José L.
Bajorath, Jürgen
author_facet Naveja, J. Jesús
Vogt, Martin
Stumpfe, Dagmar
Medina-Franco, José L.
Bajorath, Jürgen
author_sort Naveja, J. Jesús
collection PubMed
description [Image: see text] Chemical optimization of organic compounds produces a series of analogues. In addition to considering an analogue series (AS) or multiple series on a case-by-case basis, which is often done in the practice of chemistry, the extraction of analogues from compound repositories is of high interest in organic and medicinal chemistry. In organic chemistry, ASs are a source of alternative synthetic routes and also aid in exploring relationships between compounds from different sources including synthetic vs. naturally occurring molecules. In medicinal chemistry, ASs are the major source of structure–activity relationship information and of hits or leads for drug development. ASs might be identified in different ways. For a given reference compound, a substructure search can be carried out using its scaffold. Alternatively, matched molecular pairs can be calculated to retrieve analogues from a compound repository. However, if no query compounds are used, the identification of ASs in databases is a difficult task. Herein, we introduce a computational approach to systematically identify ASs in collections of organic compounds. The approach involves compound decomposition on the basis of well-established retrosynthetic rules, organization of compound–core relationships, and identification of analogues sharing the same core. The method was applied on a large scale to extract ASs from the ChEMBL database, yielding more than 30 000 distinct series.
format Online
Article
Text
id pubmed-6648924
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-66489242019-08-27 Systematic Extraction of Analogue Series from Large Compound Collections Using a New Computational Compound–Core Relationship Method Naveja, J. Jesús Vogt, Martin Stumpfe, Dagmar Medina-Franco, José L. Bajorath, Jürgen ACS Omega [Image: see text] Chemical optimization of organic compounds produces a series of analogues. In addition to considering an analogue series (AS) or multiple series on a case-by-case basis, which is often done in the practice of chemistry, the extraction of analogues from compound repositories is of high interest in organic and medicinal chemistry. In organic chemistry, ASs are a source of alternative synthetic routes and also aid in exploring relationships between compounds from different sources including synthetic vs. naturally occurring molecules. In medicinal chemistry, ASs are the major source of structure–activity relationship information and of hits or leads for drug development. ASs might be identified in different ways. For a given reference compound, a substructure search can be carried out using its scaffold. Alternatively, matched molecular pairs can be calculated to retrieve analogues from a compound repository. However, if no query compounds are used, the identification of ASs in databases is a difficult task. Herein, we introduce a computational approach to systematically identify ASs in collections of organic compounds. The approach involves compound decomposition on the basis of well-established retrosynthetic rules, organization of compound–core relationships, and identification of analogues sharing the same core. The method was applied on a large scale to extract ASs from the ChEMBL database, yielding more than 30 000 distinct series. American Chemical Society 2019-01-14 /pmc/articles/PMC6648924/ /pubmed/31459378 http://dx.doi.org/10.1021/acsomega.8b03390 Text en Copyright © 2019 American Chemical Society This is an open access article published under an ACS AuthorChoice License (http://pubs.acs.org/page/policy/authorchoice_termsofuse.html) , which permits copying and redistribution of the article or any adaptations for non-commercial purposes.
spellingShingle Naveja, J. Jesús
Vogt, Martin
Stumpfe, Dagmar
Medina-Franco, José L.
Bajorath, Jürgen
Systematic Extraction of Analogue Series from Large Compound Collections Using a New Computational Compound–Core Relationship Method
title Systematic Extraction of Analogue Series from Large Compound Collections Using a New Computational Compound–Core Relationship Method
title_full Systematic Extraction of Analogue Series from Large Compound Collections Using a New Computational Compound–Core Relationship Method
title_fullStr Systematic Extraction of Analogue Series from Large Compound Collections Using a New Computational Compound–Core Relationship Method
title_full_unstemmed Systematic Extraction of Analogue Series from Large Compound Collections Using a New Computational Compound–Core Relationship Method
title_short Systematic Extraction of Analogue Series from Large Compound Collections Using a New Computational Compound–Core Relationship Method
title_sort systematic extraction of analogue series from large compound collections using a new computational compound–core relationship method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6648924/
https://www.ncbi.nlm.nih.gov/pubmed/31459378
http://dx.doi.org/10.1021/acsomega.8b03390
work_keys_str_mv AT navejajjesus systematicextractionofanalogueseriesfromlargecompoundcollectionsusinganewcomputationalcompoundcorerelationshipmethod
AT vogtmartin systematicextractionofanalogueseriesfromlargecompoundcollectionsusinganewcomputationalcompoundcorerelationshipmethod
AT stumpfedagmar systematicextractionofanalogueseriesfromlargecompoundcollectionsusinganewcomputationalcompoundcorerelationshipmethod
AT medinafrancojosel systematicextractionofanalogueseriesfromlargecompoundcollectionsusinganewcomputationalcompoundcorerelationshipmethod
AT bajorathjurgen systematicextractionofanalogueseriesfromlargecompoundcollectionsusinganewcomputationalcompoundcorerelationshipmethod