Cargando…

Cheminformatics-based enumeration and analysis of large libraries of macrolide scaffolds

We report on the development of a cheminformatics enumeration technology and the analysis of a resulting large dataset of virtual macrolide scaffolds. Although macrolides have been shown to have valuable biological properties, there is no ready-to-screen virtual library of diverse macrolides in the...

Descripción completa

Detalles Bibliográficos
Autores principales: Zin, Phyo Phyo Kyaw, Williams, Gavin, Fourches, Denis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6755550/
https://www.ncbi.nlm.nih.gov/pubmed/30421084
http://dx.doi.org/10.1186/s13321-018-0307-6
_version_ 1783453255777386496
author Zin, Phyo Phyo Kyaw
Williams, Gavin
Fourches, Denis
author_facet Zin, Phyo Phyo Kyaw
Williams, Gavin
Fourches, Denis
author_sort Zin, Phyo Phyo Kyaw
collection PubMed
description We report on the development of a cheminformatics enumeration technology and the analysis of a resulting large dataset of virtual macrolide scaffolds. Although macrolides have been shown to have valuable biological properties, there is no ready-to-screen virtual library of diverse macrolides in the public domain. Conducting molecular modeling (especially virtual screening) of these complex molecules is highly relevant as the organic synthesis of these compounds, when feasible, typically requires many synthetic steps, and thus dramatically slows the discovery of new bioactive macrolides. Herein, we introduce a cheminformatics approach and associated software that allows for designing and generating libraries of virtual macrocycle/macrolide scaffolds with user-defined constitutional and structural constraints (e.g., types and numbers of structural motifs to be included in the macrocycle, ring size, maximum number of compounds generated). To study the chemical diversity of such generated molecules, we enumerated V1M (Virtual 1 million Macrolide scaffolds) library, each containing twelve common structural motifs. For each macrolide scaffold, we calculated several key properties, such as molecular weight, hydrogen bond donors/acceptors, topological polar surface area. In this study, we discuss (1) the initial concept and current features of our PKS (polyketides) Enumerator software, (2) the chemical diversity and distribution of structural motifs in V1M library, and (3) the unique opportunities for future virtual screening of such enumerated ensembles of macrolides. Importantly, V1M is provided in the Supplementary Material of this paper allowing other researchers to conduct any type of molecular modeling and virtual screening studies. Therefore, this technology for enumerating extremely large libraries of macrolide scaffolds could hold a unique potential in the field of computational chemistry and drug discovery for rational designing of new antibiotics and anti-cancer agents. [Image: see text] ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13321-018-0307-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6755550
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-67555502019-09-26 Cheminformatics-based enumeration and analysis of large libraries of macrolide scaffolds Zin, Phyo Phyo Kyaw Williams, Gavin Fourches, Denis J Cheminform Research Article We report on the development of a cheminformatics enumeration technology and the analysis of a resulting large dataset of virtual macrolide scaffolds. Although macrolides have been shown to have valuable biological properties, there is no ready-to-screen virtual library of diverse macrolides in the public domain. Conducting molecular modeling (especially virtual screening) of these complex molecules is highly relevant as the organic synthesis of these compounds, when feasible, typically requires many synthetic steps, and thus dramatically slows the discovery of new bioactive macrolides. Herein, we introduce a cheminformatics approach and associated software that allows for designing and generating libraries of virtual macrocycle/macrolide scaffolds with user-defined constitutional and structural constraints (e.g., types and numbers of structural motifs to be included in the macrocycle, ring size, maximum number of compounds generated). To study the chemical diversity of such generated molecules, we enumerated V1M (Virtual 1 million Macrolide scaffolds) library, each containing twelve common structural motifs. For each macrolide scaffold, we calculated several key properties, such as molecular weight, hydrogen bond donors/acceptors, topological polar surface area. In this study, we discuss (1) the initial concept and current features of our PKS (polyketides) Enumerator software, (2) the chemical diversity and distribution of structural motifs in V1M library, and (3) the unique opportunities for future virtual screening of such enumerated ensembles of macrolides. Importantly, V1M is provided in the Supplementary Material of this paper allowing other researchers to conduct any type of molecular modeling and virtual screening studies. Therefore, this technology for enumerating extremely large libraries of macrolide scaffolds could hold a unique potential in the field of computational chemistry and drug discovery for rational designing of new antibiotics and anti-cancer agents. [Image: see text] ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13321-018-0307-6) contains supplementary material, which is available to authorized users. Springer International Publishing 2018-11-12 /pmc/articles/PMC6755550/ /pubmed/30421084 http://dx.doi.org/10.1186/s13321-018-0307-6 Text en © The Author(s) 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Zin, Phyo Phyo Kyaw
Williams, Gavin
Fourches, Denis
Cheminformatics-based enumeration and analysis of large libraries of macrolide scaffolds
title Cheminformatics-based enumeration and analysis of large libraries of macrolide scaffolds
title_full Cheminformatics-based enumeration and analysis of large libraries of macrolide scaffolds
title_fullStr Cheminformatics-based enumeration and analysis of large libraries of macrolide scaffolds
title_full_unstemmed Cheminformatics-based enumeration and analysis of large libraries of macrolide scaffolds
title_short Cheminformatics-based enumeration and analysis of large libraries of macrolide scaffolds
title_sort cheminformatics-based enumeration and analysis of large libraries of macrolide scaffolds
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6755550/
https://www.ncbi.nlm.nih.gov/pubmed/30421084
http://dx.doi.org/10.1186/s13321-018-0307-6
work_keys_str_mv AT zinphyophyokyaw cheminformaticsbasedenumerationandanalysisoflargelibrariesofmacrolidescaffolds
AT williamsgavin cheminformaticsbasedenumerationandanalysisoflargelibrariesofmacrolidescaffolds
AT fourchesdenis cheminformaticsbasedenumerationandanalysisoflargelibrariesofmacrolidescaffolds