Cargando…

Cheminformatics Analysis and Modeling with MacrolactoneDB

Macrolactones, macrocyclic lactones with at least twelve atoms within the core ring, include diverse natural products such as macrolides with potent bioactivities (e.g. antibiotics) and useful drug-like characteristics. We have developed MacrolactoneDB, which integrates nearly 14,000 existing macrol...

Descripción completa

Detalles Bibliográficos
Autores principales: Zin, Phyo Phyo Kyaw, Williams, Gavin J., Ekins, Sean
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7156526/
https://www.ncbi.nlm.nih.gov/pubmed/32286395
http://dx.doi.org/10.1038/s41598-020-63192-4
_version_ 1783522227129417728
author Zin, Phyo Phyo Kyaw
Williams, Gavin J.
Ekins, Sean
author_facet Zin, Phyo Phyo Kyaw
Williams, Gavin J.
Ekins, Sean
author_sort Zin, Phyo Phyo Kyaw
collection PubMed
description Macrolactones, macrocyclic lactones with at least twelve atoms within the core ring, include diverse natural products such as macrolides with potent bioactivities (e.g. antibiotics) and useful drug-like characteristics. We have developed MacrolactoneDB, which integrates nearly 14,000 existing macrolactones and their bioactivity information from different public databases, and new molecular descriptors to better characterize macrolide structures. The chemical distribution of MacrolactoneDB was analyzed in terms of important molecular properties and we have utilized three targets of interest (Plasmodium falciparum, Hepatitis C virus and T-cells) to demonstrate the value of compiling this data. Regression machine learning models were generated to predict biological endpoints using seven molecular descriptor sets and eight machine learning algorithms. Our results show that merging descriptors yields the best predictive power with Random Forest models, often boosted by consensus or hybrid modeling approaches. Our study provides cheminformatics insights into this privileged, underexplored structural class of compounds with high therapeutic potential.
format Online
Article
Text
id pubmed-7156526
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-71565262020-04-19 Cheminformatics Analysis and Modeling with MacrolactoneDB Zin, Phyo Phyo Kyaw Williams, Gavin J. Ekins, Sean Sci Rep Article Macrolactones, macrocyclic lactones with at least twelve atoms within the core ring, include diverse natural products such as macrolides with potent bioactivities (e.g. antibiotics) and useful drug-like characteristics. We have developed MacrolactoneDB, which integrates nearly 14,000 existing macrolactones and their bioactivity information from different public databases, and new molecular descriptors to better characterize macrolide structures. The chemical distribution of MacrolactoneDB was analyzed in terms of important molecular properties and we have utilized three targets of interest (Plasmodium falciparum, Hepatitis C virus and T-cells) to demonstrate the value of compiling this data. Regression machine learning models were generated to predict biological endpoints using seven molecular descriptor sets and eight machine learning algorithms. Our results show that merging descriptors yields the best predictive power with Random Forest models, often boosted by consensus or hybrid modeling approaches. Our study provides cheminformatics insights into this privileged, underexplored structural class of compounds with high therapeutic potential. Nature Publishing Group UK 2020-04-14 /pmc/articles/PMC7156526/ /pubmed/32286395 http://dx.doi.org/10.1038/s41598-020-63192-4 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Zin, Phyo Phyo Kyaw
Williams, Gavin J.
Ekins, Sean
Cheminformatics Analysis and Modeling with MacrolactoneDB
title Cheminformatics Analysis and Modeling with MacrolactoneDB
title_full Cheminformatics Analysis and Modeling with MacrolactoneDB
title_fullStr Cheminformatics Analysis and Modeling with MacrolactoneDB
title_full_unstemmed Cheminformatics Analysis and Modeling with MacrolactoneDB
title_short Cheminformatics Analysis and Modeling with MacrolactoneDB
title_sort cheminformatics analysis and modeling with macrolactonedb
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7156526/
https://www.ncbi.nlm.nih.gov/pubmed/32286395
http://dx.doi.org/10.1038/s41598-020-63192-4
work_keys_str_mv AT zinphyophyokyaw cheminformaticsanalysisandmodelingwithmacrolactonedb
AT williamsgavinj cheminformaticsanalysisandmodelingwithmacrolactonedb
AT ekinssean cheminformaticsanalysisandmodelingwithmacrolactonedb