Cargando…
Cheminformatics Analysis and Modeling with MacrolactoneDB
Macrolactones, macrocyclic lactones with at least twelve atoms within the core ring, include diverse natural products such as macrolides with potent bioactivities (e.g. antibiotics) and useful drug-like characteristics. We have developed MacrolactoneDB, which integrates nearly 14,000 existing macrol...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7156526/ https://www.ncbi.nlm.nih.gov/pubmed/32286395 http://dx.doi.org/10.1038/s41598-020-63192-4 |
_version_ | 1783522227129417728 |
---|---|
author | Zin, Phyo Phyo Kyaw Williams, Gavin J. Ekins, Sean |
author_facet | Zin, Phyo Phyo Kyaw Williams, Gavin J. Ekins, Sean |
author_sort | Zin, Phyo Phyo Kyaw |
collection | PubMed |
description | Macrolactones, macrocyclic lactones with at least twelve atoms within the core ring, include diverse natural products such as macrolides with potent bioactivities (e.g. antibiotics) and useful drug-like characteristics. We have developed MacrolactoneDB, which integrates nearly 14,000 existing macrolactones and their bioactivity information from different public databases, and new molecular descriptors to better characterize macrolide structures. The chemical distribution of MacrolactoneDB was analyzed in terms of important molecular properties and we have utilized three targets of interest (Plasmodium falciparum, Hepatitis C virus and T-cells) to demonstrate the value of compiling this data. Regression machine learning models were generated to predict biological endpoints using seven molecular descriptor sets and eight machine learning algorithms. Our results show that merging descriptors yields the best predictive power with Random Forest models, often boosted by consensus or hybrid modeling approaches. Our study provides cheminformatics insights into this privileged, underexplored structural class of compounds with high therapeutic potential. |
format | Online Article Text |
id | pubmed-7156526 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-71565262020-04-19 Cheminformatics Analysis and Modeling with MacrolactoneDB Zin, Phyo Phyo Kyaw Williams, Gavin J. Ekins, Sean Sci Rep Article Macrolactones, macrocyclic lactones with at least twelve atoms within the core ring, include diverse natural products such as macrolides with potent bioactivities (e.g. antibiotics) and useful drug-like characteristics. We have developed MacrolactoneDB, which integrates nearly 14,000 existing macrolactones and their bioactivity information from different public databases, and new molecular descriptors to better characterize macrolide structures. The chemical distribution of MacrolactoneDB was analyzed in terms of important molecular properties and we have utilized three targets of interest (Plasmodium falciparum, Hepatitis C virus and T-cells) to demonstrate the value of compiling this data. Regression machine learning models were generated to predict biological endpoints using seven molecular descriptor sets and eight machine learning algorithms. Our results show that merging descriptors yields the best predictive power with Random Forest models, often boosted by consensus or hybrid modeling approaches. Our study provides cheminformatics insights into this privileged, underexplored structural class of compounds with high therapeutic potential. Nature Publishing Group UK 2020-04-14 /pmc/articles/PMC7156526/ /pubmed/32286395 http://dx.doi.org/10.1038/s41598-020-63192-4 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Zin, Phyo Phyo Kyaw Williams, Gavin J. Ekins, Sean Cheminformatics Analysis and Modeling with MacrolactoneDB |
title | Cheminformatics Analysis and Modeling with MacrolactoneDB |
title_full | Cheminformatics Analysis and Modeling with MacrolactoneDB |
title_fullStr | Cheminformatics Analysis and Modeling with MacrolactoneDB |
title_full_unstemmed | Cheminformatics Analysis and Modeling with MacrolactoneDB |
title_short | Cheminformatics Analysis and Modeling with MacrolactoneDB |
title_sort | cheminformatics analysis and modeling with macrolactonedb |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7156526/ https://www.ncbi.nlm.nih.gov/pubmed/32286395 http://dx.doi.org/10.1038/s41598-020-63192-4 |
work_keys_str_mv | AT zinphyophyokyaw cheminformaticsanalysisandmodelingwithmacrolactonedb AT williamsgavinj cheminformaticsanalysisandmodelingwithmacrolactonedb AT ekinssean cheminformaticsanalysisandmodelingwithmacrolactonedb |