Cargando…
Machine Learning-Assisted Prediction of the Biological Activity of Aromatase Inhibitors and Data Mining to Explore Similar Compounds
[Image: see text] Designing molecules for drugs has been a hot topic for many decades. However, it is hard and expensive to find a new molecule. Thus, the cost of the final drug is also increased. Machine learning can provide the fastest way to predict the biological activity of druglike molecules....
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2022
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9798507/ https://www.ncbi.nlm.nih.gov/pubmed/36591131 http://dx.doi.org/10.1021/acsomega.2c06174 |
_version_ | 1784860918969008128 |
---|---|
author | Ishfaq, Muhammad Aamir, Muhammad Ahmad, Farooq M Mebed, Abdelazim Elshahat, Sayed |
author_facet | Ishfaq, Muhammad Aamir, Muhammad Ahmad, Farooq M Mebed, Abdelazim Elshahat, Sayed |
author_sort | Ishfaq, Muhammad |
collection | PubMed |
description | [Image: see text] Designing molecules for drugs has been a hot topic for many decades. However, it is hard and expensive to find a new molecule. Thus, the cost of the final drug is also increased. Machine learning can provide the fastest way to predict the biological activity of druglike molecules. In the present work, machine learning models are trained for the prediction of the biological activity of aromatase inhibitors. Data was collected from the literature. Molecular descriptors are calculated to be used as independent features for model training. The results showed that the R(2) values for linear regression, random forest regression, gradient boosting regression, and bagging regression are 0.58, 0.84, 0.77, and 0.80, respectively. Using these models, it is possible to predict the activity of new molecules in a short period of time and at a reasonable cost. Furthermore, Tanimoto similarity is used for similarity analysis, as well as a chemical database is mined to search for similar molecules. Nonetheless, this study provides a framework for repurposing other effective drug molecules to prevent cancer. |
format | Online Article Text |
id | pubmed-9798507 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-97985072022-12-30 Machine Learning-Assisted Prediction of the Biological Activity of Aromatase Inhibitors and Data Mining to Explore Similar Compounds Ishfaq, Muhammad Aamir, Muhammad Ahmad, Farooq M Mebed, Abdelazim Elshahat, Sayed ACS Omega [Image: see text] Designing molecules for drugs has been a hot topic for many decades. However, it is hard and expensive to find a new molecule. Thus, the cost of the final drug is also increased. Machine learning can provide the fastest way to predict the biological activity of druglike molecules. In the present work, machine learning models are trained for the prediction of the biological activity of aromatase inhibitors. Data was collected from the literature. Molecular descriptors are calculated to be used as independent features for model training. The results showed that the R(2) values for linear regression, random forest regression, gradient boosting regression, and bagging regression are 0.58, 0.84, 0.77, and 0.80, respectively. Using these models, it is possible to predict the activity of new molecules in a short period of time and at a reasonable cost. Furthermore, Tanimoto similarity is used for similarity analysis, as well as a chemical database is mined to search for similar molecules. Nonetheless, this study provides a framework for repurposing other effective drug molecules to prevent cancer. American Chemical Society 2022-12-13 /pmc/articles/PMC9798507/ /pubmed/36591131 http://dx.doi.org/10.1021/acsomega.2c06174 Text en © 2022 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Ishfaq, Muhammad Aamir, Muhammad Ahmad, Farooq M Mebed, Abdelazim Elshahat, Sayed Machine Learning-Assisted Prediction of the Biological Activity of Aromatase Inhibitors and Data Mining to Explore Similar Compounds |
title | Machine Learning-Assisted Prediction of the Biological
Activity of Aromatase Inhibitors and Data Mining to Explore Similar
Compounds |
title_full | Machine Learning-Assisted Prediction of the Biological
Activity of Aromatase Inhibitors and Data Mining to Explore Similar
Compounds |
title_fullStr | Machine Learning-Assisted Prediction of the Biological
Activity of Aromatase Inhibitors and Data Mining to Explore Similar
Compounds |
title_full_unstemmed | Machine Learning-Assisted Prediction of the Biological
Activity of Aromatase Inhibitors and Data Mining to Explore Similar
Compounds |
title_short | Machine Learning-Assisted Prediction of the Biological
Activity of Aromatase Inhibitors and Data Mining to Explore Similar
Compounds |
title_sort | machine learning-assisted prediction of the biological
activity of aromatase inhibitors and data mining to explore similar
compounds |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9798507/ https://www.ncbi.nlm.nih.gov/pubmed/36591131 http://dx.doi.org/10.1021/acsomega.2c06174 |
work_keys_str_mv | AT ishfaqmuhammad machinelearningassistedpredictionofthebiologicalactivityofaromataseinhibitorsanddataminingtoexploresimilarcompounds AT aamirmuhammad machinelearningassistedpredictionofthebiologicalactivityofaromataseinhibitorsanddataminingtoexploresimilarcompounds AT ahmadfarooq machinelearningassistedpredictionofthebiologicalactivityofaromataseinhibitorsanddataminingtoexploresimilarcompounds AT mmebedabdelazim machinelearningassistedpredictionofthebiologicalactivityofaromataseinhibitorsanddataminingtoexploresimilarcompounds AT elshahatsayed machinelearningassistedpredictionofthebiologicalactivityofaromataseinhibitorsanddataminingtoexploresimilarcompounds |