Cargando…

Machine Learning-Assisted Prediction of the Biological Activity of Aromatase Inhibitors and Data Mining to Explore Similar Compounds

[Image: see text] Designing molecules for drugs has been a hot topic for many decades. However, it is hard and expensive to find a new molecule. Thus, the cost of the final drug is also increased. Machine learning can provide the fastest way to predict the biological activity of druglike molecules....

Descripción completa

Detalles Bibliográficos
Autores principales: Ishfaq, Muhammad, Aamir, Muhammad, Ahmad, Farooq, M Mebed, Abdelazim, Elshahat, Sayed
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2022
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9798507/
https://www.ncbi.nlm.nih.gov/pubmed/36591131
http://dx.doi.org/10.1021/acsomega.2c06174
_version_ 1784860918969008128
author Ishfaq, Muhammad
Aamir, Muhammad
Ahmad, Farooq
M Mebed, Abdelazim
Elshahat, Sayed
author_facet Ishfaq, Muhammad
Aamir, Muhammad
Ahmad, Farooq
M Mebed, Abdelazim
Elshahat, Sayed
author_sort Ishfaq, Muhammad
collection PubMed
description [Image: see text] Designing molecules for drugs has been a hot topic for many decades. However, it is hard and expensive to find a new molecule. Thus, the cost of the final drug is also increased. Machine learning can provide the fastest way to predict the biological activity of druglike molecules. In the present work, machine learning models are trained for the prediction of the biological activity of aromatase inhibitors. Data was collected from the literature. Molecular descriptors are calculated to be used as independent features for model training. The results showed that the R(2) values for linear regression, random forest regression, gradient boosting regression, and bagging regression are 0.58, 0.84, 0.77, and 0.80, respectively. Using these models, it is possible to predict the activity of new molecules in a short period of time and at a reasonable cost. Furthermore, Tanimoto similarity is used for similarity analysis, as well as a chemical database is mined to search for similar molecules. Nonetheless, this study provides a framework for repurposing other effective drug molecules to prevent cancer.
format Online
Article
Text
id pubmed-9798507
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-97985072022-12-30 Machine Learning-Assisted Prediction of the Biological Activity of Aromatase Inhibitors and Data Mining to Explore Similar Compounds Ishfaq, Muhammad Aamir, Muhammad Ahmad, Farooq M Mebed, Abdelazim Elshahat, Sayed ACS Omega [Image: see text] Designing molecules for drugs has been a hot topic for many decades. However, it is hard and expensive to find a new molecule. Thus, the cost of the final drug is also increased. Machine learning can provide the fastest way to predict the biological activity of druglike molecules. In the present work, machine learning models are trained for the prediction of the biological activity of aromatase inhibitors. Data was collected from the literature. Molecular descriptors are calculated to be used as independent features for model training. The results showed that the R(2) values for linear regression, random forest regression, gradient boosting regression, and bagging regression are 0.58, 0.84, 0.77, and 0.80, respectively. Using these models, it is possible to predict the activity of new molecules in a short period of time and at a reasonable cost. Furthermore, Tanimoto similarity is used for similarity analysis, as well as a chemical database is mined to search for similar molecules. Nonetheless, this study provides a framework for repurposing other effective drug molecules to prevent cancer. American Chemical Society 2022-12-13 /pmc/articles/PMC9798507/ /pubmed/36591131 http://dx.doi.org/10.1021/acsomega.2c06174 Text en © 2022 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Ishfaq, Muhammad
Aamir, Muhammad
Ahmad, Farooq
M Mebed, Abdelazim
Elshahat, Sayed
Machine Learning-Assisted Prediction of the Biological Activity of Aromatase Inhibitors and Data Mining to Explore Similar Compounds
title Machine Learning-Assisted Prediction of the Biological Activity of Aromatase Inhibitors and Data Mining to Explore Similar Compounds
title_full Machine Learning-Assisted Prediction of the Biological Activity of Aromatase Inhibitors and Data Mining to Explore Similar Compounds
title_fullStr Machine Learning-Assisted Prediction of the Biological Activity of Aromatase Inhibitors and Data Mining to Explore Similar Compounds
title_full_unstemmed Machine Learning-Assisted Prediction of the Biological Activity of Aromatase Inhibitors and Data Mining to Explore Similar Compounds
title_short Machine Learning-Assisted Prediction of the Biological Activity of Aromatase Inhibitors and Data Mining to Explore Similar Compounds
title_sort machine learning-assisted prediction of the biological activity of aromatase inhibitors and data mining to explore similar compounds
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9798507/
https://www.ncbi.nlm.nih.gov/pubmed/36591131
http://dx.doi.org/10.1021/acsomega.2c06174
work_keys_str_mv AT ishfaqmuhammad machinelearningassistedpredictionofthebiologicalactivityofaromataseinhibitorsanddataminingtoexploresimilarcompounds
AT aamirmuhammad machinelearningassistedpredictionofthebiologicalactivityofaromataseinhibitorsanddataminingtoexploresimilarcompounds
AT ahmadfarooq machinelearningassistedpredictionofthebiologicalactivityofaromataseinhibitorsanddataminingtoexploresimilarcompounds
AT mmebedabdelazim machinelearningassistedpredictionofthebiologicalactivityofaromataseinhibitorsanddataminingtoexploresimilarcompounds
AT elshahatsayed machinelearningassistedpredictionofthebiologicalactivityofaromataseinhibitorsanddataminingtoexploresimilarcompounds