Cargando…

Cheminformatic Analysis and Machine Learning Modeling to Investigate Androgen Receptor Antagonists to Combat Prostate Cancer

[Image: see text] Prostate cancer (PCa) is a major leading cause of mortality of cancer among males. There have been numerous studies to develop antagonists against androgen receptor (AR), a crucial therapeutic target for PCa. This study is a systematic cheminformatic analysis and machine learning m...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Tianshi, Nantasenamat, Chanin, Kachenton, Supicha, Anuwongcharoen, Nuttapat, Piacham, Theeraphon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2023
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9948163/
https://www.ncbi.nlm.nih.gov/pubmed/36844574
http://dx.doi.org/10.1021/acsomega.2c07346
_version_ 1784892718600683520
author Yu, Tianshi
Nantasenamat, Chanin
Kachenton, Supicha
Anuwongcharoen, Nuttapat
Piacham, Theeraphon
author_facet Yu, Tianshi
Nantasenamat, Chanin
Kachenton, Supicha
Anuwongcharoen, Nuttapat
Piacham, Theeraphon
author_sort Yu, Tianshi
collection PubMed
description [Image: see text] Prostate cancer (PCa) is a major leading cause of mortality of cancer among males. There have been numerous studies to develop antagonists against androgen receptor (AR), a crucial therapeutic target for PCa. This study is a systematic cheminformatic analysis and machine learning modeling to study the chemical space, scaffolds, structure–activity relationship, and landscape of human AR antagonists. There are 1678 molecules as final data sets. Chemical space visualization by physicochemical property visualization has demonstrated that molecules from the potent/active class generally have a mildly smaller molecular weight (MW), octanol–water partition coefficient (log P), number of hydrogen-bond acceptors (nHA), number of rotatable bonds (nRot), and topological polar surface area (TPSA) than molecules from intermediate/inactive class. The chemical space visualization in the principal component analysis (PCA) plot shows significant overlapping distributions between potent/active class molecules and intermediate/inactive class molecules; potent/active class molecules are intensively distributed, while intermediate/inactive class molecules are widely and sparsely distributed. Murcko scaffold analysis has shown low scaffold diversity in general, and scaffold diversity of potent/active class molecules is even lower than intermediate/inactive class molecules, indicating the necessity for developing molecules with novel scaffolds. Furthermore, scaffold visualization has identified 16 representative Murcko scaffolds. Among them, scaffolds 1, 2, 3, 4, 7, 8, 10, 11, 15, and 16 are highly favorable scaffolds due to their high scaffold enrichment factor values. Based on scaffold analysis, their local structure–activity relationships (SARs) were investigated and summarized. In addition, the global SAR landscape was explored by quantitative structure–activity relationship (QSAR) modelings and structure–activity landscape visualization. A QSAR classification model incorporating all of the 1678 molecules stands out as the best model from a total of 12 candidate models for AR antagonists (built on PubChem fingerprint, extra trees algorithm, accuracy for training set: 0.935, 10-fold cross-validation set: 0.735 and test set: 0.756). Deeper insights into the structure–activity landscape highlighted a total of seven significant activity cliff (AC) generators (ChEMBL molecule IDs: 160257, 418198, 4082265, 348918, 390728, 4080698, and 6530), which provide valuable SAR information for medicinal chemistry. The findings in this study provide new insights and guidelines for hit identification and lead optimization for the development of novel AR antagonists.
format Online
Article
Text
id pubmed-9948163
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-99481632023-02-24 Cheminformatic Analysis and Machine Learning Modeling to Investigate Androgen Receptor Antagonists to Combat Prostate Cancer Yu, Tianshi Nantasenamat, Chanin Kachenton, Supicha Anuwongcharoen, Nuttapat Piacham, Theeraphon ACS Omega [Image: see text] Prostate cancer (PCa) is a major leading cause of mortality of cancer among males. There have been numerous studies to develop antagonists against androgen receptor (AR), a crucial therapeutic target for PCa. This study is a systematic cheminformatic analysis and machine learning modeling to study the chemical space, scaffolds, structure–activity relationship, and landscape of human AR antagonists. There are 1678 molecules as final data sets. Chemical space visualization by physicochemical property visualization has demonstrated that molecules from the potent/active class generally have a mildly smaller molecular weight (MW), octanol–water partition coefficient (log P), number of hydrogen-bond acceptors (nHA), number of rotatable bonds (nRot), and topological polar surface area (TPSA) than molecules from intermediate/inactive class. The chemical space visualization in the principal component analysis (PCA) plot shows significant overlapping distributions between potent/active class molecules and intermediate/inactive class molecules; potent/active class molecules are intensively distributed, while intermediate/inactive class molecules are widely and sparsely distributed. Murcko scaffold analysis has shown low scaffold diversity in general, and scaffold diversity of potent/active class molecules is even lower than intermediate/inactive class molecules, indicating the necessity for developing molecules with novel scaffolds. Furthermore, scaffold visualization has identified 16 representative Murcko scaffolds. Among them, scaffolds 1, 2, 3, 4, 7, 8, 10, 11, 15, and 16 are highly favorable scaffolds due to their high scaffold enrichment factor values. Based on scaffold analysis, their local structure–activity relationships (SARs) were investigated and summarized. In addition, the global SAR landscape was explored by quantitative structure–activity relationship (QSAR) modelings and structure–activity landscape visualization. A QSAR classification model incorporating all of the 1678 molecules stands out as the best model from a total of 12 candidate models for AR antagonists (built on PubChem fingerprint, extra trees algorithm, accuracy for training set: 0.935, 10-fold cross-validation set: 0.735 and test set: 0.756). Deeper insights into the structure–activity landscape highlighted a total of seven significant activity cliff (AC) generators (ChEMBL molecule IDs: 160257, 418198, 4082265, 348918, 390728, 4080698, and 6530), which provide valuable SAR information for medicinal chemistry. The findings in this study provide new insights and guidelines for hit identification and lead optimization for the development of novel AR antagonists. American Chemical Society 2023-02-13 /pmc/articles/PMC9948163/ /pubmed/36844574 http://dx.doi.org/10.1021/acsomega.2c07346 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Yu, Tianshi
Nantasenamat, Chanin
Kachenton, Supicha
Anuwongcharoen, Nuttapat
Piacham, Theeraphon
Cheminformatic Analysis and Machine Learning Modeling to Investigate Androgen Receptor Antagonists to Combat Prostate Cancer
title Cheminformatic Analysis and Machine Learning Modeling to Investigate Androgen Receptor Antagonists to Combat Prostate Cancer
title_full Cheminformatic Analysis and Machine Learning Modeling to Investigate Androgen Receptor Antagonists to Combat Prostate Cancer
title_fullStr Cheminformatic Analysis and Machine Learning Modeling to Investigate Androgen Receptor Antagonists to Combat Prostate Cancer
title_full_unstemmed Cheminformatic Analysis and Machine Learning Modeling to Investigate Androgen Receptor Antagonists to Combat Prostate Cancer
title_short Cheminformatic Analysis and Machine Learning Modeling to Investigate Androgen Receptor Antagonists to Combat Prostate Cancer
title_sort cheminformatic analysis and machine learning modeling to investigate androgen receptor antagonists to combat prostate cancer
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9948163/
https://www.ncbi.nlm.nih.gov/pubmed/36844574
http://dx.doi.org/10.1021/acsomega.2c07346
work_keys_str_mv AT yutianshi cheminformaticanalysisandmachinelearningmodelingtoinvestigateandrogenreceptorantagoniststocombatprostatecancer
AT nantasenamatchanin cheminformaticanalysisandmachinelearningmodelingtoinvestigateandrogenreceptorantagoniststocombatprostatecancer
AT kachentonsupicha cheminformaticanalysisandmachinelearningmodelingtoinvestigateandrogenreceptorantagoniststocombatprostatecancer
AT anuwongcharoennuttapat cheminformaticanalysisandmachinelearningmodelingtoinvestigateandrogenreceptorantagoniststocombatprostatecancer
AT piachamtheeraphon cheminformaticanalysisandmachinelearningmodelingtoinvestigateandrogenreceptorantagoniststocombatprostatecancer