Cargando…

ADMET evaluation in drug discovery. 20. Prediction of breast cancer resistance protein inhibition through machine learning

Breast cancer resistance protein (BCRP/ABCG2), an ATP-binding cassette (ABC) efflux transporter, plays a critical role in multi-drug resistance (MDR) to anti-cancer drugs and drug–drug interactions. The prediction of BCRP inhibition can facilitate evaluating potential drug resistance and drug–drug i...

Descripción completa

Detalles Bibliográficos
Autores principales:	Jiang, Dejun, Lei, Tailong, Wang, Zhe, Shen, Chao, Cao, Dongsheng, Hou, Tingjun
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer International Publishing 2020
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7059329/ https://www.ncbi.nlm.nih.gov/pubmed/33430990 http://dx.doi.org/10.1186/s13321-020-00421-y

_version_	1783504027829403648
author	Jiang, Dejun Lei, Tailong Wang, Zhe Shen, Chao Cao, Dongsheng Hou, Tingjun
author_facet	Jiang, Dejun Lei, Tailong Wang, Zhe Shen, Chao Cao, Dongsheng Hou, Tingjun
author_sort	Jiang, Dejun
collection	PubMed
description	Breast cancer resistance protein (BCRP/ABCG2), an ATP-binding cassette (ABC) efflux transporter, plays a critical role in multi-drug resistance (MDR) to anti-cancer drugs and drug–drug interactions. The prediction of BCRP inhibition can facilitate evaluating potential drug resistance and drug–drug interactions in early stage of drug discovery. Here we reported a structurally diverse dataset consisting of 1098 BCRP inhibitors and 1701 non-inhibitors. Analysis of various physicochemical properties illustrates that BCRP inhibitors are more hydrophobic and aromatic than non-inhibitors. We then developed a series of quantitative structure–activity relationship (QSAR) models to discriminate between BCRP inhibitors and non-inhibitors. The optimal feature subset was determined by a wrapper feature selection method named rfSA (simulated annealing algorithm coupled with random forest), and the classification models were established by using seven machine learning approaches based on the optimal feature subset, including a deep learning method, two ensemble learning methods, and four classical machine learning methods. The statistical results demonstrated that three methods, including support vector machine (SVM), deep neural networks (DNN) and extreme gradient boosting (XGBoost), outperformed the others, and the SVM classifier yielded the best predictions (MCC = 0.812 and AUC = 0.958 for the test set). Then, a perturbation-based model-agnostic method was used to interpret our models and analyze the representative features for different models. The application domain analysis demonstrated the prediction reliability of our models. Moreover, the important structural fragments related to BCRP inhibition were identified by the information gain (IG) method along with the frequency analysis. In conclusion, we believe that the classification models developed in this study can be regarded as simple and accurate tools to distinguish BCRP inhibitors from non-inhibitors in drug design and discovery pipelines. [Image: see text]
format	Online Article Text
id	pubmed-7059329
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	Springer International Publishing
record_format	MEDLINE/PubMed
spelling	pubmed-70593292020-03-11 ADMET evaluation in drug discovery. 20. Prediction of breast cancer resistance protein inhibition through machine learning Jiang, Dejun Lei, Tailong Wang, Zhe Shen, Chao Cao, Dongsheng Hou, Tingjun J Cheminform Research Article Breast cancer resistance protein (BCRP/ABCG2), an ATP-binding cassette (ABC) efflux transporter, plays a critical role in multi-drug resistance (MDR) to anti-cancer drugs and drug–drug interactions. The prediction of BCRP inhibition can facilitate evaluating potential drug resistance and drug–drug interactions in early stage of drug discovery. Here we reported a structurally diverse dataset consisting of 1098 BCRP inhibitors and 1701 non-inhibitors. Analysis of various physicochemical properties illustrates that BCRP inhibitors are more hydrophobic and aromatic than non-inhibitors. We then developed a series of quantitative structure–activity relationship (QSAR) models to discriminate between BCRP inhibitors and non-inhibitors. The optimal feature subset was determined by a wrapper feature selection method named rfSA (simulated annealing algorithm coupled with random forest), and the classification models were established by using seven machine learning approaches based on the optimal feature subset, including a deep learning method, two ensemble learning methods, and four classical machine learning methods. The statistical results demonstrated that three methods, including support vector machine (SVM), deep neural networks (DNN) and extreme gradient boosting (XGBoost), outperformed the others, and the SVM classifier yielded the best predictions (MCC = 0.812 and AUC = 0.958 for the test set). Then, a perturbation-based model-agnostic method was used to interpret our models and analyze the representative features for different models. The application domain analysis demonstrated the prediction reliability of our models. Moreover, the important structural fragments related to BCRP inhibition were identified by the information gain (IG) method along with the frequency analysis. In conclusion, we believe that the classification models developed in this study can be regarded as simple and accurate tools to distinguish BCRP inhibitors from non-inhibitors in drug design and discovery pipelines. [Image: see text] Springer International Publishing 2020-03-05 /pmc/articles/PMC7059329/ /pubmed/33430990 http://dx.doi.org/10.1186/s13321-020-00421-y Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Research Article Jiang, Dejun Lei, Tailong Wang, Zhe Shen, Chao Cao, Dongsheng Hou, Tingjun ADMET evaluation in drug discovery. 20. Prediction of breast cancer resistance protein inhibition through machine learning
title	ADMET evaluation in drug discovery. 20. Prediction of breast cancer resistance protein inhibition through machine learning
title_full	ADMET evaluation in drug discovery. 20. Prediction of breast cancer resistance protein inhibition through machine learning
title_fullStr	ADMET evaluation in drug discovery. 20. Prediction of breast cancer resistance protein inhibition through machine learning
title_full_unstemmed	ADMET evaluation in drug discovery. 20. Prediction of breast cancer resistance protein inhibition through machine learning
title_short	ADMET evaluation in drug discovery. 20. Prediction of breast cancer resistance protein inhibition through machine learning
title_sort	admet evaluation in drug discovery. 20. prediction of breast cancer resistance protein inhibition through machine learning
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7059329/ https://www.ncbi.nlm.nih.gov/pubmed/33430990 http://dx.doi.org/10.1186/s13321-020-00421-y
work_keys_str_mv	AT jiangdejun admetevaluationindrugdiscovery20predictionofbreastcancerresistanceproteininhibitionthroughmachinelearning AT leitailong admetevaluationindrugdiscovery20predictionofbreastcancerresistanceproteininhibitionthroughmachinelearning AT wangzhe admetevaluationindrugdiscovery20predictionofbreastcancerresistanceproteininhibitionthroughmachinelearning AT shenchao admetevaluationindrugdiscovery20predictionofbreastcancerresistanceproteininhibitionthroughmachinelearning AT caodongsheng admetevaluationindrugdiscovery20predictionofbreastcancerresistanceproteininhibitionthroughmachinelearning AT houtingjun admetevaluationindrugdiscovery20predictionofbreastcancerresistanceproteininhibitionthroughmachinelearning

ADMET evaluation in drug discovery. 20. Prediction of breast cancer resistance protein inhibition through machine learning

Ejemplares similares