Cargando…
QSPR model for Caco-2 cell permeability prediction using a combination of HQPSO and dual-RBF neural network
The Caco-2 cell model is widely used to evaluate the in vitro human intestinal permeability of drugs due to its morphological and functional similarity to human enterocytes. Although it is safe and relatively economic, it is time-consuming. A rapid and accurate quantitative structure-property relati...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Royal Society of Chemistry
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9058322/ https://www.ncbi.nlm.nih.gov/pubmed/35514900 http://dx.doi.org/10.1039/d0ra08209k |
_version_ | 1784698088617672704 |
---|---|
author | Wang, Yukun Chen, Xuebo |
author_facet | Wang, Yukun Chen, Xuebo |
author_sort | Wang, Yukun |
collection | PubMed |
description | The Caco-2 cell model is widely used to evaluate the in vitro human intestinal permeability of drugs due to its morphological and functional similarity to human enterocytes. Although it is safe and relatively economic, it is time-consuming. A rapid and accurate quantitative structure-property relationship (QSPR) model of Caco-2 permeability is helpful to improve the efficiency of oral drug development. The aim of our study is to explore the predictive ability of the QSPR model, to study its permeation mechanism, and to develop a potential permeability prediction model, for Caco-2 cells. In our study, a relatively large data set was collected and the abnormal data were eliminated using the Monte Carlo regression and hybrid quantum particle swarm optimization (HQPSO) algorithm. Then, the remaining 1827 compounds were used to establish QSPR models. To generate multiple chemically diverse training and test sets, we used a combination of principal component analysis (PCA) and self-organizing mapping (SOM) neural networks to split the modeling data set characterized by PaDEL-descriptors. After preliminary selection of descriptors by the mean decrease impurity (MDI) method, the HQPSO algorithm was used to select the key descriptors. Six different methods, namely, multivariate linear regression (MLR), support vector machine regression (SVR), xgboost, radial basis function (RBF) neural networks, dual-SVR and dual-RBF were employed to develop QSPR models. The best dual-RBF model was obtained finally with R(2) = 0.91, and R(cv5)(2) = 0.77, for the training set, and R(T)(2) = 0.77, for the test set. A series of validation methods were used to assess the robustness and predictive ability of the dual-RBF model under OECD principles. A new application domain (AD) definition method based on the descriptor importance-weighted and distance-based (IWD) method was proposed, and the outliers were analyzed carefully. Combined with the importance of the descriptors used in the dual-RBF model, we concluded that the “H E-state” and hydrogen bonds are important factors affecting the permeability of drugs passing through the Caco-2 cell. Compared with the reported studies, our method exhibits certain advantages in data size, transparency of modeling process and prediction accuracy to some extent, and is a promising tool for virtual screening in the early stage of drug development. |
format | Online Article Text |
id | pubmed-9058322 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | The Royal Society of Chemistry |
record_format | MEDLINE/PubMed |
spelling | pubmed-90583222022-05-04 QSPR model for Caco-2 cell permeability prediction using a combination of HQPSO and dual-RBF neural network Wang, Yukun Chen, Xuebo RSC Adv Chemistry The Caco-2 cell model is widely used to evaluate the in vitro human intestinal permeability of drugs due to its morphological and functional similarity to human enterocytes. Although it is safe and relatively economic, it is time-consuming. A rapid and accurate quantitative structure-property relationship (QSPR) model of Caco-2 permeability is helpful to improve the efficiency of oral drug development. The aim of our study is to explore the predictive ability of the QSPR model, to study its permeation mechanism, and to develop a potential permeability prediction model, for Caco-2 cells. In our study, a relatively large data set was collected and the abnormal data were eliminated using the Monte Carlo regression and hybrid quantum particle swarm optimization (HQPSO) algorithm. Then, the remaining 1827 compounds were used to establish QSPR models. To generate multiple chemically diverse training and test sets, we used a combination of principal component analysis (PCA) and self-organizing mapping (SOM) neural networks to split the modeling data set characterized by PaDEL-descriptors. After preliminary selection of descriptors by the mean decrease impurity (MDI) method, the HQPSO algorithm was used to select the key descriptors. Six different methods, namely, multivariate linear regression (MLR), support vector machine regression (SVR), xgboost, radial basis function (RBF) neural networks, dual-SVR and dual-RBF were employed to develop QSPR models. The best dual-RBF model was obtained finally with R(2) = 0.91, and R(cv5)(2) = 0.77, for the training set, and R(T)(2) = 0.77, for the test set. A series of validation methods were used to assess the robustness and predictive ability of the dual-RBF model under OECD principles. A new application domain (AD) definition method based on the descriptor importance-weighted and distance-based (IWD) method was proposed, and the outliers were analyzed carefully. Combined with the importance of the descriptors used in the dual-RBF model, we concluded that the “H E-state” and hydrogen bonds are important factors affecting the permeability of drugs passing through the Caco-2 cell. Compared with the reported studies, our method exhibits certain advantages in data size, transparency of modeling process and prediction accuracy to some extent, and is a promising tool for virtual screening in the early stage of drug development. The Royal Society of Chemistry 2020-11-26 /pmc/articles/PMC9058322/ /pubmed/35514900 http://dx.doi.org/10.1039/d0ra08209k Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by/3.0/ |
spellingShingle | Chemistry Wang, Yukun Chen, Xuebo QSPR model for Caco-2 cell permeability prediction using a combination of HQPSO and dual-RBF neural network |
title | QSPR model for Caco-2 cell permeability prediction using a combination of HQPSO and dual-RBF neural network |
title_full | QSPR model for Caco-2 cell permeability prediction using a combination of HQPSO and dual-RBF neural network |
title_fullStr | QSPR model for Caco-2 cell permeability prediction using a combination of HQPSO and dual-RBF neural network |
title_full_unstemmed | QSPR model for Caco-2 cell permeability prediction using a combination of HQPSO and dual-RBF neural network |
title_short | QSPR model for Caco-2 cell permeability prediction using a combination of HQPSO and dual-RBF neural network |
title_sort | qspr model for caco-2 cell permeability prediction using a combination of hqpso and dual-rbf neural network |
topic | Chemistry |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9058322/ https://www.ncbi.nlm.nih.gov/pubmed/35514900 http://dx.doi.org/10.1039/d0ra08209k |
work_keys_str_mv | AT wangyukun qsprmodelforcaco2cellpermeabilitypredictionusingacombinationofhqpsoanddualrbfneuralnetwork AT chenxuebo qsprmodelforcaco2cellpermeabilitypredictionusingacombinationofhqpsoanddualrbfneuralnetwork |