Cargando…

QSPR model for Caco-2 cell permeability prediction using a combination of HQPSO and dual-RBF neural network

The Caco-2 cell model is widely used to evaluate the in vitro human intestinal permeability of drugs due to its morphological and functional similarity to human enterocytes. Although it is safe and relatively economic, it is time-consuming. A rapid and accurate quantitative structure-property relati...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yukun, Chen, Xuebo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society of Chemistry 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9058322/
https://www.ncbi.nlm.nih.gov/pubmed/35514900
http://dx.doi.org/10.1039/d0ra08209k
_version_ 1784698088617672704
author Wang, Yukun
Chen, Xuebo
author_facet Wang, Yukun
Chen, Xuebo
author_sort Wang, Yukun
collection PubMed
description The Caco-2 cell model is widely used to evaluate the in vitro human intestinal permeability of drugs due to its morphological and functional similarity to human enterocytes. Although it is safe and relatively economic, it is time-consuming. A rapid and accurate quantitative structure-property relationship (QSPR) model of Caco-2 permeability is helpful to improve the efficiency of oral drug development. The aim of our study is to explore the predictive ability of the QSPR model, to study its permeation mechanism, and to develop a potential permeability prediction model, for Caco-2 cells. In our study, a relatively large data set was collected and the abnormal data were eliminated using the Monte Carlo regression and hybrid quantum particle swarm optimization (HQPSO) algorithm. Then, the remaining 1827 compounds were used to establish QSPR models. To generate multiple chemically diverse training and test sets, we used a combination of principal component analysis (PCA) and self-organizing mapping (SOM) neural networks to split the modeling data set characterized by PaDEL-descriptors. After preliminary selection of descriptors by the mean decrease impurity (MDI) method, the HQPSO algorithm was used to select the key descriptors. Six different methods, namely, multivariate linear regression (MLR), support vector machine regression (SVR), xgboost, radial basis function (RBF) neural networks, dual-SVR and dual-RBF were employed to develop QSPR models. The best dual-RBF model was obtained finally with R(2) = 0.91, and R(cv5)(2) = 0.77, for the training set, and R(T)(2) = 0.77, for the test set. A series of validation methods were used to assess the robustness and predictive ability of the dual-RBF model under OECD principles. A new application domain (AD) definition method based on the descriptor importance-weighted and distance-based (IWD) method was proposed, and the outliers were analyzed carefully. Combined with the importance of the descriptors used in the dual-RBF model, we concluded that the “H E-state” and hydrogen bonds are important factors affecting the permeability of drugs passing through the Caco-2 cell. Compared with the reported studies, our method exhibits certain advantages in data size, transparency of modeling process and prediction accuracy to some extent, and is a promising tool for virtual screening in the early stage of drug development.
format Online
Article
Text
id pubmed-9058322
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher The Royal Society of Chemistry
record_format MEDLINE/PubMed
spelling pubmed-90583222022-05-04 QSPR model for Caco-2 cell permeability prediction using a combination of HQPSO and dual-RBF neural network Wang, Yukun Chen, Xuebo RSC Adv Chemistry The Caco-2 cell model is widely used to evaluate the in vitro human intestinal permeability of drugs due to its morphological and functional similarity to human enterocytes. Although it is safe and relatively economic, it is time-consuming. A rapid and accurate quantitative structure-property relationship (QSPR) model of Caco-2 permeability is helpful to improve the efficiency of oral drug development. The aim of our study is to explore the predictive ability of the QSPR model, to study its permeation mechanism, and to develop a potential permeability prediction model, for Caco-2 cells. In our study, a relatively large data set was collected and the abnormal data were eliminated using the Monte Carlo regression and hybrid quantum particle swarm optimization (HQPSO) algorithm. Then, the remaining 1827 compounds were used to establish QSPR models. To generate multiple chemically diverse training and test sets, we used a combination of principal component analysis (PCA) and self-organizing mapping (SOM) neural networks to split the modeling data set characterized by PaDEL-descriptors. After preliminary selection of descriptors by the mean decrease impurity (MDI) method, the HQPSO algorithm was used to select the key descriptors. Six different methods, namely, multivariate linear regression (MLR), support vector machine regression (SVR), xgboost, radial basis function (RBF) neural networks, dual-SVR and dual-RBF were employed to develop QSPR models. The best dual-RBF model was obtained finally with R(2) = 0.91, and R(cv5)(2) = 0.77, for the training set, and R(T)(2) = 0.77, for the test set. A series of validation methods were used to assess the robustness and predictive ability of the dual-RBF model under OECD principles. A new application domain (AD) definition method based on the descriptor importance-weighted and distance-based (IWD) method was proposed, and the outliers were analyzed carefully. Combined with the importance of the descriptors used in the dual-RBF model, we concluded that the “H E-state” and hydrogen bonds are important factors affecting the permeability of drugs passing through the Caco-2 cell. Compared with the reported studies, our method exhibits certain advantages in data size, transparency of modeling process and prediction accuracy to some extent, and is a promising tool for virtual screening in the early stage of drug development. The Royal Society of Chemistry 2020-11-26 /pmc/articles/PMC9058322/ /pubmed/35514900 http://dx.doi.org/10.1039/d0ra08209k Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by/3.0/
spellingShingle Chemistry
Wang, Yukun
Chen, Xuebo
QSPR model for Caco-2 cell permeability prediction using a combination of HQPSO and dual-RBF neural network
title QSPR model for Caco-2 cell permeability prediction using a combination of HQPSO and dual-RBF neural network
title_full QSPR model for Caco-2 cell permeability prediction using a combination of HQPSO and dual-RBF neural network
title_fullStr QSPR model for Caco-2 cell permeability prediction using a combination of HQPSO and dual-RBF neural network
title_full_unstemmed QSPR model for Caco-2 cell permeability prediction using a combination of HQPSO and dual-RBF neural network
title_short QSPR model for Caco-2 cell permeability prediction using a combination of HQPSO and dual-RBF neural network
title_sort qspr model for caco-2 cell permeability prediction using a combination of hqpso and dual-rbf neural network
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9058322/
https://www.ncbi.nlm.nih.gov/pubmed/35514900
http://dx.doi.org/10.1039/d0ra08209k
work_keys_str_mv AT wangyukun qsprmodelforcaco2cellpermeabilitypredictionusingacombinationofhqpsoanddualrbfneuralnetwork
AT chenxuebo qsprmodelforcaco2cellpermeabilitypredictionusingacombinationofhqpsoanddualrbfneuralnetwork