Cargando…
Reliable Prediction of Caco-2 Permeability by Supervised Recursive Machine Learning Approaches
The heterogeneity of the Caco-2 cell line and differences in experimental protocols for permeability assessment using this cell-based method have resulted in the high variability of Caco-2 permeability measurements. These problems have limited the generation of large datasets to develop accurate and...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9610902/ https://www.ncbi.nlm.nih.gov/pubmed/36297432 http://dx.doi.org/10.3390/pharmaceutics14101998 |
_version_ | 1784819392866942976 |
---|---|
author | Falcón-Cano, Gabriela Molina, Christophe Cabrera-Pérez, Miguel Ángel |
author_facet | Falcón-Cano, Gabriela Molina, Christophe Cabrera-Pérez, Miguel Ángel |
author_sort | Falcón-Cano, Gabriela |
collection | PubMed |
description | The heterogeneity of the Caco-2 cell line and differences in experimental protocols for permeability assessment using this cell-based method have resulted in the high variability of Caco-2 permeability measurements. These problems have limited the generation of large datasets to develop accurate and applicable regression models. This study presents a QSPR approach developed on the KNIME analytical platform and based on a structurally diverse dataset of over 4900 molecules. Interpretable models were obtained using random forest supervised recursive algorithms for data cleaning and feature selection. The development of a conditional consensus model based on regional and global regression random forest produced models with RMSE values between 0.43–0.51 for all validation sets. The potential applicability of the model as a surrogate for the in vitro Caco-2 assay was demonstrated through blind prediction of 32 drugs recommended by the International Council for the Harmonization of Technical Requirements for Pharmaceuticals (ICH) for validation of in vitro permeability methods. The model was validated for the preliminary estimation of the BCS/BDDCS class. The KNIME workflow developed to automate new drug prediction is freely available. The results suggest that this automated prediction platform is a reliable tool for identifying the most promising compounds with high intestinal permeability during the early stages of drug discovery. |
format | Online Article Text |
id | pubmed-9610902 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-96109022022-10-28 Reliable Prediction of Caco-2 Permeability by Supervised Recursive Machine Learning Approaches Falcón-Cano, Gabriela Molina, Christophe Cabrera-Pérez, Miguel Ángel Pharmaceutics Article The heterogeneity of the Caco-2 cell line and differences in experimental protocols for permeability assessment using this cell-based method have resulted in the high variability of Caco-2 permeability measurements. These problems have limited the generation of large datasets to develop accurate and applicable regression models. This study presents a QSPR approach developed on the KNIME analytical platform and based on a structurally diverse dataset of over 4900 molecules. Interpretable models were obtained using random forest supervised recursive algorithms for data cleaning and feature selection. The development of a conditional consensus model based on regional and global regression random forest produced models with RMSE values between 0.43–0.51 for all validation sets. The potential applicability of the model as a surrogate for the in vitro Caco-2 assay was demonstrated through blind prediction of 32 drugs recommended by the International Council for the Harmonization of Technical Requirements for Pharmaceuticals (ICH) for validation of in vitro permeability methods. The model was validated for the preliminary estimation of the BCS/BDDCS class. The KNIME workflow developed to automate new drug prediction is freely available. The results suggest that this automated prediction platform is a reliable tool for identifying the most promising compounds with high intestinal permeability during the early stages of drug discovery. MDPI 2022-09-21 /pmc/articles/PMC9610902/ /pubmed/36297432 http://dx.doi.org/10.3390/pharmaceutics14101998 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Falcón-Cano, Gabriela Molina, Christophe Cabrera-Pérez, Miguel Ángel Reliable Prediction of Caco-2 Permeability by Supervised Recursive Machine Learning Approaches |
title | Reliable Prediction of Caco-2 Permeability by Supervised Recursive Machine Learning Approaches |
title_full | Reliable Prediction of Caco-2 Permeability by Supervised Recursive Machine Learning Approaches |
title_fullStr | Reliable Prediction of Caco-2 Permeability by Supervised Recursive Machine Learning Approaches |
title_full_unstemmed | Reliable Prediction of Caco-2 Permeability by Supervised Recursive Machine Learning Approaches |
title_short | Reliable Prediction of Caco-2 Permeability by Supervised Recursive Machine Learning Approaches |
title_sort | reliable prediction of caco-2 permeability by supervised recursive machine learning approaches |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9610902/ https://www.ncbi.nlm.nih.gov/pubmed/36297432 http://dx.doi.org/10.3390/pharmaceutics14101998 |
work_keys_str_mv | AT falconcanogabriela reliablepredictionofcaco2permeabilitybysupervisedrecursivemachinelearningapproaches AT molinachristophe reliablepredictionofcaco2permeabilitybysupervisedrecursivemachinelearningapproaches AT cabreraperezmiguelangel reliablepredictionofcaco2permeabilitybysupervisedrecursivemachinelearningapproaches |