Cargando…

Reliable Prediction of Caco-2 Permeability by Supervised Recursive Machine Learning Approaches

The heterogeneity of the Caco-2 cell line and differences in experimental protocols for permeability assessment using this cell-based method have resulted in the high variability of Caco-2 permeability measurements. These problems have limited the generation of large datasets to develop accurate and...

Descripción completa

Detalles Bibliográficos
Autores principales: Falcón-Cano, Gabriela, Molina, Christophe, Cabrera-Pérez, Miguel Ángel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9610902/
https://www.ncbi.nlm.nih.gov/pubmed/36297432
http://dx.doi.org/10.3390/pharmaceutics14101998
_version_ 1784819392866942976
author Falcón-Cano, Gabriela
Molina, Christophe
Cabrera-Pérez, Miguel Ángel
author_facet Falcón-Cano, Gabriela
Molina, Christophe
Cabrera-Pérez, Miguel Ángel
author_sort Falcón-Cano, Gabriela
collection PubMed
description The heterogeneity of the Caco-2 cell line and differences in experimental protocols for permeability assessment using this cell-based method have resulted in the high variability of Caco-2 permeability measurements. These problems have limited the generation of large datasets to develop accurate and applicable regression models. This study presents a QSPR approach developed on the KNIME analytical platform and based on a structurally diverse dataset of over 4900 molecules. Interpretable models were obtained using random forest supervised recursive algorithms for data cleaning and feature selection. The development of a conditional consensus model based on regional and global regression random forest produced models with RMSE values between 0.43–0.51 for all validation sets. The potential applicability of the model as a surrogate for the in vitro Caco-2 assay was demonstrated through blind prediction of 32 drugs recommended by the International Council for the Harmonization of Technical Requirements for Pharmaceuticals (ICH) for validation of in vitro permeability methods. The model was validated for the preliminary estimation of the BCS/BDDCS class. The KNIME workflow developed to automate new drug prediction is freely available. The results suggest that this automated prediction platform is a reliable tool for identifying the most promising compounds with high intestinal permeability during the early stages of drug discovery.
format Online
Article
Text
id pubmed-9610902
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96109022022-10-28 Reliable Prediction of Caco-2 Permeability by Supervised Recursive Machine Learning Approaches Falcón-Cano, Gabriela Molina, Christophe Cabrera-Pérez, Miguel Ángel Pharmaceutics Article The heterogeneity of the Caco-2 cell line and differences in experimental protocols for permeability assessment using this cell-based method have resulted in the high variability of Caco-2 permeability measurements. These problems have limited the generation of large datasets to develop accurate and applicable regression models. This study presents a QSPR approach developed on the KNIME analytical platform and based on a structurally diverse dataset of over 4900 molecules. Interpretable models were obtained using random forest supervised recursive algorithms for data cleaning and feature selection. The development of a conditional consensus model based on regional and global regression random forest produced models with RMSE values between 0.43–0.51 for all validation sets. The potential applicability of the model as a surrogate for the in vitro Caco-2 assay was demonstrated through blind prediction of 32 drugs recommended by the International Council for the Harmonization of Technical Requirements for Pharmaceuticals (ICH) for validation of in vitro permeability methods. The model was validated for the preliminary estimation of the BCS/BDDCS class. The KNIME workflow developed to automate new drug prediction is freely available. The results suggest that this automated prediction platform is a reliable tool for identifying the most promising compounds with high intestinal permeability during the early stages of drug discovery. MDPI 2022-09-21 /pmc/articles/PMC9610902/ /pubmed/36297432 http://dx.doi.org/10.3390/pharmaceutics14101998 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Falcón-Cano, Gabriela
Molina, Christophe
Cabrera-Pérez, Miguel Ángel
Reliable Prediction of Caco-2 Permeability by Supervised Recursive Machine Learning Approaches
title Reliable Prediction of Caco-2 Permeability by Supervised Recursive Machine Learning Approaches
title_full Reliable Prediction of Caco-2 Permeability by Supervised Recursive Machine Learning Approaches
title_fullStr Reliable Prediction of Caco-2 Permeability by Supervised Recursive Machine Learning Approaches
title_full_unstemmed Reliable Prediction of Caco-2 Permeability by Supervised Recursive Machine Learning Approaches
title_short Reliable Prediction of Caco-2 Permeability by Supervised Recursive Machine Learning Approaches
title_sort reliable prediction of caco-2 permeability by supervised recursive machine learning approaches
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9610902/
https://www.ncbi.nlm.nih.gov/pubmed/36297432
http://dx.doi.org/10.3390/pharmaceutics14101998
work_keys_str_mv AT falconcanogabriela reliablepredictionofcaco2permeabilitybysupervisedrecursivemachinelearningapproaches
AT molinachristophe reliablepredictionofcaco2permeabilitybysupervisedrecursivemachinelearningapproaches
AT cabreraperezmiguelangel reliablepredictionofcaco2permeabilitybysupervisedrecursivemachinelearningapproaches