Cargando…

ADME prediction with KNIME: In silico aqueous solubility consensus model based on supervised recursive random forest approaches

In-silico prediction of aqueous solubility plays an important role during the drug discovery and development processes. For many years, the limited performance of in-silico solubility models has been attributed to the lack of high-quality solubility data for pharmaceutical molecules. However, some s...

Descripción completa

Detalles Bibliográficos
Autores principales: Falcón-Cano, Gabriela, Molina, Christophe, Cabrera-Pérez, Miguel Ángel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: International Association of Physical Chemists 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8915604/
https://www.ncbi.nlm.nih.gov/pubmed/35300309
http://dx.doi.org/10.5599/admet.852
_version_ 1784668069132500992
author Falcón-Cano, Gabriela
Molina, Christophe
Cabrera-Pérez, Miguel Ángel
author_facet Falcón-Cano, Gabriela
Molina, Christophe
Cabrera-Pérez, Miguel Ángel
author_sort Falcón-Cano, Gabriela
collection PubMed
description In-silico prediction of aqueous solubility plays an important role during the drug discovery and development processes. For many years, the limited performance of in-silico solubility models has been attributed to the lack of high-quality solubility data for pharmaceutical molecules. However, some studies suggest that the poor accuracy of solubility prediction is not related to the quality of the experimental data and that more precise methodologies (algorithms and/or set of descriptors) are required for predicting aqueous solubility for pharmaceutical molecules. In this study a large and diverse database was generated with aqueous solubility values collected from two public sources; two new recursive machine-learning approaches were developed for data cleaning and variable selection, and a consensus model based on regression and classification algorithms was created. The modeling protocol, which includes the curation of chemical and experimental data, was implemented in KNIME, with the aim of obtaining an automated workflow for the prediction of new databases. Finally, we compared several methods or models available in the literature with our consensus model, showing results comparable or even outperforming previous published models.
format Online
Article
Text
id pubmed-8915604
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher International Association of Physical Chemists
record_format MEDLINE/PubMed
spelling pubmed-89156042022-03-16 ADME prediction with KNIME: In silico aqueous solubility consensus model based on supervised recursive random forest approaches Falcón-Cano, Gabriela Molina, Christophe Cabrera-Pérez, Miguel Ángel ADMET DMPK Original Scientific Paper In-silico prediction of aqueous solubility plays an important role during the drug discovery and development processes. For many years, the limited performance of in-silico solubility models has been attributed to the lack of high-quality solubility data for pharmaceutical molecules. However, some studies suggest that the poor accuracy of solubility prediction is not related to the quality of the experimental data and that more precise methodologies (algorithms and/or set of descriptors) are required for predicting aqueous solubility for pharmaceutical molecules. In this study a large and diverse database was generated with aqueous solubility values collected from two public sources; two new recursive machine-learning approaches were developed for data cleaning and variable selection, and a consensus model based on regression and classification algorithms was created. The modeling protocol, which includes the curation of chemical and experimental data, was implemented in KNIME, with the aim of obtaining an automated workflow for the prediction of new databases. Finally, we compared several methods or models available in the literature with our consensus model, showing results comparable or even outperforming previous published models. International Association of Physical Chemists 2020-08-07 /pmc/articles/PMC8915604/ /pubmed/35300309 http://dx.doi.org/10.5599/admet.852 Text en Copyright © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ).
spellingShingle Original Scientific Paper
Falcón-Cano, Gabriela
Molina, Christophe
Cabrera-Pérez, Miguel Ángel
ADME prediction with KNIME: In silico aqueous solubility consensus model based on supervised recursive random forest approaches
title ADME prediction with KNIME: In silico aqueous solubility consensus model based on supervised recursive random forest approaches
title_full ADME prediction with KNIME: In silico aqueous solubility consensus model based on supervised recursive random forest approaches
title_fullStr ADME prediction with KNIME: In silico aqueous solubility consensus model based on supervised recursive random forest approaches
title_full_unstemmed ADME prediction with KNIME: In silico aqueous solubility consensus model based on supervised recursive random forest approaches
title_short ADME prediction with KNIME: In silico aqueous solubility consensus model based on supervised recursive random forest approaches
title_sort adme prediction with knime: in silico aqueous solubility consensus model based on supervised recursive random forest approaches
topic Original Scientific Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8915604/
https://www.ncbi.nlm.nih.gov/pubmed/35300309
http://dx.doi.org/10.5599/admet.852
work_keys_str_mv AT falconcanogabriela admepredictionwithknimeinsilicoaqueoussolubilityconsensusmodelbasedonsupervisedrecursiverandomforestapproaches
AT molinachristophe admepredictionwithknimeinsilicoaqueoussolubilityconsensusmodelbasedonsupervisedrecursiverandomforestapproaches
AT cabreraperezmiguelangel admepredictionwithknimeinsilicoaqueoussolubilityconsensusmodelbasedonsupervisedrecursiverandomforestapproaches