Cargando…
ADME prediction with KNIME: In silico aqueous solubility consensus model based on supervised recursive random forest approaches
In-silico prediction of aqueous solubility plays an important role during the drug discovery and development processes. For many years, the limited performance of in-silico solubility models has been attributed to the lack of high-quality solubility data for pharmaceutical molecules. However, some s...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
International Association of Physical Chemists
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8915604/ https://www.ncbi.nlm.nih.gov/pubmed/35300309 http://dx.doi.org/10.5599/admet.852 |
_version_ | 1784668069132500992 |
---|---|
author | Falcón-Cano, Gabriela Molina, Christophe Cabrera-Pérez, Miguel Ángel |
author_facet | Falcón-Cano, Gabriela Molina, Christophe Cabrera-Pérez, Miguel Ángel |
author_sort | Falcón-Cano, Gabriela |
collection | PubMed |
description | In-silico prediction of aqueous solubility plays an important role during the drug discovery and development processes. For many years, the limited performance of in-silico solubility models has been attributed to the lack of high-quality solubility data for pharmaceutical molecules. However, some studies suggest that the poor accuracy of solubility prediction is not related to the quality of the experimental data and that more precise methodologies (algorithms and/or set of descriptors) are required for predicting aqueous solubility for pharmaceutical molecules. In this study a large and diverse database was generated with aqueous solubility values collected from two public sources; two new recursive machine-learning approaches were developed for data cleaning and variable selection, and a consensus model based on regression and classification algorithms was created. The modeling protocol, which includes the curation of chemical and experimental data, was implemented in KNIME, with the aim of obtaining an automated workflow for the prediction of new databases. Finally, we compared several methods or models available in the literature with our consensus model, showing results comparable or even outperforming previous published models. |
format | Online Article Text |
id | pubmed-8915604 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | International Association of Physical Chemists |
record_format | MEDLINE/PubMed |
spelling | pubmed-89156042022-03-16 ADME prediction with KNIME: In silico aqueous solubility consensus model based on supervised recursive random forest approaches Falcón-Cano, Gabriela Molina, Christophe Cabrera-Pérez, Miguel Ángel ADMET DMPK Original Scientific Paper In-silico prediction of aqueous solubility plays an important role during the drug discovery and development processes. For many years, the limited performance of in-silico solubility models has been attributed to the lack of high-quality solubility data for pharmaceutical molecules. However, some studies suggest that the poor accuracy of solubility prediction is not related to the quality of the experimental data and that more precise methodologies (algorithms and/or set of descriptors) are required for predicting aqueous solubility for pharmaceutical molecules. In this study a large and diverse database was generated with aqueous solubility values collected from two public sources; two new recursive machine-learning approaches were developed for data cleaning and variable selection, and a consensus model based on regression and classification algorithms was created. The modeling protocol, which includes the curation of chemical and experimental data, was implemented in KNIME, with the aim of obtaining an automated workflow for the prediction of new databases. Finally, we compared several methods or models available in the literature with our consensus model, showing results comparable or even outperforming previous published models. International Association of Physical Chemists 2020-08-07 /pmc/articles/PMC8915604/ /pubmed/35300309 http://dx.doi.org/10.5599/admet.852 Text en Copyright © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ). |
spellingShingle | Original Scientific Paper Falcón-Cano, Gabriela Molina, Christophe Cabrera-Pérez, Miguel Ángel ADME prediction with KNIME: In silico aqueous solubility consensus model based on supervised recursive random forest approaches |
title | ADME prediction with KNIME: In silico aqueous solubility consensus model based on supervised recursive random forest approaches |
title_full | ADME prediction with KNIME: In silico aqueous solubility consensus model based on supervised recursive random forest approaches |
title_fullStr | ADME prediction with KNIME: In silico aqueous solubility consensus model based on supervised recursive random forest approaches |
title_full_unstemmed | ADME prediction with KNIME: In silico aqueous solubility consensus model based on supervised recursive random forest approaches |
title_short | ADME prediction with KNIME: In silico aqueous solubility consensus model based on supervised recursive random forest approaches |
title_sort | adme prediction with knime: in silico aqueous solubility consensus model based on supervised recursive random forest approaches |
topic | Original Scientific Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8915604/ https://www.ncbi.nlm.nih.gov/pubmed/35300309 http://dx.doi.org/10.5599/admet.852 |
work_keys_str_mv | AT falconcanogabriela admepredictionwithknimeinsilicoaqueoussolubilityconsensusmodelbasedonsupervisedrecursiverandomforestapproaches AT molinachristophe admepredictionwithknimeinsilicoaqueoussolubilityconsensusmodelbasedonsupervisedrecursiverandomforestapproaches AT cabreraperezmiguelangel admepredictionwithknimeinsilicoaqueoussolubilityconsensusmodelbasedonsupervisedrecursiverandomforestapproaches |