Cargando…
Predicting Solubility of Newly-Approved Drugs (2016–2020) with a Simple ABSOLV and GSE(Flexible-Acceptor) Consensus Model Outperforming Random Forest Regression
This study applies the ‘Flexible-Acceptor’ variant of the General Solubility Equation, GSE(Φ,B), to the prediction of the aqueous intrinsic solubility, log(10) S(0), of FDA recently-approved (2016–2020) ‘small-molecule’ new molecular entities (NMEs). The novel equation had been shown to predict the...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer US
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8818506/ https://www.ncbi.nlm.nih.gov/pubmed/35153342 http://dx.doi.org/10.1007/s10953-022-01141-7 |
_version_ | 1784645838458322944 |
---|---|
author | Avdeef, Alex Kansy, Manfred |
author_facet | Avdeef, Alex Kansy, Manfred |
author_sort | Avdeef, Alex |
collection | PubMed |
description | This study applies the ‘Flexible-Acceptor’ variant of the General Solubility Equation, GSE(Φ,B), to the prediction of the aqueous intrinsic solubility, log(10) S(0), of FDA recently-approved (2016–2020) ‘small-molecule’ new molecular entities (NMEs). The novel equation had been shown to predict the solubility of drugs beyond Lipinski’s ‘Rule of 5’ chemical space (bRo5) to a precision nearly matching that of the Random Forest Regression (RFR) machine learning method. Since then, it was found that the GSE(Φ,B) appears to work well not only for bRo5 NMEs, but also for Ro5 drugs. To put context to GSE(Φ,B), Yalkowsky’s GSE(classic), Abraham’s ABSOLV, and Breiman’s RFR models were also applied to predict log(10) S(0) of 72 newly-approve NMEs, for which useable reported solubility values could be accessed (nearly 60% from FDA New Drug Application published reports). Except for GSE (classic), the prediction models were retrained with an enlarged version of the Wiki-pS(0) database (nearly 400 added log(10) S(0) entries since our recent previous study). Thus, these four models were further validated by the additional independent solubility measurements which the newly-approved drugs introduced. The prediction methods ranked RFR ~ GSE (Φ,B) > ABSOLV > GSE (classic) in performance. It was further demonstrated that the biases generated in the four separate models could be nearly eliminated in a consensus model based on the average of just two of the methods: GSE (Φ,B) and ABSOLV. The resulting consensus prediction equation is simple in form and can be easily incorporated into spreadsheet calculations. Even more significant, it slightly outperformed the RFR method. |
format | Online Article Text |
id | pubmed-8818506 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Springer US |
record_format | MEDLINE/PubMed |
spelling | pubmed-88185062022-02-07 Predicting Solubility of Newly-Approved Drugs (2016–2020) with a Simple ABSOLV and GSE(Flexible-Acceptor) Consensus Model Outperforming Random Forest Regression Avdeef, Alex Kansy, Manfred J Solution Chem Article This study applies the ‘Flexible-Acceptor’ variant of the General Solubility Equation, GSE(Φ,B), to the prediction of the aqueous intrinsic solubility, log(10) S(0), of FDA recently-approved (2016–2020) ‘small-molecule’ new molecular entities (NMEs). The novel equation had been shown to predict the solubility of drugs beyond Lipinski’s ‘Rule of 5’ chemical space (bRo5) to a precision nearly matching that of the Random Forest Regression (RFR) machine learning method. Since then, it was found that the GSE(Φ,B) appears to work well not only for bRo5 NMEs, but also for Ro5 drugs. To put context to GSE(Φ,B), Yalkowsky’s GSE(classic), Abraham’s ABSOLV, and Breiman’s RFR models were also applied to predict log(10) S(0) of 72 newly-approve NMEs, for which useable reported solubility values could be accessed (nearly 60% from FDA New Drug Application published reports). Except for GSE (classic), the prediction models were retrained with an enlarged version of the Wiki-pS(0) database (nearly 400 added log(10) S(0) entries since our recent previous study). Thus, these four models were further validated by the additional independent solubility measurements which the newly-approved drugs introduced. The prediction methods ranked RFR ~ GSE (Φ,B) > ABSOLV > GSE (classic) in performance. It was further demonstrated that the biases generated in the four separate models could be nearly eliminated in a consensus model based on the average of just two of the methods: GSE (Φ,B) and ABSOLV. The resulting consensus prediction equation is simple in form and can be easily incorporated into spreadsheet calculations. Even more significant, it slightly outperformed the RFR method. Springer US 2022-02-07 2022 /pmc/articles/PMC8818506/ /pubmed/35153342 http://dx.doi.org/10.1007/s10953-022-01141-7 Text en © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Avdeef, Alex Kansy, Manfred Predicting Solubility of Newly-Approved Drugs (2016–2020) with a Simple ABSOLV and GSE(Flexible-Acceptor) Consensus Model Outperforming Random Forest Regression |
title | Predicting Solubility of Newly-Approved Drugs (2016–2020) with a Simple ABSOLV and GSE(Flexible-Acceptor) Consensus Model Outperforming Random Forest Regression |
title_full | Predicting Solubility of Newly-Approved Drugs (2016–2020) with a Simple ABSOLV and GSE(Flexible-Acceptor) Consensus Model Outperforming Random Forest Regression |
title_fullStr | Predicting Solubility of Newly-Approved Drugs (2016–2020) with a Simple ABSOLV and GSE(Flexible-Acceptor) Consensus Model Outperforming Random Forest Regression |
title_full_unstemmed | Predicting Solubility of Newly-Approved Drugs (2016–2020) with a Simple ABSOLV and GSE(Flexible-Acceptor) Consensus Model Outperforming Random Forest Regression |
title_short | Predicting Solubility of Newly-Approved Drugs (2016–2020) with a Simple ABSOLV and GSE(Flexible-Acceptor) Consensus Model Outperforming Random Forest Regression |
title_sort | predicting solubility of newly-approved drugs (2016–2020) with a simple absolv and gse(flexible-acceptor) consensus model outperforming random forest regression |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8818506/ https://www.ncbi.nlm.nih.gov/pubmed/35153342 http://dx.doi.org/10.1007/s10953-022-01141-7 |
work_keys_str_mv | AT avdeefalex predictingsolubilityofnewlyapproveddrugs20162020withasimpleabsolvandgseflexibleacceptorconsensusmodeloutperformingrandomforestregression AT kansymanfred predictingsolubilityofnewlyapproveddrugs20162020withasimpleabsolvandgseflexibleacceptorconsensusmodeloutperformingrandomforestregression |