Cargando…

Do you know your r(2)?

The prediction of solubility of drugs usually calls on the use of several open-source/commercially-available computer programs in the various calculation steps. Popular statistics to indicate the strength of the prediction model include the coefficient of determination (r(2)), Pearson’s linear corre...

Descripción completa

Detalles Bibliográficos
Autor principal: Avdeef, Alex
Formato: Online Artículo Texto
Lenguaje:English
Publicado: International Association of Physical Chemists 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8923304/
https://www.ncbi.nlm.nih.gov/pubmed/35299878
http://dx.doi.org/10.5599/admet.888
Descripción
Sumario:The prediction of solubility of drugs usually calls on the use of several open-source/commercially-available computer programs in the various calculation steps. Popular statistics to indicate the strength of the prediction model include the coefficient of determination (r(2)), Pearson’s linear correlation coefficient (r(Pearson)), and the root-mean-square error (RMSE), among many others. When a program calculates these statistics, slightly different definitions may be used. This commentary briefly reviews the definitions of three types of r(2) and RMSE statistics (model validation, bias compensation, and Pearson) and how systematic errors due to shortcomings in solubility prediction models can be differently indicated by the choice of statistical indices. The indices we have employed in recently published papers on the prediction of solubility of druglike molecules were unclear, especially in cases of drugs from ‘beyond the Rule of 5’ chemical space, as simple prediction models showed distinctive ‘bias-tilt’ systematic type scatter.