Cargando…

On Two Novel Parameters for Validation of Predictive QSAR Models

Validation is a crucial aspect of quantitative structure–activity relationship (QSAR) modeling. The present paper shows that traditionally used validation parameters (leave-one-out Q(2) for internal validation and predictive R(2) for external validation) may be supplemented with two novel parameters...

Descripción completa

Detalles Bibliográficos
Autores principales: Pratim Roy, Partha, Paul, Somnath, Mitra, Indrani, Roy, Kunal
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Molecular Diversity Preservation International 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6254296/
https://www.ncbi.nlm.nih.gov/pubmed/19471190
http://dx.doi.org/10.3390/molecules14051660
Descripción
Sumario:Validation is a crucial aspect of quantitative structure–activity relationship (QSAR) modeling. The present paper shows that traditionally used validation parameters (leave-one-out Q(2) for internal validation and predictive R(2) for external validation) may be supplemented with two novel parameters r(m)(2) and R(p)(2) for a stricter test of validation. The parameter r(m)(2)((overall)) penalizes a model for large differences between observed and predicted values of the compounds of the whole set (considering both training and test sets) while the parameter R(p)(2) penalizes model R(2) for large differences between determination coefficient of nonrandom model and square of mean correlation coefficient of random models in case of a randomization test. Two other variants of r(m)(2) parameter, r(m)(2)((LOO)) and r(m)(2)((test)), penalize a model more strictly than Q(2) and R(2)(pred) respectively. Three different data sets of moderate to large size have been used to develop multiple models in order to indicate the suitability of the novel parameters in QSAR studies. The results show that in many cases the developed models could satisfy the requirements of conventional parameters (Q(2) and R(2)(pred)) but fail to achieve the required values for the novel parameters r(m)(2) and R(p)(2). Moreover, these parameters also help in identifying the best models from among a set of comparable models. Thus, a test for these two parameters is suggested to be a more stringent requirement than the traditional validation parameters to decide acceptability of a predictive QSAR model, especially when a regulatory decision is involved.