Cargando…
Applicability Domain of Polyparameter Linear Free Energy Relationship Models Evaluated by Leverage and Prediction Interval Calculation
[Image: see text] Polyparameter linear free energy relationships (PP-LFERs) are accurate and robust models employed to predict equilibrium partition coefficients (K) of organic chemicals. The accuracy of predictions by a PP-LFER depends on the composition of the respective calibration data set. Gene...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2022
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9069697/ https://www.ncbi.nlm.nih.gov/pubmed/35420030 http://dx.doi.org/10.1021/acs.est.2c00865 |
_version_ | 1784700486353420288 |
---|---|
author | Endo, Satoshi |
author_facet | Endo, Satoshi |
author_sort | Endo, Satoshi |
collection | PubMed |
description | [Image: see text] Polyparameter linear free energy relationships (PP-LFERs) are accurate and robust models employed to predict equilibrium partition coefficients (K) of organic chemicals. The accuracy of predictions by a PP-LFER depends on the composition of the respective calibration data set. Generally, extrapolation outside the domain defined by the calibration data is likely to be less accurate than interpolation. In this study, the applicability domain (AD) of PP-LFERs was systematically evaluated by calculating the leverage (h) and prediction interval (PI). Repeated simulations with experimental data showed that the root mean squared error of predictions increased with h. However, the analysis also showed that PP-LFERs calibrated with a large number (e.g., 100) of training data were highly robust against extrapolation error. For such PP-LFERs, the common definition of extrapolation (h > 3 h(mean), where h(mean) is the mean h of all training compounds) may be excessively strict. Alternatively, the PI is proposed as a metric to define the AD of PP-LFERs, as it provides a concrete estimate of the error range that agrees well with the observed errors, even for extreme extrapolations. Additionally, published PP-LFERs were evaluated in terms of their AD using the new concept of AD probes, which indicated the varying predictive performance of PP-LFERs in the existing literature for environmentally relevant compounds. |
format | Online Article Text |
id | pubmed-9069697 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-90696972022-05-06 Applicability Domain of Polyparameter Linear Free Energy Relationship Models Evaluated by Leverage and Prediction Interval Calculation Endo, Satoshi Environ Sci Technol [Image: see text] Polyparameter linear free energy relationships (PP-LFERs) are accurate and robust models employed to predict equilibrium partition coefficients (K) of organic chemicals. The accuracy of predictions by a PP-LFER depends on the composition of the respective calibration data set. Generally, extrapolation outside the domain defined by the calibration data is likely to be less accurate than interpolation. In this study, the applicability domain (AD) of PP-LFERs was systematically evaluated by calculating the leverage (h) and prediction interval (PI). Repeated simulations with experimental data showed that the root mean squared error of predictions increased with h. However, the analysis also showed that PP-LFERs calibrated with a large number (e.g., 100) of training data were highly robust against extrapolation error. For such PP-LFERs, the common definition of extrapolation (h > 3 h(mean), where h(mean) is the mean h of all training compounds) may be excessively strict. Alternatively, the PI is proposed as a metric to define the AD of PP-LFERs, as it provides a concrete estimate of the error range that agrees well with the observed errors, even for extreme extrapolations. Additionally, published PP-LFERs were evaluated in terms of their AD using the new concept of AD probes, which indicated the varying predictive performance of PP-LFERs in the existing literature for environmentally relevant compounds. American Chemical Society 2022-04-14 2022-05-03 /pmc/articles/PMC9069697/ /pubmed/35420030 http://dx.doi.org/10.1021/acs.est.2c00865 Text en © 2022 The Author. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Endo, Satoshi Applicability Domain of Polyparameter Linear Free Energy Relationship Models Evaluated by Leverage and Prediction Interval Calculation |
title | Applicability
Domain of Polyparameter Linear Free
Energy Relationship Models Evaluated by Leverage and Prediction Interval
Calculation |
title_full | Applicability
Domain of Polyparameter Linear Free
Energy Relationship Models Evaluated by Leverage and Prediction Interval
Calculation |
title_fullStr | Applicability
Domain of Polyparameter Linear Free
Energy Relationship Models Evaluated by Leverage and Prediction Interval
Calculation |
title_full_unstemmed | Applicability
Domain of Polyparameter Linear Free
Energy Relationship Models Evaluated by Leverage and Prediction Interval
Calculation |
title_short | Applicability
Domain of Polyparameter Linear Free
Energy Relationship Models Evaluated by Leverage and Prediction Interval
Calculation |
title_sort | applicability
domain of polyparameter linear free
energy relationship models evaluated by leverage and prediction interval
calculation |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9069697/ https://www.ncbi.nlm.nih.gov/pubmed/35420030 http://dx.doi.org/10.1021/acs.est.2c00865 |
work_keys_str_mv | AT endosatoshi applicabilitydomainofpolyparameterlinearfreeenergyrelationshipmodelsevaluatedbyleverageandpredictionintervalcalculation |