Cargando…
A confidence predictor for logD using conformal regression and a support-vector machine
Lipophilicity is a major determinant of ADMET properties and overall suitability of drug candidates. We have developed large-scale models to predict water–octanol distribution coefficient (logD) for chemical compounds, aiding drug discovery projects. Using ACD/logD data for 1.6 million compounds fro...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5882484/ https://www.ncbi.nlm.nih.gov/pubmed/29616425 http://dx.doi.org/10.1186/s13321-018-0271-1 |
_version_ | 1783311480340348928 |
---|---|
author | Lapins, Maris Arvidsson, Staffan Lampa, Samuel Berg, Arvid Schaal, Wesley Alvarsson, Jonathan Spjuth, Ola |
author_facet | Lapins, Maris Arvidsson, Staffan Lampa, Samuel Berg, Arvid Schaal, Wesley Alvarsson, Jonathan Spjuth, Ola |
author_sort | Lapins, Maris |
collection | PubMed |
description | Lipophilicity is a major determinant of ADMET properties and overall suitability of drug candidates. We have developed large-scale models to predict water–octanol distribution coefficient (logD) for chemical compounds, aiding drug discovery projects. Using ACD/logD data for 1.6 million compounds from the ChEMBL database, models are created and evaluated by a support-vector machine with a linear kernel using conformal prediction methodology, outputting prediction intervals at a specified confidence level. The resulting model shows a predictive ability of [Formula: see text] and with the best performing nonconformity measure having median prediction interval of [Formula: see text] log units at 80% confidence and [Formula: see text] log units at 90% confidence. The model is available as an online service via an OpenAPI interface, a web page with a molecular editor, and we also publish predictive values at 90% confidence level for 91 M PubChem structures in RDF format for download and as an URI resolver service. [Image: see text] |
format | Online Article Text |
id | pubmed-5882484 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-58824842018-04-11 A confidence predictor for logD using conformal regression and a support-vector machine Lapins, Maris Arvidsson, Staffan Lampa, Samuel Berg, Arvid Schaal, Wesley Alvarsson, Jonathan Spjuth, Ola J Cheminform Research Article Lipophilicity is a major determinant of ADMET properties and overall suitability of drug candidates. We have developed large-scale models to predict water–octanol distribution coefficient (logD) for chemical compounds, aiding drug discovery projects. Using ACD/logD data for 1.6 million compounds from the ChEMBL database, models are created and evaluated by a support-vector machine with a linear kernel using conformal prediction methodology, outputting prediction intervals at a specified confidence level. The resulting model shows a predictive ability of [Formula: see text] and with the best performing nonconformity measure having median prediction interval of [Formula: see text] log units at 80% confidence and [Formula: see text] log units at 90% confidence. The model is available as an online service via an OpenAPI interface, a web page with a molecular editor, and we also publish predictive values at 90% confidence level for 91 M PubChem structures in RDF format for download and as an URI resolver service. [Image: see text] Springer International Publishing 2018-04-03 /pmc/articles/PMC5882484/ /pubmed/29616425 http://dx.doi.org/10.1186/s13321-018-0271-1 Text en © The Author(s) 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Lapins, Maris Arvidsson, Staffan Lampa, Samuel Berg, Arvid Schaal, Wesley Alvarsson, Jonathan Spjuth, Ola A confidence predictor for logD using conformal regression and a support-vector machine |
title | A confidence predictor for logD using conformal regression and a support-vector machine |
title_full | A confidence predictor for logD using conformal regression and a support-vector machine |
title_fullStr | A confidence predictor for logD using conformal regression and a support-vector machine |
title_full_unstemmed | A confidence predictor for logD using conformal regression and a support-vector machine |
title_short | A confidence predictor for logD using conformal regression and a support-vector machine |
title_sort | confidence predictor for logd using conformal regression and a support-vector machine |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5882484/ https://www.ncbi.nlm.nih.gov/pubmed/29616425 http://dx.doi.org/10.1186/s13321-018-0271-1 |
work_keys_str_mv | AT lapinsmaris aconfidencepredictorforlogdusingconformalregressionandasupportvectormachine AT arvidssonstaffan aconfidencepredictorforlogdusingconformalregressionandasupportvectormachine AT lampasamuel aconfidencepredictorforlogdusingconformalregressionandasupportvectormachine AT bergarvid aconfidencepredictorforlogdusingconformalregressionandasupportvectormachine AT schaalwesley aconfidencepredictorforlogdusingconformalregressionandasupportvectormachine AT alvarssonjonathan aconfidencepredictorforlogdusingconformalregressionandasupportvectormachine AT spjuthola aconfidencepredictorforlogdusingconformalregressionandasupportvectormachine AT lapinsmaris confidencepredictorforlogdusingconformalregressionandasupportvectormachine AT arvidssonstaffan confidencepredictorforlogdusingconformalregressionandasupportvectormachine AT lampasamuel confidencepredictorforlogdusingconformalregressionandasupportvectormachine AT bergarvid confidencepredictorforlogdusingconformalregressionandasupportvectormachine AT schaalwesley confidencepredictorforlogdusingconformalregressionandasupportvectormachine AT alvarssonjonathan confidencepredictorforlogdusingconformalregressionandasupportvectormachine AT spjuthola confidencepredictorforlogdusingconformalregressionandasupportvectormachine |