Cargando…

A confidence predictor for logD using conformal regression and a support-vector machine

Lipophilicity is a major determinant of ADMET properties and overall suitability of drug candidates. We have developed large-scale models to predict water–octanol distribution coefficient (logD) for chemical compounds, aiding drug discovery projects. Using ACD/logD data for 1.6 million compounds fro...

Descripción completa

Detalles Bibliográficos
Autores principales: Lapins, Maris, Arvidsson, Staffan, Lampa, Samuel, Berg, Arvid, Schaal, Wesley, Alvarsson, Jonathan, Spjuth, Ola
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5882484/
https://www.ncbi.nlm.nih.gov/pubmed/29616425
http://dx.doi.org/10.1186/s13321-018-0271-1
_version_ 1783311480340348928
author Lapins, Maris
Arvidsson, Staffan
Lampa, Samuel
Berg, Arvid
Schaal, Wesley
Alvarsson, Jonathan
Spjuth, Ola
author_facet Lapins, Maris
Arvidsson, Staffan
Lampa, Samuel
Berg, Arvid
Schaal, Wesley
Alvarsson, Jonathan
Spjuth, Ola
author_sort Lapins, Maris
collection PubMed
description Lipophilicity is a major determinant of ADMET properties and overall suitability of drug candidates. We have developed large-scale models to predict water–octanol distribution coefficient (logD) for chemical compounds, aiding drug discovery projects. Using ACD/logD data for 1.6 million compounds from the ChEMBL database, models are created and evaluated by a support-vector machine with a linear kernel using conformal prediction methodology, outputting prediction intervals at a specified confidence level. The resulting model shows a predictive ability of [Formula: see text] and with the best performing nonconformity measure having median prediction interval of [Formula: see text] log units at 80% confidence and [Formula: see text] log units at 90% confidence. The model is available as an online service via an OpenAPI interface, a web page with a molecular editor, and we also publish predictive values at 90% confidence level for 91 M PubChem structures in RDF format for download and as an URI resolver service. [Image: see text]
format Online
Article
Text
id pubmed-5882484
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-58824842018-04-11 A confidence predictor for logD using conformal regression and a support-vector machine Lapins, Maris Arvidsson, Staffan Lampa, Samuel Berg, Arvid Schaal, Wesley Alvarsson, Jonathan Spjuth, Ola J Cheminform Research Article Lipophilicity is a major determinant of ADMET properties and overall suitability of drug candidates. We have developed large-scale models to predict water–octanol distribution coefficient (logD) for chemical compounds, aiding drug discovery projects. Using ACD/logD data for 1.6 million compounds from the ChEMBL database, models are created and evaluated by a support-vector machine with a linear kernel using conformal prediction methodology, outputting prediction intervals at a specified confidence level. The resulting model shows a predictive ability of [Formula: see text] and with the best performing nonconformity measure having median prediction interval of [Formula: see text] log units at 80% confidence and [Formula: see text] log units at 90% confidence. The model is available as an online service via an OpenAPI interface, a web page with a molecular editor, and we also publish predictive values at 90% confidence level for 91 M PubChem structures in RDF format for download and as an URI resolver service. [Image: see text] Springer International Publishing 2018-04-03 /pmc/articles/PMC5882484/ /pubmed/29616425 http://dx.doi.org/10.1186/s13321-018-0271-1 Text en © The Author(s) 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Lapins, Maris
Arvidsson, Staffan
Lampa, Samuel
Berg, Arvid
Schaal, Wesley
Alvarsson, Jonathan
Spjuth, Ola
A confidence predictor for logD using conformal regression and a support-vector machine
title A confidence predictor for logD using conformal regression and a support-vector machine
title_full A confidence predictor for logD using conformal regression and a support-vector machine
title_fullStr A confidence predictor for logD using conformal regression and a support-vector machine
title_full_unstemmed A confidence predictor for logD using conformal regression and a support-vector machine
title_short A confidence predictor for logD using conformal regression and a support-vector machine
title_sort confidence predictor for logd using conformal regression and a support-vector machine
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5882484/
https://www.ncbi.nlm.nih.gov/pubmed/29616425
http://dx.doi.org/10.1186/s13321-018-0271-1
work_keys_str_mv AT lapinsmaris aconfidencepredictorforlogdusingconformalregressionandasupportvectormachine
AT arvidssonstaffan aconfidencepredictorforlogdusingconformalregressionandasupportvectormachine
AT lampasamuel aconfidencepredictorforlogdusingconformalregressionandasupportvectormachine
AT bergarvid aconfidencepredictorforlogdusingconformalregressionandasupportvectormachine
AT schaalwesley aconfidencepredictorforlogdusingconformalregressionandasupportvectormachine
AT alvarssonjonathan aconfidencepredictorforlogdusingconformalregressionandasupportvectormachine
AT spjuthola aconfidencepredictorforlogdusingconformalregressionandasupportvectormachine
AT lapinsmaris confidencepredictorforlogdusingconformalregressionandasupportvectormachine
AT arvidssonstaffan confidencepredictorforlogdusingconformalregressionandasupportvectormachine
AT lampasamuel confidencepredictorforlogdusingconformalregressionandasupportvectormachine
AT bergarvid confidencepredictorforlogdusingconformalregressionandasupportvectormachine
AT schaalwesley confidencepredictorforlogdusingconformalregressionandasupportvectormachine
AT alvarssonjonathan confidencepredictorforlogdusingconformalregressionandasupportvectormachine
AT spjuthola confidencepredictorforlogdusingconformalregressionandasupportvectormachine