Cargando…
JPlogP: an improved logP predictor trained using predicted data
The partition coefficient between octanol and water (logP) has been an important descriptor in QSAR predictions for many years and therefore the prediction of logP has been examined countless times. One of the best performing models is to predict the logP using multiple methods and average the resul...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6755606/ https://www.ncbi.nlm.nih.gov/pubmed/30552535 http://dx.doi.org/10.1186/s13321-018-0316-5 |
_version_ | 1783453269106884608 |
---|---|
author | Plante, Jeffrey Werner, Stephane |
author_facet | Plante, Jeffrey Werner, Stephane |
author_sort | Plante, Jeffrey |
collection | PubMed |
description | The partition coefficient between octanol and water (logP) has been an important descriptor in QSAR predictions for many years and therefore the prediction of logP has been examined countless times. One of the best performing models is to predict the logP using multiple methods and average the result. We have used those averaged predictions to develop a training-set which was able to distil the information present across the disparate logP methods into one single model. Our model was built using extendable atom-types, where each atom is distilled down into a 6 digit number, and each individual atom is assumed to have a small additive effect on the overall logP of the molecule. Beyond the simple coefficient model a consensus model is evaluated, which uses known compounds as a starting point in the calculation and modifies the experimental logP using the same coefficients as in the first model. We then test the performance of our models against two different datasets, one where many different models routinely perform well against, and another designed to more represent pharmaceutical space. The true strength of the model is represented in the pharmaceutical benchmark set, where both models perform better than any previously developed models. [Image: see text] ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13321-018-0316-5) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6755606 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-67556062019-09-26 JPlogP: an improved logP predictor trained using predicted data Plante, Jeffrey Werner, Stephane J Cheminform Research Article The partition coefficient between octanol and water (logP) has been an important descriptor in QSAR predictions for many years and therefore the prediction of logP has been examined countless times. One of the best performing models is to predict the logP using multiple methods and average the result. We have used those averaged predictions to develop a training-set which was able to distil the information present across the disparate logP methods into one single model. Our model was built using extendable atom-types, where each atom is distilled down into a 6 digit number, and each individual atom is assumed to have a small additive effect on the overall logP of the molecule. Beyond the simple coefficient model a consensus model is evaluated, which uses known compounds as a starting point in the calculation and modifies the experimental logP using the same coefficients as in the first model. We then test the performance of our models against two different datasets, one where many different models routinely perform well against, and another designed to more represent pharmaceutical space. The true strength of the model is represented in the pharmaceutical benchmark set, where both models perform better than any previously developed models. [Image: see text] ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13321-018-0316-5) contains supplementary material, which is available to authorized users. Springer International Publishing 2018-12-14 /pmc/articles/PMC6755606/ /pubmed/30552535 http://dx.doi.org/10.1186/s13321-018-0316-5 Text en © The Author(s) 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Plante, Jeffrey Werner, Stephane JPlogP: an improved logP predictor trained using predicted data |
title | JPlogP: an improved logP predictor trained using predicted data |
title_full | JPlogP: an improved logP predictor trained using predicted data |
title_fullStr | JPlogP: an improved logP predictor trained using predicted data |
title_full_unstemmed | JPlogP: an improved logP predictor trained using predicted data |
title_short | JPlogP: an improved logP predictor trained using predicted data |
title_sort | jplogp: an improved logp predictor trained using predicted data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6755606/ https://www.ncbi.nlm.nih.gov/pubmed/30552535 http://dx.doi.org/10.1186/s13321-018-0316-5 |
work_keys_str_mv | AT plantejeffrey jplogpanimprovedlogppredictortrainedusingpredicteddata AT wernerstephane jplogpanimprovedlogppredictortrainedusingpredicteddata |