Cargando…
Machine learning with physicochemical relationships: solubility prediction in organic solvents and water
Solubility prediction remains a critical challenge in drug development, synthetic route and chemical process design, extraction and crystallisation. Here we report a successful approach to solubility prediction in organic solvents and water using a combination of machine learning (ANN, SVM, RF, Extr...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7666209/ https://www.ncbi.nlm.nih.gov/pubmed/33188226 http://dx.doi.org/10.1038/s41467-020-19594-z |
_version_ | 1783610091746885632 |
---|---|
author | Boobier, Samuel Hose, David R. J. Blacker, A. John Nguyen, Bao N. |
author_facet | Boobier, Samuel Hose, David R. J. Blacker, A. John Nguyen, Bao N. |
author_sort | Boobier, Samuel |
collection | PubMed |
description | Solubility prediction remains a critical challenge in drug development, synthetic route and chemical process design, extraction and crystallisation. Here we report a successful approach to solubility prediction in organic solvents and water using a combination of machine learning (ANN, SVM, RF, ExtraTrees, Bagging and GP) and computational chemistry. Rational interpretation of dissolution process into a numerical problem led to a small set of selected descriptors and subsequent predictions which are independent of the applied machine learning method. These models gave significantly more accurate predictions compared to benchmarked open-access and commercial tools, achieving accuracy close to the expected level of noise in training data (LogS ± 0.7). Finally, they reproduced physicochemical relationship between solubility and molecular properties in different solvents, which led to rational approaches to improve the accuracy of each models. |
format | Online Article Text |
id | pubmed-7666209 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-76662092020-11-17 Machine learning with physicochemical relationships: solubility prediction in organic solvents and water Boobier, Samuel Hose, David R. J. Blacker, A. John Nguyen, Bao N. Nat Commun Article Solubility prediction remains a critical challenge in drug development, synthetic route and chemical process design, extraction and crystallisation. Here we report a successful approach to solubility prediction in organic solvents and water using a combination of machine learning (ANN, SVM, RF, ExtraTrees, Bagging and GP) and computational chemistry. Rational interpretation of dissolution process into a numerical problem led to a small set of selected descriptors and subsequent predictions which are independent of the applied machine learning method. These models gave significantly more accurate predictions compared to benchmarked open-access and commercial tools, achieving accuracy close to the expected level of noise in training data (LogS ± 0.7). Finally, they reproduced physicochemical relationship between solubility and molecular properties in different solvents, which led to rational approaches to improve the accuracy of each models. Nature Publishing Group UK 2020-11-13 /pmc/articles/PMC7666209/ /pubmed/33188226 http://dx.doi.org/10.1038/s41467-020-19594-z Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Boobier, Samuel Hose, David R. J. Blacker, A. John Nguyen, Bao N. Machine learning with physicochemical relationships: solubility prediction in organic solvents and water |
title | Machine learning with physicochemical relationships: solubility prediction in organic solvents and water |
title_full | Machine learning with physicochemical relationships: solubility prediction in organic solvents and water |
title_fullStr | Machine learning with physicochemical relationships: solubility prediction in organic solvents and water |
title_full_unstemmed | Machine learning with physicochemical relationships: solubility prediction in organic solvents and water |
title_short | Machine learning with physicochemical relationships: solubility prediction in organic solvents and water |
title_sort | machine learning with physicochemical relationships: solubility prediction in organic solvents and water |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7666209/ https://www.ncbi.nlm.nih.gov/pubmed/33188226 http://dx.doi.org/10.1038/s41467-020-19594-z |
work_keys_str_mv | AT boobiersamuel machinelearningwithphysicochemicalrelationshipssolubilitypredictioninorganicsolventsandwater AT hosedavidrj machinelearningwithphysicochemicalrelationshipssolubilitypredictioninorganicsolventsandwater AT blackerajohn machinelearningwithphysicochemicalrelationshipssolubilitypredictioninorganicsolventsandwater AT nguyenbaon machinelearningwithphysicochemicalrelationshipssolubilitypredictioninorganicsolventsandwater |