Cargando…

The influence of solid state information and descriptor selection on statistical models of temperature dependent aqueous solubility

Predicting the equilibrium solubility of organic, crystalline materials at all relevant temperatures is crucial to the digital design of manufacturing unit operations in the chemical industries. The work reported in our current publication builds upon the limited number of recently published quantit...

Descripción completa

Detalles Bibliográficos
Autores principales: Marchese Robinson, Richard L., Roberts, Kevin J., Martin, Elaine B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6115327/
https://www.ncbi.nlm.nih.gov/pubmed/30159699
http://dx.doi.org/10.1186/s13321-018-0298-3
_version_ 1783351358403903488
author Marchese Robinson, Richard L.
Roberts, Kevin J.
Martin, Elaine B.
author_facet Marchese Robinson, Richard L.
Roberts, Kevin J.
Martin, Elaine B.
author_sort Marchese Robinson, Richard L.
collection PubMed
description Predicting the equilibrium solubility of organic, crystalline materials at all relevant temperatures is crucial to the digital design of manufacturing unit operations in the chemical industries. The work reported in our current publication builds upon the limited number of recently published quantitative structure–property relationship studies which modelled the temperature dependence of aqueous solubility. One set of models was built to directly predict temperature dependent solubility, including for materials with no solubility data at any temperature. We propose that a modified cross-validation protocol is required to evaluate these models. Another set of models was built to predict the related enthalpy of solution term, which can be used to estimate solubility at one temperature based upon solubility data for the same material at another temperature. We investigated whether various kinds of solid state descriptors improved the models obtained with a variety of molecular descriptor combinations: lattice energies or 3D descriptors calculated from crystal structures or melting point data. We found that none of these greatly improved the best direct predictions of temperature dependent solubility or the related enthalpy of solution endpoint. This finding is surprising because the importance of the solid state contribution to both endpoints is clear. We suggest our findings may, in part, reflect limitations in the descriptors calculated from crystal structures and, more generally, the limited availability of polymorph specific data. We present curated temperature dependent solubility and enthalpy of solution datasets, integrated with molecular and crystal structures, for future investigations. [Image: see text] ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13321-018-0298-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6115327
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-61153272018-09-10 The influence of solid state information and descriptor selection on statistical models of temperature dependent aqueous solubility Marchese Robinson, Richard L. Roberts, Kevin J. Martin, Elaine B. J Cheminform Research Article Predicting the equilibrium solubility of organic, crystalline materials at all relevant temperatures is crucial to the digital design of manufacturing unit operations in the chemical industries. The work reported in our current publication builds upon the limited number of recently published quantitative structure–property relationship studies which modelled the temperature dependence of aqueous solubility. One set of models was built to directly predict temperature dependent solubility, including for materials with no solubility data at any temperature. We propose that a modified cross-validation protocol is required to evaluate these models. Another set of models was built to predict the related enthalpy of solution term, which can be used to estimate solubility at one temperature based upon solubility data for the same material at another temperature. We investigated whether various kinds of solid state descriptors improved the models obtained with a variety of molecular descriptor combinations: lattice energies or 3D descriptors calculated from crystal structures or melting point data. We found that none of these greatly improved the best direct predictions of temperature dependent solubility or the related enthalpy of solution endpoint. This finding is surprising because the importance of the solid state contribution to both endpoints is clear. We suggest our findings may, in part, reflect limitations in the descriptors calculated from crystal structures and, more generally, the limited availability of polymorph specific data. We present curated temperature dependent solubility and enthalpy of solution datasets, integrated with molecular and crystal structures, for future investigations. [Image: see text] ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13321-018-0298-3) contains supplementary material, which is available to authorized users. Springer International Publishing 2018-08-29 /pmc/articles/PMC6115327/ /pubmed/30159699 http://dx.doi.org/10.1186/s13321-018-0298-3 Text en © The Author(s) 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Marchese Robinson, Richard L.
Roberts, Kevin J.
Martin, Elaine B.
The influence of solid state information and descriptor selection on statistical models of temperature dependent aqueous solubility
title The influence of solid state information and descriptor selection on statistical models of temperature dependent aqueous solubility
title_full The influence of solid state information and descriptor selection on statistical models of temperature dependent aqueous solubility
title_fullStr The influence of solid state information and descriptor selection on statistical models of temperature dependent aqueous solubility
title_full_unstemmed The influence of solid state information and descriptor selection on statistical models of temperature dependent aqueous solubility
title_short The influence of solid state information and descriptor selection on statistical models of temperature dependent aqueous solubility
title_sort influence of solid state information and descriptor selection on statistical models of temperature dependent aqueous solubility
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6115327/
https://www.ncbi.nlm.nih.gov/pubmed/30159699
http://dx.doi.org/10.1186/s13321-018-0298-3
work_keys_str_mv AT marcheserobinsonrichardl theinfluenceofsolidstateinformationanddescriptorselectiononstatisticalmodelsoftemperaturedependentaqueoussolubility
AT robertskevinj theinfluenceofsolidstateinformationanddescriptorselectiononstatisticalmodelsoftemperaturedependentaqueoussolubility
AT martinelaineb theinfluenceofsolidstateinformationanddescriptorselectiononstatisticalmodelsoftemperaturedependentaqueoussolubility
AT marcheserobinsonrichardl influenceofsolidstateinformationanddescriptorselectiononstatisticalmodelsoftemperaturedependentaqueoussolubility
AT robertskevinj influenceofsolidstateinformationanddescriptorselectiononstatisticalmodelsoftemperaturedependentaqueoussolubility
AT martinelaineb influenceofsolidstateinformationanddescriptorselectiononstatisticalmodelsoftemperaturedependentaqueoussolubility