Cargando…
Machine-Learning-Based Prediction of the Glass Transition Temperature of Organic Compounds Using Experimental Data
[Image: see text] Knowledge of the glass transition temperature of molecular compounds that occur in atmospheric aerosol particles is important for estimating their viscosity, as it directly influences the kinetics of chemical reactions and particle phase state. While there is a great diversity of o...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2023
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10077449/ https://www.ncbi.nlm.nih.gov/pubmed/37033862 http://dx.doi.org/10.1021/acsomega.2c08146 |
_version_ | 1785020304912809984 |
---|---|
author | Armeli, Gianluca Peters, Jan-Hendrik Koop, Thomas |
author_facet | Armeli, Gianluca Peters, Jan-Hendrik Koop, Thomas |
author_sort | Armeli, Gianluca |
collection | PubMed |
description | [Image: see text] Knowledge of the glass transition temperature of molecular compounds that occur in atmospheric aerosol particles is important for estimating their viscosity, as it directly influences the kinetics of chemical reactions and particle phase state. While there is a great diversity of organic compounds present in aerosol particles, for only a minor fraction of them experimental glass transition temperatures are known. Therefore, we have developed a machine learning model designed to predict the glass transition temperature of organic molecular compounds based on molecule-derived input variables. The extremely randomized trees (extra trees) procedure was chosen for this purpose. Two approaches using different sets of input variables were followed. The first one uses the number of selected functional groups present in the compound, while the second one generates descriptors from a SMILES (Simplified Molecular Input Line Entry System) string. Organic compounds containing carbon, hydrogen, oxygen, nitrogen, and halogen atoms are included. For improved results, both approaches can be combined with the melting temperature of the compound as an additional input variable. The results show that the predictions of both approaches show a similar mean absolute error of about 12–13 K, with the SMILES-based predictions performing slightly better. In general, the model shows good predictive power considering the diversity of the experimental input data. Furthermore, we also show that its performance exceeds that of previous parameterizations developed for this purpose and also performs better than existing machine learning models. In order to provide user-friendly versions of the model for applications, we have developed a web site where the model can be run by interested scientists via a web-based interface without prior technical knowledge. We also provide Python code of the model. Additionally, all experimental input data are provided in form of the Bielefeld Molecular Organic Glasses (BIMOG) database. We believe that this model is a powerful tool for many applications in atmospheric aerosol science and material science. |
format | Online Article Text |
id | pubmed-10077449 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-100774492023-04-07 Machine-Learning-Based Prediction of the Glass Transition Temperature of Organic Compounds Using Experimental Data Armeli, Gianluca Peters, Jan-Hendrik Koop, Thomas ACS Omega [Image: see text] Knowledge of the glass transition temperature of molecular compounds that occur in atmospheric aerosol particles is important for estimating their viscosity, as it directly influences the kinetics of chemical reactions and particle phase state. While there is a great diversity of organic compounds present in aerosol particles, for only a minor fraction of them experimental glass transition temperatures are known. Therefore, we have developed a machine learning model designed to predict the glass transition temperature of organic molecular compounds based on molecule-derived input variables. The extremely randomized trees (extra trees) procedure was chosen for this purpose. Two approaches using different sets of input variables were followed. The first one uses the number of selected functional groups present in the compound, while the second one generates descriptors from a SMILES (Simplified Molecular Input Line Entry System) string. Organic compounds containing carbon, hydrogen, oxygen, nitrogen, and halogen atoms are included. For improved results, both approaches can be combined with the melting temperature of the compound as an additional input variable. The results show that the predictions of both approaches show a similar mean absolute error of about 12–13 K, with the SMILES-based predictions performing slightly better. In general, the model shows good predictive power considering the diversity of the experimental input data. Furthermore, we also show that its performance exceeds that of previous parameterizations developed for this purpose and also performs better than existing machine learning models. In order to provide user-friendly versions of the model for applications, we have developed a web site where the model can be run by interested scientists via a web-based interface without prior technical knowledge. We also provide Python code of the model. Additionally, all experimental input data are provided in form of the Bielefeld Molecular Organic Glasses (BIMOG) database. We believe that this model is a powerful tool for many applications in atmospheric aerosol science and material science. American Chemical Society 2023-03-22 /pmc/articles/PMC10077449/ /pubmed/37033862 http://dx.doi.org/10.1021/acsomega.2c08146 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Armeli, Gianluca Peters, Jan-Hendrik Koop, Thomas Machine-Learning-Based Prediction of the Glass Transition Temperature of Organic Compounds Using Experimental Data |
title | Machine-Learning-Based Prediction of the Glass Transition
Temperature of Organic Compounds Using Experimental Data |
title_full | Machine-Learning-Based Prediction of the Glass Transition
Temperature of Organic Compounds Using Experimental Data |
title_fullStr | Machine-Learning-Based Prediction of the Glass Transition
Temperature of Organic Compounds Using Experimental Data |
title_full_unstemmed | Machine-Learning-Based Prediction of the Glass Transition
Temperature of Organic Compounds Using Experimental Data |
title_short | Machine-Learning-Based Prediction of the Glass Transition
Temperature of Organic Compounds Using Experimental Data |
title_sort | machine-learning-based prediction of the glass transition
temperature of organic compounds using experimental data |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10077449/ https://www.ncbi.nlm.nih.gov/pubmed/37033862 http://dx.doi.org/10.1021/acsomega.2c08146 |
work_keys_str_mv | AT armeligianluca machinelearningbasedpredictionoftheglasstransitiontemperatureoforganiccompoundsusingexperimentaldata AT petersjanhendrik machinelearningbasedpredictionoftheglasstransitiontemperatureoforganiccompoundsusingexperimentaldata AT koopthomas machinelearningbasedpredictionoftheglasstransitiontemperatureoforganiccompoundsusingexperimentaldata |