Cargando…

Machine-Learning-Based Prediction of the Glass Transition Temperature of Organic Compounds Using Experimental Data

[Image: see text] Knowledge of the glass transition temperature of molecular compounds that occur in atmospheric aerosol particles is important for estimating their viscosity, as it directly influences the kinetics of chemical reactions and particle phase state. While there is a great diversity of o...

Descripción completa

Detalles Bibliográficos
Autores principales: Armeli, Gianluca, Peters, Jan-Hendrik, Koop, Thomas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2023
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10077449/
https://www.ncbi.nlm.nih.gov/pubmed/37033862
http://dx.doi.org/10.1021/acsomega.2c08146
_version_ 1785020304912809984
author Armeli, Gianluca
Peters, Jan-Hendrik
Koop, Thomas
author_facet Armeli, Gianluca
Peters, Jan-Hendrik
Koop, Thomas
author_sort Armeli, Gianluca
collection PubMed
description [Image: see text] Knowledge of the glass transition temperature of molecular compounds that occur in atmospheric aerosol particles is important for estimating their viscosity, as it directly influences the kinetics of chemical reactions and particle phase state. While there is a great diversity of organic compounds present in aerosol particles, for only a minor fraction of them experimental glass transition temperatures are known. Therefore, we have developed a machine learning model designed to predict the glass transition temperature of organic molecular compounds based on molecule-derived input variables. The extremely randomized trees (extra trees) procedure was chosen for this purpose. Two approaches using different sets of input variables were followed. The first one uses the number of selected functional groups present in the compound, while the second one generates descriptors from a SMILES (Simplified Molecular Input Line Entry System) string. Organic compounds containing carbon, hydrogen, oxygen, nitrogen, and halogen atoms are included. For improved results, both approaches can be combined with the melting temperature of the compound as an additional input variable. The results show that the predictions of both approaches show a similar mean absolute error of about 12–13 K, with the SMILES-based predictions performing slightly better. In general, the model shows good predictive power considering the diversity of the experimental input data. Furthermore, we also show that its performance exceeds that of previous parameterizations developed for this purpose and also performs better than existing machine learning models. In order to provide user-friendly versions of the model for applications, we have developed a web site where the model can be run by interested scientists via a web-based interface without prior technical knowledge. We also provide Python code of the model. Additionally, all experimental input data are provided in form of the Bielefeld Molecular Organic Glasses (BIMOG) database. We believe that this model is a powerful tool for many applications in atmospheric aerosol science and material science.
format Online
Article
Text
id pubmed-10077449
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-100774492023-04-07 Machine-Learning-Based Prediction of the Glass Transition Temperature of Organic Compounds Using Experimental Data Armeli, Gianluca Peters, Jan-Hendrik Koop, Thomas ACS Omega [Image: see text] Knowledge of the glass transition temperature of molecular compounds that occur in atmospheric aerosol particles is important for estimating their viscosity, as it directly influences the kinetics of chemical reactions and particle phase state. While there is a great diversity of organic compounds present in aerosol particles, for only a minor fraction of them experimental glass transition temperatures are known. Therefore, we have developed a machine learning model designed to predict the glass transition temperature of organic molecular compounds based on molecule-derived input variables. The extremely randomized trees (extra trees) procedure was chosen for this purpose. Two approaches using different sets of input variables were followed. The first one uses the number of selected functional groups present in the compound, while the second one generates descriptors from a SMILES (Simplified Molecular Input Line Entry System) string. Organic compounds containing carbon, hydrogen, oxygen, nitrogen, and halogen atoms are included. For improved results, both approaches can be combined with the melting temperature of the compound as an additional input variable. The results show that the predictions of both approaches show a similar mean absolute error of about 12–13 K, with the SMILES-based predictions performing slightly better. In general, the model shows good predictive power considering the diversity of the experimental input data. Furthermore, we also show that its performance exceeds that of previous parameterizations developed for this purpose and also performs better than existing machine learning models. In order to provide user-friendly versions of the model for applications, we have developed a web site where the model can be run by interested scientists via a web-based interface without prior technical knowledge. We also provide Python code of the model. Additionally, all experimental input data are provided in form of the Bielefeld Molecular Organic Glasses (BIMOG) database. We believe that this model is a powerful tool for many applications in atmospheric aerosol science and material science. American Chemical Society 2023-03-22 /pmc/articles/PMC10077449/ /pubmed/37033862 http://dx.doi.org/10.1021/acsomega.2c08146 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Armeli, Gianluca
Peters, Jan-Hendrik
Koop, Thomas
Machine-Learning-Based Prediction of the Glass Transition Temperature of Organic Compounds Using Experimental Data
title Machine-Learning-Based Prediction of the Glass Transition Temperature of Organic Compounds Using Experimental Data
title_full Machine-Learning-Based Prediction of the Glass Transition Temperature of Organic Compounds Using Experimental Data
title_fullStr Machine-Learning-Based Prediction of the Glass Transition Temperature of Organic Compounds Using Experimental Data
title_full_unstemmed Machine-Learning-Based Prediction of the Glass Transition Temperature of Organic Compounds Using Experimental Data
title_short Machine-Learning-Based Prediction of the Glass Transition Temperature of Organic Compounds Using Experimental Data
title_sort machine-learning-based prediction of the glass transition temperature of organic compounds using experimental data
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10077449/
https://www.ncbi.nlm.nih.gov/pubmed/37033862
http://dx.doi.org/10.1021/acsomega.2c08146
work_keys_str_mv AT armeligianluca machinelearningbasedpredictionoftheglasstransitiontemperatureoforganiccompoundsusingexperimentaldata
AT petersjanhendrik machinelearningbasedpredictionoftheglasstransitiontemperatureoforganiccompoundsusingexperimentaldata
AT koopthomas machinelearningbasedpredictionoftheglasstransitiontemperatureoforganiccompoundsusingexperimentaldata