Cargando…

Predicting Polymers’ Glass Transition Temperature by a Chemical Language Processing Model

We propose a chemical language processing model to predict polymers’ glass transition temperature ([Formula: see text]) through a polymer language (SMILES, Simplified Molecular Input Line Entry System) embedding and recurrent neural network. This model only receives the SMILES strings of a polymer’s...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Guang, Tao, Lei, Li, Ying
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8201381/
https://www.ncbi.nlm.nih.gov/pubmed/34200505
http://dx.doi.org/10.3390/polym13111898
_version_ 1783707805715267584
author Chen, Guang
Tao, Lei
Li, Ying
author_facet Chen, Guang
Tao, Lei
Li, Ying
author_sort Chen, Guang
collection PubMed
description We propose a chemical language processing model to predict polymers’ glass transition temperature ([Formula: see text]) through a polymer language (SMILES, Simplified Molecular Input Line Entry System) embedding and recurrent neural network. This model only receives the SMILES strings of a polymer’s repeat units as inputs and considers the SMILES strings as sequential data at the character level. Using this method, there is no need to calculate any additional molecular descriptors or fingerprints of polymers, and thereby, being very computationally efficient. More importantly, it avoids the difficulties to generate molecular descriptors for repeat units containing polymerization point ‘*’. Results show that the trained model demonstrates reasonable prediction performance on unseen polymer’s [Formula: see text]. Besides, this model is further applied for high-throughput screening on an unlabeled polymer database to identify high-temperature polymers that are desired for applications in extreme environments. Our work demonstrates that the SMILES strings of polymer repeat units can be used as an effective feature representation to develop a chemical language processing model for predictions of polymer [Formula: see text]. The framework of this model is general and can be used to construct structure–property relationships for other polymer properties.
format Online
Article
Text
id pubmed-8201381
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-82013812021-06-15 Predicting Polymers’ Glass Transition Temperature by a Chemical Language Processing Model Chen, Guang Tao, Lei Li, Ying Polymers (Basel) Article We propose a chemical language processing model to predict polymers’ glass transition temperature ([Formula: see text]) through a polymer language (SMILES, Simplified Molecular Input Line Entry System) embedding and recurrent neural network. This model only receives the SMILES strings of a polymer’s repeat units as inputs and considers the SMILES strings as sequential data at the character level. Using this method, there is no need to calculate any additional molecular descriptors or fingerprints of polymers, and thereby, being very computationally efficient. More importantly, it avoids the difficulties to generate molecular descriptors for repeat units containing polymerization point ‘*’. Results show that the trained model demonstrates reasonable prediction performance on unseen polymer’s [Formula: see text]. Besides, this model is further applied for high-throughput screening on an unlabeled polymer database to identify high-temperature polymers that are desired for applications in extreme environments. Our work demonstrates that the SMILES strings of polymer repeat units can be used as an effective feature representation to develop a chemical language processing model for predictions of polymer [Formula: see text]. The framework of this model is general and can be used to construct structure–property relationships for other polymer properties. MDPI 2021-06-07 /pmc/articles/PMC8201381/ /pubmed/34200505 http://dx.doi.org/10.3390/polym13111898 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Chen, Guang
Tao, Lei
Li, Ying
Predicting Polymers’ Glass Transition Temperature by a Chemical Language Processing Model
title Predicting Polymers’ Glass Transition Temperature by a Chemical Language Processing Model
title_full Predicting Polymers’ Glass Transition Temperature by a Chemical Language Processing Model
title_fullStr Predicting Polymers’ Glass Transition Temperature by a Chemical Language Processing Model
title_full_unstemmed Predicting Polymers’ Glass Transition Temperature by a Chemical Language Processing Model
title_short Predicting Polymers’ Glass Transition Temperature by a Chemical Language Processing Model
title_sort predicting polymers’ glass transition temperature by a chemical language processing model
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8201381/
https://www.ncbi.nlm.nih.gov/pubmed/34200505
http://dx.doi.org/10.3390/polym13111898
work_keys_str_mv AT chenguang predictingpolymersglasstransitiontemperaturebyachemicallanguageprocessingmodel
AT taolei predictingpolymersglasstransitiontemperaturebyachemicallanguageprocessingmodel
AT liying predictingpolymersglasstransitiontemperaturebyachemicallanguageprocessingmodel