Cargando…

Comparison of Machine Learning Methods towards Developing Interpretable Polyamide Property Prediction

Polyamides are often used for their superior thermal, mechanical, and chemical properties. They form a diverse set of materials that have a large variation in properties between linear to aromatic compounds, which renders the traditional quantitative structure–property relationship (QSPR) challengin...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Franklin Langlang, Park, Jaehong, Goyal, Sushmit, Qaroush, Yousef, Wang, Shihu, Yoon, Hong, Rammohan, Aravind, Shim, Youngseon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8587315/
https://www.ncbi.nlm.nih.gov/pubmed/34771210
http://dx.doi.org/10.3390/polym13213653
_version_ 1784598111592644608
author Lee, Franklin Langlang
Park, Jaehong
Goyal, Sushmit
Qaroush, Yousef
Wang, Shihu
Yoon, Hong
Rammohan, Aravind
Shim, Youngseon
author_facet Lee, Franklin Langlang
Park, Jaehong
Goyal, Sushmit
Qaroush, Yousef
Wang, Shihu
Yoon, Hong
Rammohan, Aravind
Shim, Youngseon
author_sort Lee, Franklin Langlang
collection PubMed
description Polyamides are often used for their superior thermal, mechanical, and chemical properties. They form a diverse set of materials that have a large variation in properties between linear to aromatic compounds, which renders the traditional quantitative structure–property relationship (QSPR) challenging. We use extended connectivity fingerprints (ECFP) and traditional QSPR fingerprints to develop machine learning models to perform high fidelity prediction of glass transition temperature ([Formula: see text]), melting temperature ([Formula: see text]), density ([Formula: see text]), and tensile modulus ([Formula: see text]). The non-linear model using random forest is in general found to be more accurate than linear regression; however, using feature selection or regularization, the accuracy of linear models is shown to be improved significantly to become comparable to the more complex nonlinear algorithm. We find that none of the models or fingerprints were able to accurately predict the tensile modulus [Formula: see text] , which we hypothesize is due to heterogeneity in data and data sources, as well as inherent challenges in measuring it. Finally, QSPR models revealed that the fraction of rotatable bonds, and the rotational degree of freedom affects polyamide properties most profoundly and can be used for back of the envelope calculations for a quick estimate of the polymer attributes (glass transition temperature, melting temperature, and density). These QSPR models, although having slightly lower prediction accuracy, show the most promise for the polymer chemist seeking to develop an intuition of ways to modify the chemistry to enhance specific attributes.
format Online
Article
Text
id pubmed-8587315
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-85873152021-11-13 Comparison of Machine Learning Methods towards Developing Interpretable Polyamide Property Prediction Lee, Franklin Langlang Park, Jaehong Goyal, Sushmit Qaroush, Yousef Wang, Shihu Yoon, Hong Rammohan, Aravind Shim, Youngseon Polymers (Basel) Article Polyamides are often used for their superior thermal, mechanical, and chemical properties. They form a diverse set of materials that have a large variation in properties between linear to aromatic compounds, which renders the traditional quantitative structure–property relationship (QSPR) challenging. We use extended connectivity fingerprints (ECFP) and traditional QSPR fingerprints to develop machine learning models to perform high fidelity prediction of glass transition temperature ([Formula: see text]), melting temperature ([Formula: see text]), density ([Formula: see text]), and tensile modulus ([Formula: see text]). The non-linear model using random forest is in general found to be more accurate than linear regression; however, using feature selection or regularization, the accuracy of linear models is shown to be improved significantly to become comparable to the more complex nonlinear algorithm. We find that none of the models or fingerprints were able to accurately predict the tensile modulus [Formula: see text] , which we hypothesize is due to heterogeneity in data and data sources, as well as inherent challenges in measuring it. Finally, QSPR models revealed that the fraction of rotatable bonds, and the rotational degree of freedom affects polyamide properties most profoundly and can be used for back of the envelope calculations for a quick estimate of the polymer attributes (glass transition temperature, melting temperature, and density). These QSPR models, although having slightly lower prediction accuracy, show the most promise for the polymer chemist seeking to develop an intuition of ways to modify the chemistry to enhance specific attributes. MDPI 2021-10-23 /pmc/articles/PMC8587315/ /pubmed/34771210 http://dx.doi.org/10.3390/polym13213653 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Lee, Franklin Langlang
Park, Jaehong
Goyal, Sushmit
Qaroush, Yousef
Wang, Shihu
Yoon, Hong
Rammohan, Aravind
Shim, Youngseon
Comparison of Machine Learning Methods towards Developing Interpretable Polyamide Property Prediction
title Comparison of Machine Learning Methods towards Developing Interpretable Polyamide Property Prediction
title_full Comparison of Machine Learning Methods towards Developing Interpretable Polyamide Property Prediction
title_fullStr Comparison of Machine Learning Methods towards Developing Interpretable Polyamide Property Prediction
title_full_unstemmed Comparison of Machine Learning Methods towards Developing Interpretable Polyamide Property Prediction
title_short Comparison of Machine Learning Methods towards Developing Interpretable Polyamide Property Prediction
title_sort comparison of machine learning methods towards developing interpretable polyamide property prediction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8587315/
https://www.ncbi.nlm.nih.gov/pubmed/34771210
http://dx.doi.org/10.3390/polym13213653
work_keys_str_mv AT leefranklinlanglang comparisonofmachinelearningmethodstowardsdevelopinginterpretablepolyamidepropertyprediction
AT parkjaehong comparisonofmachinelearningmethodstowardsdevelopinginterpretablepolyamidepropertyprediction
AT goyalsushmit comparisonofmachinelearningmethodstowardsdevelopinginterpretablepolyamidepropertyprediction
AT qaroushyousef comparisonofmachinelearningmethodstowardsdevelopinginterpretablepolyamidepropertyprediction
AT wangshihu comparisonofmachinelearningmethodstowardsdevelopinginterpretablepolyamidepropertyprediction
AT yoonhong comparisonofmachinelearningmethodstowardsdevelopinginterpretablepolyamidepropertyprediction
AT rammohanaravind comparisonofmachinelearningmethodstowardsdevelopinginterpretablepolyamidepropertyprediction
AT shimyoungseon comparisonofmachinelearningmethodstowardsdevelopinginterpretablepolyamidepropertyprediction