Cargando…

Explainable graph neural networks for organic cages

The development of accurate and explicable machine learning models to predict the properties of topologically complex systems is a challenge in materials science. Porous organic cages, a class of polycyclic molecular materials, have potential application in molecular separations, catalysis and encap...

Descripción completa

Detalles Bibliográficos
Autores principales: Yuan, Qi, Szczypiński, Filip T., Jelfs, Kim E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: RSC 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8996732/
https://www.ncbi.nlm.nih.gov/pubmed/35515082
http://dx.doi.org/10.1039/d1dd00039j
_version_ 1784684543273336832
author Yuan, Qi
Szczypiński, Filip T.
Jelfs, Kim E.
author_facet Yuan, Qi
Szczypiński, Filip T.
Jelfs, Kim E.
author_sort Yuan, Qi
collection PubMed
description The development of accurate and explicable machine learning models to predict the properties of topologically complex systems is a challenge in materials science. Porous organic cages, a class of polycyclic molecular materials, have potential application in molecular separations, catalysis and encapsulation. For most applications of porous organic cages, having a permanent internal cavity in the absence of solvent, a property termed “shape persistence” is critical. Here, we report the development of Graph Neural Networks (GNNs) to predict the shape persistence of organic cages. Graph neural networks are a class of neural networks where the data, in our case that of organic cages, are represented by graphs. The performance of the GNN models was measured against a previously reported computational database of organic cages formed through a range of [4 + 6] reactions with a variety of reaction chemistries. The reported GNNs have an improved prediction accuracy and transferability compared to random forest predictions. Apart from the improvement in predictive power, we explored the explicability of the GNNs by computing the integrated gradient of the GNN input. The contribution of monomers and molecular fragments to the shape persistence of the organic cages could be quantitatively evaluated with integrated gradients. With the added explicability of the GNNs, it was possible not only to accurately predict the property of organic materials, but also to interpret the predictions of the deep learning models and provide structural insights for the discovery of future materials.
format Online
Article
Text
id pubmed-8996732
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher RSC
record_format MEDLINE/PubMed
spelling pubmed-89967322022-05-03 Explainable graph neural networks for organic cages Yuan, Qi Szczypiński, Filip T. Jelfs, Kim E. Digit Discov Chemistry The development of accurate and explicable machine learning models to predict the properties of topologically complex systems is a challenge in materials science. Porous organic cages, a class of polycyclic molecular materials, have potential application in molecular separations, catalysis and encapsulation. For most applications of porous organic cages, having a permanent internal cavity in the absence of solvent, a property termed “shape persistence” is critical. Here, we report the development of Graph Neural Networks (GNNs) to predict the shape persistence of organic cages. Graph neural networks are a class of neural networks where the data, in our case that of organic cages, are represented by graphs. The performance of the GNN models was measured against a previously reported computational database of organic cages formed through a range of [4 + 6] reactions with a variety of reaction chemistries. The reported GNNs have an improved prediction accuracy and transferability compared to random forest predictions. Apart from the improvement in predictive power, we explored the explicability of the GNNs by computing the integrated gradient of the GNN input. The contribution of monomers and molecular fragments to the shape persistence of the organic cages could be quantitatively evaluated with integrated gradients. With the added explicability of the GNNs, it was possible not only to accurately predict the property of organic materials, but also to interpret the predictions of the deep learning models and provide structural insights for the discovery of future materials. RSC 2022-02-11 /pmc/articles/PMC8996732/ /pubmed/35515082 http://dx.doi.org/10.1039/d1dd00039j Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by/3.0/
spellingShingle Chemistry
Yuan, Qi
Szczypiński, Filip T.
Jelfs, Kim E.
Explainable graph neural networks for organic cages
title Explainable graph neural networks for organic cages
title_full Explainable graph neural networks for organic cages
title_fullStr Explainable graph neural networks for organic cages
title_full_unstemmed Explainable graph neural networks for organic cages
title_short Explainable graph neural networks for organic cages
title_sort explainable graph neural networks for organic cages
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8996732/
https://www.ncbi.nlm.nih.gov/pubmed/35515082
http://dx.doi.org/10.1039/d1dd00039j
work_keys_str_mv AT yuanqi explainablegraphneuralnetworksfororganiccages
AT szczypinskifilipt explainablegraphneuralnetworksfororganiccages
AT jelfskime explainablegraphneuralnetworksfororganiccages