Cargando…
Explainable graph neural networks for organic cages
The development of accurate and explicable machine learning models to predict the properties of topologically complex systems is a challenge in materials science. Porous organic cages, a class of polycyclic molecular materials, have potential application in molecular separations, catalysis and encap...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
RSC
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8996732/ https://www.ncbi.nlm.nih.gov/pubmed/35515082 http://dx.doi.org/10.1039/d1dd00039j |
_version_ | 1784684543273336832 |
---|---|
author | Yuan, Qi Szczypiński, Filip T. Jelfs, Kim E. |
author_facet | Yuan, Qi Szczypiński, Filip T. Jelfs, Kim E. |
author_sort | Yuan, Qi |
collection | PubMed |
description | The development of accurate and explicable machine learning models to predict the properties of topologically complex systems is a challenge in materials science. Porous organic cages, a class of polycyclic molecular materials, have potential application in molecular separations, catalysis and encapsulation. For most applications of porous organic cages, having a permanent internal cavity in the absence of solvent, a property termed “shape persistence” is critical. Here, we report the development of Graph Neural Networks (GNNs) to predict the shape persistence of organic cages. Graph neural networks are a class of neural networks where the data, in our case that of organic cages, are represented by graphs. The performance of the GNN models was measured against a previously reported computational database of organic cages formed through a range of [4 + 6] reactions with a variety of reaction chemistries. The reported GNNs have an improved prediction accuracy and transferability compared to random forest predictions. Apart from the improvement in predictive power, we explored the explicability of the GNNs by computing the integrated gradient of the GNN input. The contribution of monomers and molecular fragments to the shape persistence of the organic cages could be quantitatively evaluated with integrated gradients. With the added explicability of the GNNs, it was possible not only to accurately predict the property of organic materials, but also to interpret the predictions of the deep learning models and provide structural insights for the discovery of future materials. |
format | Online Article Text |
id | pubmed-8996732 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | RSC |
record_format | MEDLINE/PubMed |
spelling | pubmed-89967322022-05-03 Explainable graph neural networks for organic cages Yuan, Qi Szczypiński, Filip T. Jelfs, Kim E. Digit Discov Chemistry The development of accurate and explicable machine learning models to predict the properties of topologically complex systems is a challenge in materials science. Porous organic cages, a class of polycyclic molecular materials, have potential application in molecular separations, catalysis and encapsulation. For most applications of porous organic cages, having a permanent internal cavity in the absence of solvent, a property termed “shape persistence” is critical. Here, we report the development of Graph Neural Networks (GNNs) to predict the shape persistence of organic cages. Graph neural networks are a class of neural networks where the data, in our case that of organic cages, are represented by graphs. The performance of the GNN models was measured against a previously reported computational database of organic cages formed through a range of [4 + 6] reactions with a variety of reaction chemistries. The reported GNNs have an improved prediction accuracy and transferability compared to random forest predictions. Apart from the improvement in predictive power, we explored the explicability of the GNNs by computing the integrated gradient of the GNN input. The contribution of monomers and molecular fragments to the shape persistence of the organic cages could be quantitatively evaluated with integrated gradients. With the added explicability of the GNNs, it was possible not only to accurately predict the property of organic materials, but also to interpret the predictions of the deep learning models and provide structural insights for the discovery of future materials. RSC 2022-02-11 /pmc/articles/PMC8996732/ /pubmed/35515082 http://dx.doi.org/10.1039/d1dd00039j Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by/3.0/ |
spellingShingle | Chemistry Yuan, Qi Szczypiński, Filip T. Jelfs, Kim E. Explainable graph neural networks for organic cages |
title | Explainable graph neural networks for organic cages |
title_full | Explainable graph neural networks for organic cages |
title_fullStr | Explainable graph neural networks for organic cages |
title_full_unstemmed | Explainable graph neural networks for organic cages |
title_short | Explainable graph neural networks for organic cages |
title_sort | explainable graph neural networks for organic cages |
topic | Chemistry |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8996732/ https://www.ncbi.nlm.nih.gov/pubmed/35515082 http://dx.doi.org/10.1039/d1dd00039j |
work_keys_str_mv | AT yuanqi explainablegraphneuralnetworksfororganiccages AT szczypinskifilipt explainablegraphneuralnetworksfororganiccages AT jelfskime explainablegraphneuralnetworksfororganiccages |