Cargando…
MOFormer: Self-Supervised Transformer Model for Metal–Organic Framework Property Prediction
[Image: see text] Metal–organic frameworks (MOFs) are materials with a high degree of porosity that can be used for many applications. However, the chemical space of MOFs is enormous due to the large variety of possible combinations of building blocks and topology. Discovering the optimal MOFs for s...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2023
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10041520/ https://www.ncbi.nlm.nih.gov/pubmed/36706365 http://dx.doi.org/10.1021/jacs.2c11420 |
_version_ | 1784912737828077568 |
---|---|
author | Cao, Zhonglin Magar, Rishikesh Wang, Yuyang Barati Farimani, Amir |
author_facet | Cao, Zhonglin Magar, Rishikesh Wang, Yuyang Barati Farimani, Amir |
author_sort | Cao, Zhonglin |
collection | PubMed |
description | [Image: see text] Metal–organic frameworks (MOFs) are materials with a high degree of porosity that can be used for many applications. However, the chemical space of MOFs is enormous due to the large variety of possible combinations of building blocks and topology. Discovering the optimal MOFs for specific applications requires an efficient and accurate search over countless potential candidates. Previous high-throughput screening methods using computational simulations like DFT can be time-consuming. Such methods also require the 3D atomic structures of MOFs, which adds one extra step when evaluating hypothetical MOFs. In this work, we propose a structure-agnostic deep learning method based on the Transformer model, named as MOFormer, for property predictions of MOFs. MOFormer takes a text string representation of MOF (MOFid) as input, thus circumventing the need of obtaining the 3D structure of a hypothetical MOF and accelerating the screening process. By comparing to other descriptors such as Stoichiometric-120 and revised autocorrelations, we demonstrate that MOFormer can achieve state-of-the-art structure-agnostic prediction accuracy on all benchmarks. Furthermore, we introduce a self-supervised learning framework that pretrains the MOFormer via maximizing the cross-correlation between its structure-agnostic representations and structure-based representations of the crystal graph convolutional neural network (CGCNN) on >400k publicly available MOF data. Benchmarks show that pretraining improves the prediction accuracy of both models on various downstream prediction tasks. Furthermore, we revealed that MOFormer can be more data-efficient on quantum-chemical property prediction than structure-based CGCNN when training data is limited. Overall, MOFormer provides a novel perspective on efficient MOF property prediction using deep learning. |
format | Online Article Text |
id | pubmed-10041520 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-100415202023-03-28 MOFormer: Self-Supervised Transformer Model for Metal–Organic Framework Property Prediction Cao, Zhonglin Magar, Rishikesh Wang, Yuyang Barati Farimani, Amir J Am Chem Soc [Image: see text] Metal–organic frameworks (MOFs) are materials with a high degree of porosity that can be used for many applications. However, the chemical space of MOFs is enormous due to the large variety of possible combinations of building blocks and topology. Discovering the optimal MOFs for specific applications requires an efficient and accurate search over countless potential candidates. Previous high-throughput screening methods using computational simulations like DFT can be time-consuming. Such methods also require the 3D atomic structures of MOFs, which adds one extra step when evaluating hypothetical MOFs. In this work, we propose a structure-agnostic deep learning method based on the Transformer model, named as MOFormer, for property predictions of MOFs. MOFormer takes a text string representation of MOF (MOFid) as input, thus circumventing the need of obtaining the 3D structure of a hypothetical MOF and accelerating the screening process. By comparing to other descriptors such as Stoichiometric-120 and revised autocorrelations, we demonstrate that MOFormer can achieve state-of-the-art structure-agnostic prediction accuracy on all benchmarks. Furthermore, we introduce a self-supervised learning framework that pretrains the MOFormer via maximizing the cross-correlation between its structure-agnostic representations and structure-based representations of the crystal graph convolutional neural network (CGCNN) on >400k publicly available MOF data. Benchmarks show that pretraining improves the prediction accuracy of both models on various downstream prediction tasks. Furthermore, we revealed that MOFormer can be more data-efficient on quantum-chemical property prediction than structure-based CGCNN when training data is limited. Overall, MOFormer provides a novel perspective on efficient MOF property prediction using deep learning. American Chemical Society 2023-01-27 /pmc/articles/PMC10041520/ /pubmed/36706365 http://dx.doi.org/10.1021/jacs.2c11420 Text en © 2023 American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Cao, Zhonglin Magar, Rishikesh Wang, Yuyang Barati Farimani, Amir MOFormer: Self-Supervised Transformer Model for Metal–Organic Framework Property Prediction |
title | MOFormer: Self-Supervised
Transformer Model for Metal–Organic
Framework Property Prediction |
title_full | MOFormer: Self-Supervised
Transformer Model for Metal–Organic
Framework Property Prediction |
title_fullStr | MOFormer: Self-Supervised
Transformer Model for Metal–Organic
Framework Property Prediction |
title_full_unstemmed | MOFormer: Self-Supervised
Transformer Model for Metal–Organic
Framework Property Prediction |
title_short | MOFormer: Self-Supervised
Transformer Model for Metal–Organic
Framework Property Prediction |
title_sort | moformer: self-supervised
transformer model for metal–organic
framework property prediction |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10041520/ https://www.ncbi.nlm.nih.gov/pubmed/36706365 http://dx.doi.org/10.1021/jacs.2c11420 |
work_keys_str_mv | AT caozhonglin moformerselfsupervisedtransformermodelformetalorganicframeworkpropertyprediction AT magarrishikesh moformerselfsupervisedtransformermodelformetalorganicframeworkpropertyprediction AT wangyuyang moformerselfsupervisedtransformermodelformetalorganicframeworkpropertyprediction AT baratifarimaniamir moformerselfsupervisedtransformermodelformetalorganicframeworkpropertyprediction |