Cargando…

MOFormer: Self-Supervised Transformer Model for Metal–Organic Framework Property Prediction

[Image: see text] Metal–organic frameworks (MOFs) are materials with a high degree of porosity that can be used for many applications. However, the chemical space of MOFs is enormous due to the large variety of possible combinations of building blocks and topology. Discovering the optimal MOFs for s...

Descripción completa

Detalles Bibliográficos
Autores principales: Cao, Zhonglin, Magar, Rishikesh, Wang, Yuyang, Barati Farimani, Amir
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2023
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10041520/
https://www.ncbi.nlm.nih.gov/pubmed/36706365
http://dx.doi.org/10.1021/jacs.2c11420
_version_ 1784912737828077568
author Cao, Zhonglin
Magar, Rishikesh
Wang, Yuyang
Barati Farimani, Amir
author_facet Cao, Zhonglin
Magar, Rishikesh
Wang, Yuyang
Barati Farimani, Amir
author_sort Cao, Zhonglin
collection PubMed
description [Image: see text] Metal–organic frameworks (MOFs) are materials with a high degree of porosity that can be used for many applications. However, the chemical space of MOFs is enormous due to the large variety of possible combinations of building blocks and topology. Discovering the optimal MOFs for specific applications requires an efficient and accurate search over countless potential candidates. Previous high-throughput screening methods using computational simulations like DFT can be time-consuming. Such methods also require the 3D atomic structures of MOFs, which adds one extra step when evaluating hypothetical MOFs. In this work, we propose a structure-agnostic deep learning method based on the Transformer model, named as MOFormer, for property predictions of MOFs. MOFormer takes a text string representation of MOF (MOFid) as input, thus circumventing the need of obtaining the 3D structure of a hypothetical MOF and accelerating the screening process. By comparing to other descriptors such as Stoichiometric-120 and revised autocorrelations, we demonstrate that MOFormer can achieve state-of-the-art structure-agnostic prediction accuracy on all benchmarks. Furthermore, we introduce a self-supervised learning framework that pretrains the MOFormer via maximizing the cross-correlation between its structure-agnostic representations and structure-based representations of the crystal graph convolutional neural network (CGCNN) on >400k publicly available MOF data. Benchmarks show that pretraining improves the prediction accuracy of both models on various downstream prediction tasks. Furthermore, we revealed that MOFormer can be more data-efficient on quantum-chemical property prediction than structure-based CGCNN when training data is limited. Overall, MOFormer provides a novel perspective on efficient MOF property prediction using deep learning.
format Online
Article
Text
id pubmed-10041520
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-100415202023-03-28 MOFormer: Self-Supervised Transformer Model for Metal–Organic Framework Property Prediction Cao, Zhonglin Magar, Rishikesh Wang, Yuyang Barati Farimani, Amir J Am Chem Soc [Image: see text] Metal–organic frameworks (MOFs) are materials with a high degree of porosity that can be used for many applications. However, the chemical space of MOFs is enormous due to the large variety of possible combinations of building blocks and topology. Discovering the optimal MOFs for specific applications requires an efficient and accurate search over countless potential candidates. Previous high-throughput screening methods using computational simulations like DFT can be time-consuming. Such methods also require the 3D atomic structures of MOFs, which adds one extra step when evaluating hypothetical MOFs. In this work, we propose a structure-agnostic deep learning method based on the Transformer model, named as MOFormer, for property predictions of MOFs. MOFormer takes a text string representation of MOF (MOFid) as input, thus circumventing the need of obtaining the 3D structure of a hypothetical MOF and accelerating the screening process. By comparing to other descriptors such as Stoichiometric-120 and revised autocorrelations, we demonstrate that MOFormer can achieve state-of-the-art structure-agnostic prediction accuracy on all benchmarks. Furthermore, we introduce a self-supervised learning framework that pretrains the MOFormer via maximizing the cross-correlation between its structure-agnostic representations and structure-based representations of the crystal graph convolutional neural network (CGCNN) on >400k publicly available MOF data. Benchmarks show that pretraining improves the prediction accuracy of both models on various downstream prediction tasks. Furthermore, we revealed that MOFormer can be more data-efficient on quantum-chemical property prediction than structure-based CGCNN when training data is limited. Overall, MOFormer provides a novel perspective on efficient MOF property prediction using deep learning. American Chemical Society 2023-01-27 /pmc/articles/PMC10041520/ /pubmed/36706365 http://dx.doi.org/10.1021/jacs.2c11420 Text en © 2023 American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Cao, Zhonglin
Magar, Rishikesh
Wang, Yuyang
Barati Farimani, Amir
MOFormer: Self-Supervised Transformer Model for Metal–Organic Framework Property Prediction
title MOFormer: Self-Supervised Transformer Model for Metal–Organic Framework Property Prediction
title_full MOFormer: Self-Supervised Transformer Model for Metal–Organic Framework Property Prediction
title_fullStr MOFormer: Self-Supervised Transformer Model for Metal–Organic Framework Property Prediction
title_full_unstemmed MOFormer: Self-Supervised Transformer Model for Metal–Organic Framework Property Prediction
title_short MOFormer: Self-Supervised Transformer Model for Metal–Organic Framework Property Prediction
title_sort moformer: self-supervised transformer model for metal–organic framework property prediction
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10041520/
https://www.ncbi.nlm.nih.gov/pubmed/36706365
http://dx.doi.org/10.1021/jacs.2c11420
work_keys_str_mv AT caozhonglin moformerselfsupervisedtransformermodelformetalorganicframeworkpropertyprediction
AT magarrishikesh moformerselfsupervisedtransformermodelformetalorganicframeworkpropertyprediction
AT wangyuyang moformerselfsupervisedtransformermodelformetalorganicframeworkpropertyprediction
AT baratifarimaniamir moformerselfsupervisedtransformermodelformetalorganicframeworkpropertyprediction