Cargando…

Composition Based Oxidation State Prediction of Materials Using Deep Learning Language Models

Oxidation states (OS) are the charges on atoms due to electrons gained or lost upon applying an ionic approximation to their bonds. As a fundamental property, OS has been widely used in charge‐neutrality verification, crystal structure determination, and reaction estimation. Currently, only heuristi...

Descripción completa

Detalles Bibliográficos
Autores principales: Fu, Nihang, Hu, Jeffrey, Feng, Ying, Morrison, Gregory, zur Loye, Hans‐Conrad, Hu, Jianjun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10558692/
https://www.ncbi.nlm.nih.gov/pubmed/37551059
http://dx.doi.org/10.1002/advs.202301011
_version_ 1785117334071934976
author Fu, Nihang
Hu, Jeffrey
Feng, Ying
Morrison, Gregory
zur Loye, Hans‐Conrad
Hu, Jianjun
author_facet Fu, Nihang
Hu, Jeffrey
Feng, Ying
Morrison, Gregory
zur Loye, Hans‐Conrad
Hu, Jianjun
author_sort Fu, Nihang
collection PubMed
description Oxidation states (OS) are the charges on atoms due to electrons gained or lost upon applying an ionic approximation to their bonds. As a fundamental property, OS has been widely used in charge‐neutrality verification, crystal structure determination, and reaction estimation. Currently, only heuristic rules exist for guessing the oxidation states of a given compound with many exceptions. Recent work has developed machine learning models based on heuristic structural features for predicting the oxidation states of metal ions. However, composition‐based oxidation state prediction still remains elusive so far, which has significant implications for the discovery of new materials for which the structures have not been determined. This work proposes a novel deep learning‐based BERT transformer language model BERTOS for predicting the oxidation states for all elements of inorganic compounds given only their chemical composition. This model achieves 96.82% accuracy for all‐element oxidation states prediction benchmarked on the cleaned ICSD dataset and achieves 97.61% accuracy for oxide materials. It is also demonstrated how it can be used to conduct large‐scale screening of hypothetical material compositions for materials discovery.
format Online
Article
Text
id pubmed-10558692
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-105586922023-10-08 Composition Based Oxidation State Prediction of Materials Using Deep Learning Language Models Fu, Nihang Hu, Jeffrey Feng, Ying Morrison, Gregory zur Loye, Hans‐Conrad Hu, Jianjun Adv Sci (Weinh) Research Articles Oxidation states (OS) are the charges on atoms due to electrons gained or lost upon applying an ionic approximation to their bonds. As a fundamental property, OS has been widely used in charge‐neutrality verification, crystal structure determination, and reaction estimation. Currently, only heuristic rules exist for guessing the oxidation states of a given compound with many exceptions. Recent work has developed machine learning models based on heuristic structural features for predicting the oxidation states of metal ions. However, composition‐based oxidation state prediction still remains elusive so far, which has significant implications for the discovery of new materials for which the structures have not been determined. This work proposes a novel deep learning‐based BERT transformer language model BERTOS for predicting the oxidation states for all elements of inorganic compounds given only their chemical composition. This model achieves 96.82% accuracy for all‐element oxidation states prediction benchmarked on the cleaned ICSD dataset and achieves 97.61% accuracy for oxide materials. It is also demonstrated how it can be used to conduct large‐scale screening of hypothetical material compositions for materials discovery. John Wiley and Sons Inc. 2023-08-07 /pmc/articles/PMC10558692/ /pubmed/37551059 http://dx.doi.org/10.1002/advs.202301011 Text en © 2023 The Authors. Advanced Science published by Wiley‐VCH GmbH https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Fu, Nihang
Hu, Jeffrey
Feng, Ying
Morrison, Gregory
zur Loye, Hans‐Conrad
Hu, Jianjun
Composition Based Oxidation State Prediction of Materials Using Deep Learning Language Models
title Composition Based Oxidation State Prediction of Materials Using Deep Learning Language Models
title_full Composition Based Oxidation State Prediction of Materials Using Deep Learning Language Models
title_fullStr Composition Based Oxidation State Prediction of Materials Using Deep Learning Language Models
title_full_unstemmed Composition Based Oxidation State Prediction of Materials Using Deep Learning Language Models
title_short Composition Based Oxidation State Prediction of Materials Using Deep Learning Language Models
title_sort composition based oxidation state prediction of materials using deep learning language models
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10558692/
https://www.ncbi.nlm.nih.gov/pubmed/37551059
http://dx.doi.org/10.1002/advs.202301011
work_keys_str_mv AT funihang compositionbasedoxidationstatepredictionofmaterialsusingdeeplearninglanguagemodels
AT hujeffrey compositionbasedoxidationstatepredictionofmaterialsusingdeeplearninglanguagemodels
AT fengying compositionbasedoxidationstatepredictionofmaterialsusingdeeplearninglanguagemodels
AT morrisongregory compositionbasedoxidationstatepredictionofmaterialsusingdeeplearninglanguagemodels
AT zurloyehansconrad compositionbasedoxidationstatepredictionofmaterialsusingdeeplearninglanguagemodels
AT hujianjun compositionbasedoxidationstatepredictionofmaterialsusingdeeplearninglanguagemodels