Cargando…

Machine Learning-Based Prediction of Crystal Systems and Space Groups from Inorganic Materials Compositions

[Image: see text] Structural information of materials such as the crystal systems and space groups are highly useful for analyzing their physical properties. However, the enormous composition space of materials makes experimental X-ray diffraction (XRD) or first-principle-based structure determinati...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Yong, Cui, Yuxin, Xiong, Zheng, Jin, Jing, Liu, Zhonghao, Dong, Rongzhi, Hu, Jianjun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2020
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7045551/
https://www.ncbi.nlm.nih.gov/pubmed/32118175
http://dx.doi.org/10.1021/acsomega.9b04012
_version_ 1783501799760592896
author Zhao, Yong
Cui, Yuxin
Xiong, Zheng
Jin, Jing
Liu, Zhonghao
Dong, Rongzhi
Hu, Jianjun
author_facet Zhao, Yong
Cui, Yuxin
Xiong, Zheng
Jin, Jing
Liu, Zhonghao
Dong, Rongzhi
Hu, Jianjun
author_sort Zhao, Yong
collection PubMed
description [Image: see text] Structural information of materials such as the crystal systems and space groups are highly useful for analyzing their physical properties. However, the enormous composition space of materials makes experimental X-ray diffraction (XRD) or first-principle-based structure determination methods infeasible for large-scale material screening in the composition space. Herein, we propose and evaluate machine-learning algorithms for determining the structure type of materials, given only their compositions. We couple random forest (RF) and multiple layer perceptron (MLP) neural network models with three types of features: Magpie, atom vector, and one-hot encoding (atom frequency) for the crystal system and space group prediction of materials. Four types of models for predicting crystal systems and space groups are proposed, trained, and evaluated including one-versus-all binary classifiers, multiclass classifiers, polymorphism predictors, and multilabel classifiers. The synthetic minority over-sampling technique (SMOTE) is conducted to mitigate the effects of imbalanced data sets. Our results demonstrate that RF with Magpie features generally outperforms other algorithms for binary and multiclass prediction of crystal systems and space groups, while MLP with atom frequency features is the best one for structural polymorphism prediction. For multilabel prediction, MLP with atom frequency and binary relevance with Magpie models are the best for predicting crystal systems and space groups, respectively. Our analysis of the related descriptors identifies a few key contributing features for structural-type prediction such as electronegativity, covalent radius, and Mendeleev number. Our work thus paves a way for fast composition-based structural screening of inorganic materials via predicted material structural properties.
format Online
Article
Text
id pubmed-7045551
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-70455512020-02-28 Machine Learning-Based Prediction of Crystal Systems and Space Groups from Inorganic Materials Compositions Zhao, Yong Cui, Yuxin Xiong, Zheng Jin, Jing Liu, Zhonghao Dong, Rongzhi Hu, Jianjun ACS Omega [Image: see text] Structural information of materials such as the crystal systems and space groups are highly useful for analyzing their physical properties. However, the enormous composition space of materials makes experimental X-ray diffraction (XRD) or first-principle-based structure determination methods infeasible for large-scale material screening in the composition space. Herein, we propose and evaluate machine-learning algorithms for determining the structure type of materials, given only their compositions. We couple random forest (RF) and multiple layer perceptron (MLP) neural network models with three types of features: Magpie, atom vector, and one-hot encoding (atom frequency) for the crystal system and space group prediction of materials. Four types of models for predicting crystal systems and space groups are proposed, trained, and evaluated including one-versus-all binary classifiers, multiclass classifiers, polymorphism predictors, and multilabel classifiers. The synthetic minority over-sampling technique (SMOTE) is conducted to mitigate the effects of imbalanced data sets. Our results demonstrate that RF with Magpie features generally outperforms other algorithms for binary and multiclass prediction of crystal systems and space groups, while MLP with atom frequency features is the best one for structural polymorphism prediction. For multilabel prediction, MLP with atom frequency and binary relevance with Magpie models are the best for predicting crystal systems and space groups, respectively. Our analysis of the related descriptors identifies a few key contributing features for structural-type prediction such as electronegativity, covalent radius, and Mendeleev number. Our work thus paves a way for fast composition-based structural screening of inorganic materials via predicted material structural properties. American Chemical Society 2020-02-13 /pmc/articles/PMC7045551/ /pubmed/32118175 http://dx.doi.org/10.1021/acsomega.9b04012 Text en Copyright © 2020 American Chemical Society This is an open access article published under an ACS AuthorChoice License (http://pubs.acs.org/page/policy/authorchoice_termsofuse.html) , which permits copying and redistribution of the article or any adaptations for non-commercial purposes.
spellingShingle Zhao, Yong
Cui, Yuxin
Xiong, Zheng
Jin, Jing
Liu, Zhonghao
Dong, Rongzhi
Hu, Jianjun
Machine Learning-Based Prediction of Crystal Systems and Space Groups from Inorganic Materials Compositions
title Machine Learning-Based Prediction of Crystal Systems and Space Groups from Inorganic Materials Compositions
title_full Machine Learning-Based Prediction of Crystal Systems and Space Groups from Inorganic Materials Compositions
title_fullStr Machine Learning-Based Prediction of Crystal Systems and Space Groups from Inorganic Materials Compositions
title_full_unstemmed Machine Learning-Based Prediction of Crystal Systems and Space Groups from Inorganic Materials Compositions
title_short Machine Learning-Based Prediction of Crystal Systems and Space Groups from Inorganic Materials Compositions
title_sort machine learning-based prediction of crystal systems and space groups from inorganic materials compositions
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7045551/
https://www.ncbi.nlm.nih.gov/pubmed/32118175
http://dx.doi.org/10.1021/acsomega.9b04012
work_keys_str_mv AT zhaoyong machinelearningbasedpredictionofcrystalsystemsandspacegroupsfrominorganicmaterialscompositions
AT cuiyuxin machinelearningbasedpredictionofcrystalsystemsandspacegroupsfrominorganicmaterialscompositions
AT xiongzheng machinelearningbasedpredictionofcrystalsystemsandspacegroupsfrominorganicmaterialscompositions
AT jinjing machinelearningbasedpredictionofcrystalsystemsandspacegroupsfrominorganicmaterialscompositions
AT liuzhonghao machinelearningbasedpredictionofcrystalsystemsandspacegroupsfrominorganicmaterialscompositions
AT dongrongzhi machinelearningbasedpredictionofcrystalsystemsandspacegroupsfrominorganicmaterialscompositions
AT hujianjun machinelearningbasedpredictionofcrystalsystemsandspacegroupsfrominorganicmaterialscompositions