Cargando…
Feature Blending: An Approach toward Generalized Machine Learning Models for Property Prediction
[Image: see text] From studying the atomic structure and chemical behavior to the discovery of new materials and investigating properties of existing materials, machine learning (ML) has been employed in realms that are arduous to probe experimentally. While numerous highly accurate models, specific...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2021
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9718311/ https://www.ncbi.nlm.nih.gov/pubmed/36855577 http://dx.doi.org/10.1021/acsphyschemau.1c00017 |
_version_ | 1784843065555419136 |
---|---|
author | Satsangi, Swanti Mishra, Avanish Singh, Abhishek K. |
author_facet | Satsangi, Swanti Mishra, Avanish Singh, Abhishek K. |
author_sort | Satsangi, Swanti |
collection | PubMed |
description | [Image: see text] From studying the atomic structure and chemical behavior to the discovery of new materials and investigating properties of existing materials, machine learning (ML) has been employed in realms that are arduous to probe experimentally. While numerous highly accurate models, specifically for property prediction, have been reported in the literature, there has been a lack of a generalized framework. Herein we propose a novel feature selection approach that enables the development of a unified ML model for property prediction for several classes of materials. It involves an ingenious blending of selected features from various classes of data such that the resultant feature set equips the model with global data descriptors capturing both class-specific as well as global traits. We took accurate band gaps of three distinct classes of 2D materials as our target property to develop the proposed feature blending approach. Using Gaussian process regression (GPR) with the blended features, the ML model developed here resulted in an average root-mean-squared error of 0.12 eV for unseen data belonging to any of the participating classes. The feature blending approach proposed here can be extended to additional classes of materials and also to predict other properties. |
format | Online Article Text |
id | pubmed-9718311 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-97183112023-02-27 Feature Blending: An Approach toward Generalized Machine Learning Models for Property Prediction Satsangi, Swanti Mishra, Avanish Singh, Abhishek K. ACS Phys Chem Au [Image: see text] From studying the atomic structure and chemical behavior to the discovery of new materials and investigating properties of existing materials, machine learning (ML) has been employed in realms that are arduous to probe experimentally. While numerous highly accurate models, specifically for property prediction, have been reported in the literature, there has been a lack of a generalized framework. Herein we propose a novel feature selection approach that enables the development of a unified ML model for property prediction for several classes of materials. It involves an ingenious blending of selected features from various classes of data such that the resultant feature set equips the model with global data descriptors capturing both class-specific as well as global traits. We took accurate band gaps of three distinct classes of 2D materials as our target property to develop the proposed feature blending approach. Using Gaussian process regression (GPR) with the blended features, the ML model developed here resulted in an average root-mean-squared error of 0.12 eV for unseen data belonging to any of the participating classes. The feature blending approach proposed here can be extended to additional classes of materials and also to predict other properties. American Chemical Society 2021-09-17 /pmc/articles/PMC9718311/ /pubmed/36855577 http://dx.doi.org/10.1021/acsphyschemau.1c00017 Text en © 2021 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Satsangi, Swanti Mishra, Avanish Singh, Abhishek K. Feature Blending: An Approach toward Generalized Machine Learning Models for Property Prediction |
title | Feature Blending: An Approach toward Generalized
Machine Learning Models for Property Prediction |
title_full | Feature Blending: An Approach toward Generalized
Machine Learning Models for Property Prediction |
title_fullStr | Feature Blending: An Approach toward Generalized
Machine Learning Models for Property Prediction |
title_full_unstemmed | Feature Blending: An Approach toward Generalized
Machine Learning Models for Property Prediction |
title_short | Feature Blending: An Approach toward Generalized
Machine Learning Models for Property Prediction |
title_sort | feature blending: an approach toward generalized
machine learning models for property prediction |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9718311/ https://www.ncbi.nlm.nih.gov/pubmed/36855577 http://dx.doi.org/10.1021/acsphyschemau.1c00017 |
work_keys_str_mv | AT satsangiswanti featureblendinganapproachtowardgeneralizedmachinelearningmodelsforpropertyprediction AT mishraavanish featureblendinganapproachtowardgeneralizedmachinelearningmodelsforpropertyprediction AT singhabhishekk featureblendinganapproachtowardgeneralizedmachinelearningmodelsforpropertyprediction |