Cargando…
Development of an innovative data-driven system to generate descriptive prediction equation of dielectric constant on small sample sets
Dielectric constant (DC, ε) is a fundamental parameter in material sciences to measure polarizability of the system. In industrial processes, its value is an imperative indicator, which demonstrates the dielectric property of material and compiles information including separation information, chemic...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9396556/ https://www.ncbi.nlm.nih.gov/pubmed/36016529 http://dx.doi.org/10.1016/j.heliyon.2022.e10011 |
_version_ | 1784771952604348416 |
---|---|
author | Mao, Jiashun Zeb, Amir Kim, Min Sung Jeon, Hyeon-Nae Wang, Jianmin Guan, Shenghui NO, Kyoung Tai |
author_facet | Mao, Jiashun Zeb, Amir Kim, Min Sung Jeon, Hyeon-Nae Wang, Jianmin Guan, Shenghui NO, Kyoung Tai |
author_sort | Mao, Jiashun |
collection | PubMed |
description | Dielectric constant (DC, ε) is a fundamental parameter in material sciences to measure polarizability of the system. In industrial processes, its value is an imperative indicator, which demonstrates the dielectric property of material and compiles information including separation information, chemical equilibrium, chemical reactivity analysis, and solubility modeling. Since, the available ε-prediction models are fairly primitive and frequently suffer from serious failures especially when deals with strong polar compounds. Therefore, we have developed a novel data-driven system to improve the efficiency and wide-range applicability of ε using in material sciences. This innovative scheme adopts the correlation distance and genetic algorithm to discriminate features’ combination and avoid overfitting. Herein, the prediction output of the single ML model as a coding to estimate the target value by simulating the layer-by-layer extraction in deep learning, and enabling instant search for the optimal combination of features is recruited. Our model established an improved correlation value of 0.956 with target as compared to the previously available best traditional ML result of 0.877. Our framework established a profound improvement, especially for material systems possessing ε value >50. In terms of interpretability, we have derived a conceptual computational equation from a minimum generating tree. Our innovative data-driven system is preferentially superior over other methods due to its application for the prediction of dielectric constants as well as for the prediction of overall micro and macro-properties of any multi-components complex. |
format | Online Article Text |
id | pubmed-9396556 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-93965562022-08-24 Development of an innovative data-driven system to generate descriptive prediction equation of dielectric constant on small sample sets Mao, Jiashun Zeb, Amir Kim, Min Sung Jeon, Hyeon-Nae Wang, Jianmin Guan, Shenghui NO, Kyoung Tai Heliyon Research Article Dielectric constant (DC, ε) is a fundamental parameter in material sciences to measure polarizability of the system. In industrial processes, its value is an imperative indicator, which demonstrates the dielectric property of material and compiles information including separation information, chemical equilibrium, chemical reactivity analysis, and solubility modeling. Since, the available ε-prediction models are fairly primitive and frequently suffer from serious failures especially when deals with strong polar compounds. Therefore, we have developed a novel data-driven system to improve the efficiency and wide-range applicability of ε using in material sciences. This innovative scheme adopts the correlation distance and genetic algorithm to discriminate features’ combination and avoid overfitting. Herein, the prediction output of the single ML model as a coding to estimate the target value by simulating the layer-by-layer extraction in deep learning, and enabling instant search for the optimal combination of features is recruited. Our model established an improved correlation value of 0.956 with target as compared to the previously available best traditional ML result of 0.877. Our framework established a profound improvement, especially for material systems possessing ε value >50. In terms of interpretability, we have derived a conceptual computational equation from a minimum generating tree. Our innovative data-driven system is preferentially superior over other methods due to its application for the prediction of dielectric constants as well as for the prediction of overall micro and macro-properties of any multi-components complex. Elsevier 2022-08-04 /pmc/articles/PMC9396556/ /pubmed/36016529 http://dx.doi.org/10.1016/j.heliyon.2022.e10011 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Research Article Mao, Jiashun Zeb, Amir Kim, Min Sung Jeon, Hyeon-Nae Wang, Jianmin Guan, Shenghui NO, Kyoung Tai Development of an innovative data-driven system to generate descriptive prediction equation of dielectric constant on small sample sets |
title | Development of an innovative data-driven system to generate descriptive prediction equation of dielectric constant on small sample sets |
title_full | Development of an innovative data-driven system to generate descriptive prediction equation of dielectric constant on small sample sets |
title_fullStr | Development of an innovative data-driven system to generate descriptive prediction equation of dielectric constant on small sample sets |
title_full_unstemmed | Development of an innovative data-driven system to generate descriptive prediction equation of dielectric constant on small sample sets |
title_short | Development of an innovative data-driven system to generate descriptive prediction equation of dielectric constant on small sample sets |
title_sort | development of an innovative data-driven system to generate descriptive prediction equation of dielectric constant on small sample sets |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9396556/ https://www.ncbi.nlm.nih.gov/pubmed/36016529 http://dx.doi.org/10.1016/j.heliyon.2022.e10011 |
work_keys_str_mv | AT maojiashun developmentofaninnovativedatadrivensystemtogeneratedescriptivepredictionequationofdielectricconstantonsmallsamplesets AT zebamir developmentofaninnovativedatadrivensystemtogeneratedescriptivepredictionequationofdielectricconstantonsmallsamplesets AT kimminsung developmentofaninnovativedatadrivensystemtogeneratedescriptivepredictionequationofdielectricconstantonsmallsamplesets AT jeonhyeonnae developmentofaninnovativedatadrivensystemtogeneratedescriptivepredictionequationofdielectricconstantonsmallsamplesets AT wangjianmin developmentofaninnovativedatadrivensystemtogeneratedescriptivepredictionequationofdielectricconstantonsmallsamplesets AT guanshenghui developmentofaninnovativedatadrivensystemtogeneratedescriptivepredictionequationofdielectricconstantonsmallsamplesets AT nokyoungtai developmentofaninnovativedatadrivensystemtogeneratedescriptivepredictionequationofdielectricconstantonsmallsamplesets |