Cargando…

Development of an innovative data-driven system to generate descriptive prediction equation of dielectric constant on small sample sets

Dielectric constant (DC, ε) is a fundamental parameter in material sciences to measure polarizability of the system. In industrial processes, its value is an imperative indicator, which demonstrates the dielectric property of material and compiles information including separation information, chemic...

Descripción completa

Detalles Bibliográficos
Autores principales: Mao, Jiashun, Zeb, Amir, Kim, Min Sung, Jeon, Hyeon-Nae, Wang, Jianmin, Guan, Shenghui, NO, Kyoung Tai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9396556/
https://www.ncbi.nlm.nih.gov/pubmed/36016529
http://dx.doi.org/10.1016/j.heliyon.2022.e10011
_version_ 1784771952604348416
author Mao, Jiashun
Zeb, Amir
Kim, Min Sung
Jeon, Hyeon-Nae
Wang, Jianmin
Guan, Shenghui
NO, Kyoung Tai
author_facet Mao, Jiashun
Zeb, Amir
Kim, Min Sung
Jeon, Hyeon-Nae
Wang, Jianmin
Guan, Shenghui
NO, Kyoung Tai
author_sort Mao, Jiashun
collection PubMed
description Dielectric constant (DC, ε) is a fundamental parameter in material sciences to measure polarizability of the system. In industrial processes, its value is an imperative indicator, which demonstrates the dielectric property of material and compiles information including separation information, chemical equilibrium, chemical reactivity analysis, and solubility modeling. Since, the available ε-prediction models are fairly primitive and frequently suffer from serious failures especially when deals with strong polar compounds. Therefore, we have developed a novel data-driven system to improve the efficiency and wide-range applicability of ε using in material sciences. This innovative scheme adopts the correlation distance and genetic algorithm to discriminate features’ combination and avoid overfitting. Herein, the prediction output of the single ML model as a coding to estimate the target value by simulating the layer-by-layer extraction in deep learning, and enabling instant search for the optimal combination of features is recruited. Our model established an improved correlation value of 0.956 with target as compared to the previously available best traditional ML result of 0.877. Our framework established a profound improvement, especially for material systems possessing ε value >50. In terms of interpretability, we have derived a conceptual computational equation from a minimum generating tree. Our innovative data-driven system is preferentially superior over other methods due to its application for the prediction of dielectric constants as well as for the prediction of overall micro and macro-properties of any multi-components complex.
format Online
Article
Text
id pubmed-9396556
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-93965562022-08-24 Development of an innovative data-driven system to generate descriptive prediction equation of dielectric constant on small sample sets Mao, Jiashun Zeb, Amir Kim, Min Sung Jeon, Hyeon-Nae Wang, Jianmin Guan, Shenghui NO, Kyoung Tai Heliyon Research Article Dielectric constant (DC, ε) is a fundamental parameter in material sciences to measure polarizability of the system. In industrial processes, its value is an imperative indicator, which demonstrates the dielectric property of material and compiles information including separation information, chemical equilibrium, chemical reactivity analysis, and solubility modeling. Since, the available ε-prediction models are fairly primitive and frequently suffer from serious failures especially when deals with strong polar compounds. Therefore, we have developed a novel data-driven system to improve the efficiency and wide-range applicability of ε using in material sciences. This innovative scheme adopts the correlation distance and genetic algorithm to discriminate features’ combination and avoid overfitting. Herein, the prediction output of the single ML model as a coding to estimate the target value by simulating the layer-by-layer extraction in deep learning, and enabling instant search for the optimal combination of features is recruited. Our model established an improved correlation value of 0.956 with target as compared to the previously available best traditional ML result of 0.877. Our framework established a profound improvement, especially for material systems possessing ε value >50. In terms of interpretability, we have derived a conceptual computational equation from a minimum generating tree. Our innovative data-driven system is preferentially superior over other methods due to its application for the prediction of dielectric constants as well as for the prediction of overall micro and macro-properties of any multi-components complex. Elsevier 2022-08-04 /pmc/articles/PMC9396556/ /pubmed/36016529 http://dx.doi.org/10.1016/j.heliyon.2022.e10011 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research Article
Mao, Jiashun
Zeb, Amir
Kim, Min Sung
Jeon, Hyeon-Nae
Wang, Jianmin
Guan, Shenghui
NO, Kyoung Tai
Development of an innovative data-driven system to generate descriptive prediction equation of dielectric constant on small sample sets
title Development of an innovative data-driven system to generate descriptive prediction equation of dielectric constant on small sample sets
title_full Development of an innovative data-driven system to generate descriptive prediction equation of dielectric constant on small sample sets
title_fullStr Development of an innovative data-driven system to generate descriptive prediction equation of dielectric constant on small sample sets
title_full_unstemmed Development of an innovative data-driven system to generate descriptive prediction equation of dielectric constant on small sample sets
title_short Development of an innovative data-driven system to generate descriptive prediction equation of dielectric constant on small sample sets
title_sort development of an innovative data-driven system to generate descriptive prediction equation of dielectric constant on small sample sets
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9396556/
https://www.ncbi.nlm.nih.gov/pubmed/36016529
http://dx.doi.org/10.1016/j.heliyon.2022.e10011
work_keys_str_mv AT maojiashun developmentofaninnovativedatadrivensystemtogeneratedescriptivepredictionequationofdielectricconstantonsmallsamplesets
AT zebamir developmentofaninnovativedatadrivensystemtogeneratedescriptivepredictionequationofdielectricconstantonsmallsamplesets
AT kimminsung developmentofaninnovativedatadrivensystemtogeneratedescriptivepredictionequationofdielectricconstantonsmallsamplesets
AT jeonhyeonnae developmentofaninnovativedatadrivensystemtogeneratedescriptivepredictionequationofdielectricconstantonsmallsamplesets
AT wangjianmin developmentofaninnovativedatadrivensystemtogeneratedescriptivepredictionequationofdielectricconstantonsmallsamplesets
AT guanshenghui developmentofaninnovativedatadrivensystemtogeneratedescriptivepredictionequationofdielectricconstantonsmallsamplesets
AT nokyoungtai developmentofaninnovativedatadrivensystemtogeneratedescriptivepredictionequationofdielectricconstantonsmallsamplesets