Cargando…
A property-oriented adaptive design framework for rapid discovery of energetic molecules based on small-scale labeled datasets
It remains an important challenge to apply machine learning in material discovery with limited-scale datasets available, in particular for the energetic materials. Motivated by the challenge, we developed a Property-oriented Adaptive Design Framework (PADF) to quickly design new energetic compounds...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Royal Society of Chemistry
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9037014/ https://www.ncbi.nlm.nih.gov/pubmed/35478886 http://dx.doi.org/10.1039/d1ra03715c |
_version_ | 1784693639186743296 |
---|---|
author | Xie, Yunhao Liu, Yijing Hu, Renling Lin, Xu Hu, Jing Pu, Xuemei |
author_facet | Xie, Yunhao Liu, Yijing Hu, Renling Lin, Xu Hu, Jing Pu, Xuemei |
author_sort | Xie, Yunhao |
collection | PubMed |
description | It remains an important challenge to apply machine learning in material discovery with limited-scale datasets available, in particular for the energetic materials. Motivated by the challenge, we developed a Property-oriented Adaptive Design Framework (PADF) to quickly design new energetic compounds with desired properties. The PADF consists of a search space, machine learning model, optimization algorithm and an evaluator based on quantum mechanical calculations. The effectiveness and generality of the PADF were assessed by two case studies on the heat of formation and heat of explosion as the target properties. 88 compounds were selected as the initial training dataset from the search space containing 84 083 compounds generated. SVR.lin/Trade-off coupled with E-state + SOB and KRR/KG coupled with CDS + E-state + SOB were determined to be the best combination pairs for the heat of formation and the heat of explosion, respectively. Most of the ten compounds selected from the first ten iterations exhibit better properties than the optimal sample in the initial dataset. Besides, the heat of explosion as the target property outperforms the heat of formation in designing energetic compounds with high detonation performance. In particular, a new compound selected at the 3rd iteration exhibits high potential as an explosive. Our strategy could be extended to other domains limited by small-scale datasets labeled. |
format | Online Article Text |
id | pubmed-9037014 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | The Royal Society of Chemistry |
record_format | MEDLINE/PubMed |
spelling | pubmed-90370142022-04-26 A property-oriented adaptive design framework for rapid discovery of energetic molecules based on small-scale labeled datasets Xie, Yunhao Liu, Yijing Hu, Renling Lin, Xu Hu, Jing Pu, Xuemei RSC Adv Chemistry It remains an important challenge to apply machine learning in material discovery with limited-scale datasets available, in particular for the energetic materials. Motivated by the challenge, we developed a Property-oriented Adaptive Design Framework (PADF) to quickly design new energetic compounds with desired properties. The PADF consists of a search space, machine learning model, optimization algorithm and an evaluator based on quantum mechanical calculations. The effectiveness and generality of the PADF were assessed by two case studies on the heat of formation and heat of explosion as the target properties. 88 compounds were selected as the initial training dataset from the search space containing 84 083 compounds generated. SVR.lin/Trade-off coupled with E-state + SOB and KRR/KG coupled with CDS + E-state + SOB were determined to be the best combination pairs for the heat of formation and the heat of explosion, respectively. Most of the ten compounds selected from the first ten iterations exhibit better properties than the optimal sample in the initial dataset. Besides, the heat of explosion as the target property outperforms the heat of formation in designing energetic compounds with high detonation performance. In particular, a new compound selected at the 3rd iteration exhibits high potential as an explosive. Our strategy could be extended to other domains limited by small-scale datasets labeled. The Royal Society of Chemistry 2021-07-27 /pmc/articles/PMC9037014/ /pubmed/35478886 http://dx.doi.org/10.1039/d1ra03715c Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by-nc/3.0/ |
spellingShingle | Chemistry Xie, Yunhao Liu, Yijing Hu, Renling Lin, Xu Hu, Jing Pu, Xuemei A property-oriented adaptive design framework for rapid discovery of energetic molecules based on small-scale labeled datasets |
title | A property-oriented adaptive design framework for rapid discovery of energetic molecules based on small-scale labeled datasets |
title_full | A property-oriented adaptive design framework for rapid discovery of energetic molecules based on small-scale labeled datasets |
title_fullStr | A property-oriented adaptive design framework for rapid discovery of energetic molecules based on small-scale labeled datasets |
title_full_unstemmed | A property-oriented adaptive design framework for rapid discovery of energetic molecules based on small-scale labeled datasets |
title_short | A property-oriented adaptive design framework for rapid discovery of energetic molecules based on small-scale labeled datasets |
title_sort | property-oriented adaptive design framework for rapid discovery of energetic molecules based on small-scale labeled datasets |
topic | Chemistry |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9037014/ https://www.ncbi.nlm.nih.gov/pubmed/35478886 http://dx.doi.org/10.1039/d1ra03715c |
work_keys_str_mv | AT xieyunhao apropertyorientedadaptivedesignframeworkforrapiddiscoveryofenergeticmoleculesbasedonsmallscalelabeleddatasets AT liuyijing apropertyorientedadaptivedesignframeworkforrapiddiscoveryofenergeticmoleculesbasedonsmallscalelabeleddatasets AT hurenling apropertyorientedadaptivedesignframeworkforrapiddiscoveryofenergeticmoleculesbasedonsmallscalelabeleddatasets AT linxu apropertyorientedadaptivedesignframeworkforrapiddiscoveryofenergeticmoleculesbasedonsmallscalelabeleddatasets AT hujing apropertyorientedadaptivedesignframeworkforrapiddiscoveryofenergeticmoleculesbasedonsmallscalelabeleddatasets AT puxuemei apropertyorientedadaptivedesignframeworkforrapiddiscoveryofenergeticmoleculesbasedonsmallscalelabeleddatasets AT xieyunhao propertyorientedadaptivedesignframeworkforrapiddiscoveryofenergeticmoleculesbasedonsmallscalelabeleddatasets AT liuyijing propertyorientedadaptivedesignframeworkforrapiddiscoveryofenergeticmoleculesbasedonsmallscalelabeleddatasets AT hurenling propertyorientedadaptivedesignframeworkforrapiddiscoveryofenergeticmoleculesbasedonsmallscalelabeleddatasets AT linxu propertyorientedadaptivedesignframeworkforrapiddiscoveryofenergeticmoleculesbasedonsmallscalelabeleddatasets AT hujing propertyorientedadaptivedesignframeworkforrapiddiscoveryofenergeticmoleculesbasedonsmallscalelabeleddatasets AT puxuemei propertyorientedadaptivedesignframeworkforrapiddiscoveryofenergeticmoleculesbasedonsmallscalelabeleddatasets |