Cargando…

A property-oriented adaptive design framework for rapid discovery of energetic molecules based on small-scale labeled datasets

It remains an important challenge to apply machine learning in material discovery with limited-scale datasets available, in particular for the energetic materials. Motivated by the challenge, we developed a Property-oriented Adaptive Design Framework (PADF) to quickly design new energetic compounds...

Descripción completa

Detalles Bibliográficos
Autores principales: Xie, Yunhao, Liu, Yijing, Hu, Renling, Lin, Xu, Hu, Jing, Pu, Xuemei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society of Chemistry 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9037014/
https://www.ncbi.nlm.nih.gov/pubmed/35478886
http://dx.doi.org/10.1039/d1ra03715c
_version_ 1784693639186743296
author Xie, Yunhao
Liu, Yijing
Hu, Renling
Lin, Xu
Hu, Jing
Pu, Xuemei
author_facet Xie, Yunhao
Liu, Yijing
Hu, Renling
Lin, Xu
Hu, Jing
Pu, Xuemei
author_sort Xie, Yunhao
collection PubMed
description It remains an important challenge to apply machine learning in material discovery with limited-scale datasets available, in particular for the energetic materials. Motivated by the challenge, we developed a Property-oriented Adaptive Design Framework (PADF) to quickly design new energetic compounds with desired properties. The PADF consists of a search space, machine learning model, optimization algorithm and an evaluator based on quantum mechanical calculations. The effectiveness and generality of the PADF were assessed by two case studies on the heat of formation and heat of explosion as the target properties. 88 compounds were selected as the initial training dataset from the search space containing 84 083 compounds generated. SVR.lin/Trade-off coupled with E-state + SOB and KRR/KG coupled with CDS + E-state + SOB were determined to be the best combination pairs for the heat of formation and the heat of explosion, respectively. Most of the ten compounds selected from the first ten iterations exhibit better properties than the optimal sample in the initial dataset. Besides, the heat of explosion as the target property outperforms the heat of formation in designing energetic compounds with high detonation performance. In particular, a new compound selected at the 3rd iteration exhibits high potential as an explosive. Our strategy could be extended to other domains limited by small-scale datasets labeled.
format Online
Article
Text
id pubmed-9037014
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher The Royal Society of Chemistry
record_format MEDLINE/PubMed
spelling pubmed-90370142022-04-26 A property-oriented adaptive design framework for rapid discovery of energetic molecules based on small-scale labeled datasets Xie, Yunhao Liu, Yijing Hu, Renling Lin, Xu Hu, Jing Pu, Xuemei RSC Adv Chemistry It remains an important challenge to apply machine learning in material discovery with limited-scale datasets available, in particular for the energetic materials. Motivated by the challenge, we developed a Property-oriented Adaptive Design Framework (PADF) to quickly design new energetic compounds with desired properties. The PADF consists of a search space, machine learning model, optimization algorithm and an evaluator based on quantum mechanical calculations. The effectiveness and generality of the PADF were assessed by two case studies on the heat of formation and heat of explosion as the target properties. 88 compounds were selected as the initial training dataset from the search space containing 84 083 compounds generated. SVR.lin/Trade-off coupled with E-state + SOB and KRR/KG coupled with CDS + E-state + SOB were determined to be the best combination pairs for the heat of formation and the heat of explosion, respectively. Most of the ten compounds selected from the first ten iterations exhibit better properties than the optimal sample in the initial dataset. Besides, the heat of explosion as the target property outperforms the heat of formation in designing energetic compounds with high detonation performance. In particular, a new compound selected at the 3rd iteration exhibits high potential as an explosive. Our strategy could be extended to other domains limited by small-scale datasets labeled. The Royal Society of Chemistry 2021-07-27 /pmc/articles/PMC9037014/ /pubmed/35478886 http://dx.doi.org/10.1039/d1ra03715c Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by-nc/3.0/
spellingShingle Chemistry
Xie, Yunhao
Liu, Yijing
Hu, Renling
Lin, Xu
Hu, Jing
Pu, Xuemei
A property-oriented adaptive design framework for rapid discovery of energetic molecules based on small-scale labeled datasets
title A property-oriented adaptive design framework for rapid discovery of energetic molecules based on small-scale labeled datasets
title_full A property-oriented adaptive design framework for rapid discovery of energetic molecules based on small-scale labeled datasets
title_fullStr A property-oriented adaptive design framework for rapid discovery of energetic molecules based on small-scale labeled datasets
title_full_unstemmed A property-oriented adaptive design framework for rapid discovery of energetic molecules based on small-scale labeled datasets
title_short A property-oriented adaptive design framework for rapid discovery of energetic molecules based on small-scale labeled datasets
title_sort property-oriented adaptive design framework for rapid discovery of energetic molecules based on small-scale labeled datasets
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9037014/
https://www.ncbi.nlm.nih.gov/pubmed/35478886
http://dx.doi.org/10.1039/d1ra03715c
work_keys_str_mv AT xieyunhao apropertyorientedadaptivedesignframeworkforrapiddiscoveryofenergeticmoleculesbasedonsmallscalelabeleddatasets
AT liuyijing apropertyorientedadaptivedesignframeworkforrapiddiscoveryofenergeticmoleculesbasedonsmallscalelabeleddatasets
AT hurenling apropertyorientedadaptivedesignframeworkforrapiddiscoveryofenergeticmoleculesbasedonsmallscalelabeleddatasets
AT linxu apropertyorientedadaptivedesignframeworkforrapiddiscoveryofenergeticmoleculesbasedonsmallscalelabeleddatasets
AT hujing apropertyorientedadaptivedesignframeworkforrapiddiscoveryofenergeticmoleculesbasedonsmallscalelabeleddatasets
AT puxuemei apropertyorientedadaptivedesignframeworkforrapiddiscoveryofenergeticmoleculesbasedonsmallscalelabeleddatasets
AT xieyunhao propertyorientedadaptivedesignframeworkforrapiddiscoveryofenergeticmoleculesbasedonsmallscalelabeleddatasets
AT liuyijing propertyorientedadaptivedesignframeworkforrapiddiscoveryofenergeticmoleculesbasedonsmallscalelabeleddatasets
AT hurenling propertyorientedadaptivedesignframeworkforrapiddiscoveryofenergeticmoleculesbasedonsmallscalelabeleddatasets
AT linxu propertyorientedadaptivedesignframeworkforrapiddiscoveryofenergeticmoleculesbasedonsmallscalelabeleddatasets
AT hujing propertyorientedadaptivedesignframeworkforrapiddiscoveryofenergeticmoleculesbasedonsmallscalelabeleddatasets
AT puxuemei propertyorientedadaptivedesignframeworkforrapiddiscoveryofenergeticmoleculesbasedonsmallscalelabeleddatasets