Cargando…

Novel Random Forest Ensemble Modeling Strategy Combined with Quantitative Structure–Property Relationship for Density Prediction of Energetic Materials

[Image: see text] With the further development of the concept of green chemistry, the new generation of energetic materials tends to exhibit detonation properties such as higher insensitivity, higher density, and higher energy. Therefore, the precise molecular design and green and efficient synthesi...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Maogang, Lai, Weipeng, Li, Ruirui, Zhou, Jiajun, Liu, Yingzhe, Yu, Tao, Zhang, Tianlong, Tang, Hongsheng, Li, Hua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2023
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9850487/
https://www.ncbi.nlm.nih.gov/pubmed/36687054
http://dx.doi.org/10.1021/acsomega.2c07436
_version_ 1784872197537398784
author Li, Maogang
Lai, Weipeng
Li, Ruirui
Zhou, Jiajun
Liu, Yingzhe
Yu, Tao
Zhang, Tianlong
Tang, Hongsheng
Li, Hua
author_facet Li, Maogang
Lai, Weipeng
Li, Ruirui
Zhou, Jiajun
Liu, Yingzhe
Yu, Tao
Zhang, Tianlong
Tang, Hongsheng
Li, Hua
author_sort Li, Maogang
collection PubMed
description [Image: see text] With the further development of the concept of green chemistry, the new generation of energetic materials tends to exhibit detonation properties such as higher insensitivity, higher density, and higher energy. Therefore, the precise molecular design and green and efficient synthesis of energetic materials will be one of the serious challenges. For the purpose of accurate prediction of detonation performance of energetic materials, an ensemble modeling strategy based on the combination of Monte Carlo (MC) and variable importance measurement (VIM) improved random forest (RF) and quantitative structure–property relationship (QSPR) is proposed, which was successfully used for density prediction of energetic materials. First, the structure of 162 energetic compounds was optimized by Gaussian software, and the molecular descriptor data were calculated by CODESSA software based on the optimized molecular structure. Then, the MCVIMRF_Med ensemble model was constructed on the basis of the above molecular descriptor data and the corresponding energetic compound density index. The joint X–Y distance algorithm (SPXY) is used to partition the data set. And then, MC is used to further divide the calibration set data into multiple subsets for the construction of the ensemble model. The subset size and the number of iterations of the MCVIMRF_Med ensemble model were optimized through MC cross validation. The final output strategy of the ensemble model is optimized based on the optimized parameters, and an output optimization method based on median screening is proposed and successfully applied for the prediction performance optimization of the MCVIMRF_Med ensemble model. To further investigate the performance of the MCVIMRF_Med ensemble model, the performance of it was compared with partial least squares, RF, VIMRF, and MCVIMRF calibration models. It shows that the MCVIMRF_Med ensemble model can achieve a better prediction result for the density of energetic materials, with R(2)(CV) of 0.9596, RMSECV of 0.0437 g/cm(3), R(2)(P) of 0.9768, RMSEP of 0.0578 g/cm(3), and relative analysis deviation of prediction set of 3.951. Therefore, the MCVIMRF_Med ensemble modeling strategy combined with QSPR is an effective approach for the density prediction of energetic materials. This work is expected to provide new research ideas and technical support for accurate prediction of detonation performance of energetic materials.
format Online
Article
Text
id pubmed-9850487
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-98504872023-01-20 Novel Random Forest Ensemble Modeling Strategy Combined with Quantitative Structure–Property Relationship for Density Prediction of Energetic Materials Li, Maogang Lai, Weipeng Li, Ruirui Zhou, Jiajun Liu, Yingzhe Yu, Tao Zhang, Tianlong Tang, Hongsheng Li, Hua ACS Omega [Image: see text] With the further development of the concept of green chemistry, the new generation of energetic materials tends to exhibit detonation properties such as higher insensitivity, higher density, and higher energy. Therefore, the precise molecular design and green and efficient synthesis of energetic materials will be one of the serious challenges. For the purpose of accurate prediction of detonation performance of energetic materials, an ensemble modeling strategy based on the combination of Monte Carlo (MC) and variable importance measurement (VIM) improved random forest (RF) and quantitative structure–property relationship (QSPR) is proposed, which was successfully used for density prediction of energetic materials. First, the structure of 162 energetic compounds was optimized by Gaussian software, and the molecular descriptor data were calculated by CODESSA software based on the optimized molecular structure. Then, the MCVIMRF_Med ensemble model was constructed on the basis of the above molecular descriptor data and the corresponding energetic compound density index. The joint X–Y distance algorithm (SPXY) is used to partition the data set. And then, MC is used to further divide the calibration set data into multiple subsets for the construction of the ensemble model. The subset size and the number of iterations of the MCVIMRF_Med ensemble model were optimized through MC cross validation. The final output strategy of the ensemble model is optimized based on the optimized parameters, and an output optimization method based on median screening is proposed and successfully applied for the prediction performance optimization of the MCVIMRF_Med ensemble model. To further investigate the performance of the MCVIMRF_Med ensemble model, the performance of it was compared with partial least squares, RF, VIMRF, and MCVIMRF calibration models. It shows that the MCVIMRF_Med ensemble model can achieve a better prediction result for the density of energetic materials, with R(2)(CV) of 0.9596, RMSECV of 0.0437 g/cm(3), R(2)(P) of 0.9768, RMSEP of 0.0578 g/cm(3), and relative analysis deviation of prediction set of 3.951. Therefore, the MCVIMRF_Med ensemble modeling strategy combined with QSPR is an effective approach for the density prediction of energetic materials. This work is expected to provide new research ideas and technical support for accurate prediction of detonation performance of energetic materials. American Chemical Society 2023-01-04 /pmc/articles/PMC9850487/ /pubmed/36687054 http://dx.doi.org/10.1021/acsomega.2c07436 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Li, Maogang
Lai, Weipeng
Li, Ruirui
Zhou, Jiajun
Liu, Yingzhe
Yu, Tao
Zhang, Tianlong
Tang, Hongsheng
Li, Hua
Novel Random Forest Ensemble Modeling Strategy Combined with Quantitative Structure–Property Relationship for Density Prediction of Energetic Materials
title Novel Random Forest Ensemble Modeling Strategy Combined with Quantitative Structure–Property Relationship for Density Prediction of Energetic Materials
title_full Novel Random Forest Ensemble Modeling Strategy Combined with Quantitative Structure–Property Relationship for Density Prediction of Energetic Materials
title_fullStr Novel Random Forest Ensemble Modeling Strategy Combined with Quantitative Structure–Property Relationship for Density Prediction of Energetic Materials
title_full_unstemmed Novel Random Forest Ensemble Modeling Strategy Combined with Quantitative Structure–Property Relationship for Density Prediction of Energetic Materials
title_short Novel Random Forest Ensemble Modeling Strategy Combined with Quantitative Structure–Property Relationship for Density Prediction of Energetic Materials
title_sort novel random forest ensemble modeling strategy combined with quantitative structure–property relationship for density prediction of energetic materials
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9850487/
https://www.ncbi.nlm.nih.gov/pubmed/36687054
http://dx.doi.org/10.1021/acsomega.2c07436
work_keys_str_mv AT limaogang novelrandomforestensemblemodelingstrategycombinedwithquantitativestructurepropertyrelationshipfordensitypredictionofenergeticmaterials
AT laiweipeng novelrandomforestensemblemodelingstrategycombinedwithquantitativestructurepropertyrelationshipfordensitypredictionofenergeticmaterials
AT liruirui novelrandomforestensemblemodelingstrategycombinedwithquantitativestructurepropertyrelationshipfordensitypredictionofenergeticmaterials
AT zhoujiajun novelrandomforestensemblemodelingstrategycombinedwithquantitativestructurepropertyrelationshipfordensitypredictionofenergeticmaterials
AT liuyingzhe novelrandomforestensemblemodelingstrategycombinedwithquantitativestructurepropertyrelationshipfordensitypredictionofenergeticmaterials
AT yutao novelrandomforestensemblemodelingstrategycombinedwithquantitativestructurepropertyrelationshipfordensitypredictionofenergeticmaterials
AT zhangtianlong novelrandomforestensemblemodelingstrategycombinedwithquantitativestructurepropertyrelationshipfordensitypredictionofenergeticmaterials
AT tanghongsheng novelrandomforestensemblemodelingstrategycombinedwithquantitativestructurepropertyrelationshipfordensitypredictionofenergeticmaterials
AT lihua novelrandomforestensemblemodelingstrategycombinedwithquantitativestructurepropertyrelationshipfordensitypredictionofenergeticmaterials