Cargando…
Novel Random Forest Ensemble Modeling Strategy Combined with Quantitative Structure–Property Relationship for Density Prediction of Energetic Materials
[Image: see text] With the further development of the concept of green chemistry, the new generation of energetic materials tends to exhibit detonation properties such as higher insensitivity, higher density, and higher energy. Therefore, the precise molecular design and green and efficient synthesi...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2023
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9850487/ https://www.ncbi.nlm.nih.gov/pubmed/36687054 http://dx.doi.org/10.1021/acsomega.2c07436 |
_version_ | 1784872197537398784 |
---|---|
author | Li, Maogang Lai, Weipeng Li, Ruirui Zhou, Jiajun Liu, Yingzhe Yu, Tao Zhang, Tianlong Tang, Hongsheng Li, Hua |
author_facet | Li, Maogang Lai, Weipeng Li, Ruirui Zhou, Jiajun Liu, Yingzhe Yu, Tao Zhang, Tianlong Tang, Hongsheng Li, Hua |
author_sort | Li, Maogang |
collection | PubMed |
description | [Image: see text] With the further development of the concept of green chemistry, the new generation of energetic materials tends to exhibit detonation properties such as higher insensitivity, higher density, and higher energy. Therefore, the precise molecular design and green and efficient synthesis of energetic materials will be one of the serious challenges. For the purpose of accurate prediction of detonation performance of energetic materials, an ensemble modeling strategy based on the combination of Monte Carlo (MC) and variable importance measurement (VIM) improved random forest (RF) and quantitative structure–property relationship (QSPR) is proposed, which was successfully used for density prediction of energetic materials. First, the structure of 162 energetic compounds was optimized by Gaussian software, and the molecular descriptor data were calculated by CODESSA software based on the optimized molecular structure. Then, the MCVIMRF_Med ensemble model was constructed on the basis of the above molecular descriptor data and the corresponding energetic compound density index. The joint X–Y distance algorithm (SPXY) is used to partition the data set. And then, MC is used to further divide the calibration set data into multiple subsets for the construction of the ensemble model. The subset size and the number of iterations of the MCVIMRF_Med ensemble model were optimized through MC cross validation. The final output strategy of the ensemble model is optimized based on the optimized parameters, and an output optimization method based on median screening is proposed and successfully applied for the prediction performance optimization of the MCVIMRF_Med ensemble model. To further investigate the performance of the MCVIMRF_Med ensemble model, the performance of it was compared with partial least squares, RF, VIMRF, and MCVIMRF calibration models. It shows that the MCVIMRF_Med ensemble model can achieve a better prediction result for the density of energetic materials, with R(2)(CV) of 0.9596, RMSECV of 0.0437 g/cm(3), R(2)(P) of 0.9768, RMSEP of 0.0578 g/cm(3), and relative analysis deviation of prediction set of 3.951. Therefore, the MCVIMRF_Med ensemble modeling strategy combined with QSPR is an effective approach for the density prediction of energetic materials. This work is expected to provide new research ideas and technical support for accurate prediction of detonation performance of energetic materials. |
format | Online Article Text |
id | pubmed-9850487 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-98504872023-01-20 Novel Random Forest Ensemble Modeling Strategy Combined with Quantitative Structure–Property Relationship for Density Prediction of Energetic Materials Li, Maogang Lai, Weipeng Li, Ruirui Zhou, Jiajun Liu, Yingzhe Yu, Tao Zhang, Tianlong Tang, Hongsheng Li, Hua ACS Omega [Image: see text] With the further development of the concept of green chemistry, the new generation of energetic materials tends to exhibit detonation properties such as higher insensitivity, higher density, and higher energy. Therefore, the precise molecular design and green and efficient synthesis of energetic materials will be one of the serious challenges. For the purpose of accurate prediction of detonation performance of energetic materials, an ensemble modeling strategy based on the combination of Monte Carlo (MC) and variable importance measurement (VIM) improved random forest (RF) and quantitative structure–property relationship (QSPR) is proposed, which was successfully used for density prediction of energetic materials. First, the structure of 162 energetic compounds was optimized by Gaussian software, and the molecular descriptor data were calculated by CODESSA software based on the optimized molecular structure. Then, the MCVIMRF_Med ensemble model was constructed on the basis of the above molecular descriptor data and the corresponding energetic compound density index. The joint X–Y distance algorithm (SPXY) is used to partition the data set. And then, MC is used to further divide the calibration set data into multiple subsets for the construction of the ensemble model. The subset size and the number of iterations of the MCVIMRF_Med ensemble model were optimized through MC cross validation. The final output strategy of the ensemble model is optimized based on the optimized parameters, and an output optimization method based on median screening is proposed and successfully applied for the prediction performance optimization of the MCVIMRF_Med ensemble model. To further investigate the performance of the MCVIMRF_Med ensemble model, the performance of it was compared with partial least squares, RF, VIMRF, and MCVIMRF calibration models. It shows that the MCVIMRF_Med ensemble model can achieve a better prediction result for the density of energetic materials, with R(2)(CV) of 0.9596, RMSECV of 0.0437 g/cm(3), R(2)(P) of 0.9768, RMSEP of 0.0578 g/cm(3), and relative analysis deviation of prediction set of 3.951. Therefore, the MCVIMRF_Med ensemble modeling strategy combined with QSPR is an effective approach for the density prediction of energetic materials. This work is expected to provide new research ideas and technical support for accurate prediction of detonation performance of energetic materials. American Chemical Society 2023-01-04 /pmc/articles/PMC9850487/ /pubmed/36687054 http://dx.doi.org/10.1021/acsomega.2c07436 Text en © 2023 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Li, Maogang Lai, Weipeng Li, Ruirui Zhou, Jiajun Liu, Yingzhe Yu, Tao Zhang, Tianlong Tang, Hongsheng Li, Hua Novel Random Forest Ensemble Modeling Strategy Combined with Quantitative Structure–Property Relationship for Density Prediction of Energetic Materials |
title | Novel Random Forest
Ensemble Modeling Strategy Combined
with Quantitative Structure–Property Relationship for Density
Prediction of Energetic Materials |
title_full | Novel Random Forest
Ensemble Modeling Strategy Combined
with Quantitative Structure–Property Relationship for Density
Prediction of Energetic Materials |
title_fullStr | Novel Random Forest
Ensemble Modeling Strategy Combined
with Quantitative Structure–Property Relationship for Density
Prediction of Energetic Materials |
title_full_unstemmed | Novel Random Forest
Ensemble Modeling Strategy Combined
with Quantitative Structure–Property Relationship for Density
Prediction of Energetic Materials |
title_short | Novel Random Forest
Ensemble Modeling Strategy Combined
with Quantitative Structure–Property Relationship for Density
Prediction of Energetic Materials |
title_sort | novel random forest
ensemble modeling strategy combined
with quantitative structure–property relationship for density
prediction of energetic materials |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9850487/ https://www.ncbi.nlm.nih.gov/pubmed/36687054 http://dx.doi.org/10.1021/acsomega.2c07436 |
work_keys_str_mv | AT limaogang novelrandomforestensemblemodelingstrategycombinedwithquantitativestructurepropertyrelationshipfordensitypredictionofenergeticmaterials AT laiweipeng novelrandomforestensemblemodelingstrategycombinedwithquantitativestructurepropertyrelationshipfordensitypredictionofenergeticmaterials AT liruirui novelrandomforestensemblemodelingstrategycombinedwithquantitativestructurepropertyrelationshipfordensitypredictionofenergeticmaterials AT zhoujiajun novelrandomforestensemblemodelingstrategycombinedwithquantitativestructurepropertyrelationshipfordensitypredictionofenergeticmaterials AT liuyingzhe novelrandomforestensemblemodelingstrategycombinedwithquantitativestructurepropertyrelationshipfordensitypredictionofenergeticmaterials AT yutao novelrandomforestensemblemodelingstrategycombinedwithquantitativestructurepropertyrelationshipfordensitypredictionofenergeticmaterials AT zhangtianlong novelrandomforestensemblemodelingstrategycombinedwithquantitativestructurepropertyrelationshipfordensitypredictionofenergeticmaterials AT tanghongsheng novelrandomforestensemblemodelingstrategycombinedwithquantitativestructurepropertyrelationshipfordensitypredictionofenergeticmaterials AT lihua novelrandomforestensemblemodelingstrategycombinedwithquantitativestructurepropertyrelationshipfordensitypredictionofenergeticmaterials |