Cargando…
Machine Learning to Predict Homolytic Dissociation Energies of C−H Bonds: Calibration of DFT‐based Models with Experimental Data
Random Forest (RF) QSPR models were developed with a data set of homolytic bond dissociation energies (BDE) previously calculated by B3LYP/6‐311++G(d,p)//DFTB for 2263 sp3C−H covalent bonds. The best set of attributes consisted in 114 descriptors of the carbon atom (counts of atom types in 5 spheres...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10078411/ https://www.ncbi.nlm.nih.gov/pubmed/36167940 http://dx.doi.org/10.1002/minf.202200193 |
_version_ | 1785020512485769216 |
---|---|
author | Li, Wanli Luan, Yue Zhang, Qingyou Aires‐de‐Sousa, Joao |
author_facet | Li, Wanli Luan, Yue Zhang, Qingyou Aires‐de‐Sousa, Joao |
author_sort | Li, Wanli |
collection | PubMed |
description | Random Forest (RF) QSPR models were developed with a data set of homolytic bond dissociation energies (BDE) previously calculated by B3LYP/6‐311++G(d,p)//DFTB for 2263 sp3C−H covalent bonds. The best set of attributes consisted in 114 descriptors of the carbon atom (counts of atom types in 5 spheres around the kernel atom and ring descriptors). The optimized model predicted the DFT‐calculated BDE of an independent test set of 224 bonds with MAE=2.86 kcal/mol. A new data set of 409 bonds from the iBonD database (http://ibond.nankai.edu.cn) was predicted by the RF with a modest MAE (5.36 kcal/mol) but a relatively high R(2) (0.75) against experimental energies. A prediction scheme was explored that corrects the RF prediction with the average deviation observed for the k nearest neighbours (KNN) in an additional memory of experimental data. The corrected predictions achieved MAE=2.22 kcal/mol for an independent test set of 145 bonds and the corresponding experimental bond energies. |
format | Online Article Text |
id | pubmed-10078411 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-100784112023-04-07 Machine Learning to Predict Homolytic Dissociation Energies of C−H Bonds: Calibration of DFT‐based Models with Experimental Data Li, Wanli Luan, Yue Zhang, Qingyou Aires‐de‐Sousa, Joao Mol Inform Research Articles Random Forest (RF) QSPR models were developed with a data set of homolytic bond dissociation energies (BDE) previously calculated by B3LYP/6‐311++G(d,p)//DFTB for 2263 sp3C−H covalent bonds. The best set of attributes consisted in 114 descriptors of the carbon atom (counts of atom types in 5 spheres around the kernel atom and ring descriptors). The optimized model predicted the DFT‐calculated BDE of an independent test set of 224 bonds with MAE=2.86 kcal/mol. A new data set of 409 bonds from the iBonD database (http://ibond.nankai.edu.cn) was predicted by the RF with a modest MAE (5.36 kcal/mol) but a relatively high R(2) (0.75) against experimental energies. A prediction scheme was explored that corrects the RF prediction with the average deviation observed for the k nearest neighbours (KNN) in an additional memory of experimental data. The corrected predictions achieved MAE=2.22 kcal/mol for an independent test set of 145 bonds and the corresponding experimental bond energies. John Wiley and Sons Inc. 2022-10-19 2023-01 /pmc/articles/PMC10078411/ /pubmed/36167940 http://dx.doi.org/10.1002/minf.202200193 Text en © 2022 The Authors. Molecular Informatics published by Wiley-VCH GmbH https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made. |
spellingShingle | Research Articles Li, Wanli Luan, Yue Zhang, Qingyou Aires‐de‐Sousa, Joao Machine Learning to Predict Homolytic Dissociation Energies of C−H Bonds: Calibration of DFT‐based Models with Experimental Data |
title | Machine Learning to Predict Homolytic Dissociation Energies of C−H Bonds: Calibration of DFT‐based Models with Experimental Data |
title_full | Machine Learning to Predict Homolytic Dissociation Energies of C−H Bonds: Calibration of DFT‐based Models with Experimental Data |
title_fullStr | Machine Learning to Predict Homolytic Dissociation Energies of C−H Bonds: Calibration of DFT‐based Models with Experimental Data |
title_full_unstemmed | Machine Learning to Predict Homolytic Dissociation Energies of C−H Bonds: Calibration of DFT‐based Models with Experimental Data |
title_short | Machine Learning to Predict Homolytic Dissociation Energies of C−H Bonds: Calibration of DFT‐based Models with Experimental Data |
title_sort | machine learning to predict homolytic dissociation energies of c−h bonds: calibration of dft‐based models with experimental data |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10078411/ https://www.ncbi.nlm.nih.gov/pubmed/36167940 http://dx.doi.org/10.1002/minf.202200193 |
work_keys_str_mv | AT liwanli machinelearningtopredicthomolyticdissociationenergiesofchbondscalibrationofdftbasedmodelswithexperimentaldata AT luanyue machinelearningtopredicthomolyticdissociationenergiesofchbondscalibrationofdftbasedmodelswithexperimentaldata AT zhangqingyou machinelearningtopredicthomolyticdissociationenergiesofchbondscalibrationofdftbasedmodelswithexperimentaldata AT airesdesousajoao machinelearningtopredicthomolyticdissociationenergiesofchbondscalibrationofdftbasedmodelswithexperimentaldata |