Cargando…

Machine Learning to Predict Homolytic Dissociation Energies of C−H Bonds: Calibration of DFT‐based Models with Experimental Data

Random Forest (RF) QSPR models were developed with a data set of homolytic bond dissociation energies (BDE) previously calculated by B3LYP/6‐311++G(d,p)//DFTB for 2263 sp3C−H covalent bonds. The best set of attributes consisted in 114 descriptors of the carbon atom (counts of atom types in 5 spheres...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Wanli, Luan, Yue, Zhang, Qingyou, Aires‐de‐Sousa, Joao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10078411/
https://www.ncbi.nlm.nih.gov/pubmed/36167940
http://dx.doi.org/10.1002/minf.202200193
_version_ 1785020512485769216
author Li, Wanli
Luan, Yue
Zhang, Qingyou
Aires‐de‐Sousa, Joao
author_facet Li, Wanli
Luan, Yue
Zhang, Qingyou
Aires‐de‐Sousa, Joao
author_sort Li, Wanli
collection PubMed
description Random Forest (RF) QSPR models were developed with a data set of homolytic bond dissociation energies (BDE) previously calculated by B3LYP/6‐311++G(d,p)//DFTB for 2263 sp3C−H covalent bonds. The best set of attributes consisted in 114 descriptors of the carbon atom (counts of atom types in 5 spheres around the kernel atom and ring descriptors). The optimized model predicted the DFT‐calculated BDE of an independent test set of 224 bonds with MAE=2.86 kcal/mol. A new data set of 409 bonds from the iBonD database (http://ibond.nankai.edu.cn) was predicted by the RF with a modest MAE (5.36 kcal/mol) but a relatively high R(2) (0.75) against experimental energies. A prediction scheme was explored that corrects the RF prediction with the average deviation observed for the k nearest neighbours (KNN) in an additional memory of experimental data. The corrected predictions achieved MAE=2.22 kcal/mol for an independent test set of 145 bonds and the corresponding experimental bond energies.
format Online
Article
Text
id pubmed-10078411
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-100784112023-04-07 Machine Learning to Predict Homolytic Dissociation Energies of C−H Bonds: Calibration of DFT‐based Models with Experimental Data Li, Wanli Luan, Yue Zhang, Qingyou Aires‐de‐Sousa, Joao Mol Inform Research Articles Random Forest (RF) QSPR models were developed with a data set of homolytic bond dissociation energies (BDE) previously calculated by B3LYP/6‐311++G(d,p)//DFTB for 2263 sp3C−H covalent bonds. The best set of attributes consisted in 114 descriptors of the carbon atom (counts of atom types in 5 spheres around the kernel atom and ring descriptors). The optimized model predicted the DFT‐calculated BDE of an independent test set of 224 bonds with MAE=2.86 kcal/mol. A new data set of 409 bonds from the iBonD database (http://ibond.nankai.edu.cn) was predicted by the RF with a modest MAE (5.36 kcal/mol) but a relatively high R(2) (0.75) against experimental energies. A prediction scheme was explored that corrects the RF prediction with the average deviation observed for the k nearest neighbours (KNN) in an additional memory of experimental data. The corrected predictions achieved MAE=2.22 kcal/mol for an independent test set of 145 bonds and the corresponding experimental bond energies. John Wiley and Sons Inc. 2022-10-19 2023-01 /pmc/articles/PMC10078411/ /pubmed/36167940 http://dx.doi.org/10.1002/minf.202200193 Text en © 2022 The Authors. Molecular Informatics published by Wiley-VCH GmbH https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made.
spellingShingle Research Articles
Li, Wanli
Luan, Yue
Zhang, Qingyou
Aires‐de‐Sousa, Joao
Machine Learning to Predict Homolytic Dissociation Energies of C−H Bonds: Calibration of DFT‐based Models with Experimental Data
title Machine Learning to Predict Homolytic Dissociation Energies of C−H Bonds: Calibration of DFT‐based Models with Experimental Data
title_full Machine Learning to Predict Homolytic Dissociation Energies of C−H Bonds: Calibration of DFT‐based Models with Experimental Data
title_fullStr Machine Learning to Predict Homolytic Dissociation Energies of C−H Bonds: Calibration of DFT‐based Models with Experimental Data
title_full_unstemmed Machine Learning to Predict Homolytic Dissociation Energies of C−H Bonds: Calibration of DFT‐based Models with Experimental Data
title_short Machine Learning to Predict Homolytic Dissociation Energies of C−H Bonds: Calibration of DFT‐based Models with Experimental Data
title_sort machine learning to predict homolytic dissociation energies of c−h bonds: calibration of dft‐based models with experimental data
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10078411/
https://www.ncbi.nlm.nih.gov/pubmed/36167940
http://dx.doi.org/10.1002/minf.202200193
work_keys_str_mv AT liwanli machinelearningtopredicthomolyticdissociationenergiesofchbondscalibrationofdftbasedmodelswithexperimentaldata
AT luanyue machinelearningtopredicthomolyticdissociationenergiesofchbondscalibrationofdftbasedmodelswithexperimentaldata
AT zhangqingyou machinelearningtopredicthomolyticdissociationenergiesofchbondscalibrationofdftbasedmodelswithexperimentaldata
AT airesdesousajoao machinelearningtopredicthomolyticdissociationenergiesofchbondscalibrationofdftbasedmodelswithexperimentaldata