Cargando…

Machine learning and semi-empirical calculations: a synergistic approach to rapid, accurate, and mechanism-based reaction barrier prediction

Modern QM modelling methods, such as DFT, have provided detailed mechanistic insights into countless reactions. However, their computational cost inhibits their ability to rapidly screen large numbers of substrates and catalysts in reaction discovery. For a C–C bond forming nitro-Michael addition, w...

Descripción completa

Detalles Bibliográficos
Autores principales: Farrar, Elliot H. E., Grayson, Matthew N.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society of Chemistry 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9242013/
https://www.ncbi.nlm.nih.gov/pubmed/35872815
http://dx.doi.org/10.1039/d2sc02925a
_version_ 1784737959063322624
author Farrar, Elliot H. E.
Grayson, Matthew N.
author_facet Farrar, Elliot H. E.
Grayson, Matthew N.
author_sort Farrar, Elliot H. E.
collection PubMed
description Modern QM modelling methods, such as DFT, have provided detailed mechanistic insights into countless reactions. However, their computational cost inhibits their ability to rapidly screen large numbers of substrates and catalysts in reaction discovery. For a C–C bond forming nitro-Michael addition, we introduce a synergistic semi-empirical quantum mechanical (SQM) and machine learning (ML) approach that allows the prediction of DFT-quality reaction barriers in minutes, even on a standard laptop using widely available modelling software. Mean absolute errors (MAEs) are obtained that are below the accepted chemical accuracy threshold of 1 kcal mol(−1) and substantially better than SQM methods without ML correction (5.71 kcal mol(−1)). Predictive power is shown to hold when the ML models are applied to an unseen set of compounds from the toxicology literature. Mechanistic insight is also achieved via the generation of full SQM transition state (TS) structures which are found to be very good approximations for the DFT-level geometries, revealing important steric interactions in some TSs. This combination of speed, accuracy, and mechanistic insight is unprecedented; current ML barrier models compromise on at least one of these important criteria.
format Online
Article
Text
id pubmed-9242013
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher The Royal Society of Chemistry
record_format MEDLINE/PubMed
spelling pubmed-92420132022-07-22 Machine learning and semi-empirical calculations: a synergistic approach to rapid, accurate, and mechanism-based reaction barrier prediction Farrar, Elliot H. E. Grayson, Matthew N. Chem Sci Chemistry Modern QM modelling methods, such as DFT, have provided detailed mechanistic insights into countless reactions. However, their computational cost inhibits their ability to rapidly screen large numbers of substrates and catalysts in reaction discovery. For a C–C bond forming nitro-Michael addition, we introduce a synergistic semi-empirical quantum mechanical (SQM) and machine learning (ML) approach that allows the prediction of DFT-quality reaction barriers in minutes, even on a standard laptop using widely available modelling software. Mean absolute errors (MAEs) are obtained that are below the accepted chemical accuracy threshold of 1 kcal mol(−1) and substantially better than SQM methods without ML correction (5.71 kcal mol(−1)). Predictive power is shown to hold when the ML models are applied to an unseen set of compounds from the toxicology literature. Mechanistic insight is also achieved via the generation of full SQM transition state (TS) structures which are found to be very good approximations for the DFT-level geometries, revealing important steric interactions in some TSs. This combination of speed, accuracy, and mechanistic insight is unprecedented; current ML barrier models compromise on at least one of these important criteria. The Royal Society of Chemistry 2022-06-14 /pmc/articles/PMC9242013/ /pubmed/35872815 http://dx.doi.org/10.1039/d2sc02925a Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by/3.0/
spellingShingle Chemistry
Farrar, Elliot H. E.
Grayson, Matthew N.
Machine learning and semi-empirical calculations: a synergistic approach to rapid, accurate, and mechanism-based reaction barrier prediction
title Machine learning and semi-empirical calculations: a synergistic approach to rapid, accurate, and mechanism-based reaction barrier prediction
title_full Machine learning and semi-empirical calculations: a synergistic approach to rapid, accurate, and mechanism-based reaction barrier prediction
title_fullStr Machine learning and semi-empirical calculations: a synergistic approach to rapid, accurate, and mechanism-based reaction barrier prediction
title_full_unstemmed Machine learning and semi-empirical calculations: a synergistic approach to rapid, accurate, and mechanism-based reaction barrier prediction
title_short Machine learning and semi-empirical calculations: a synergistic approach to rapid, accurate, and mechanism-based reaction barrier prediction
title_sort machine learning and semi-empirical calculations: a synergistic approach to rapid, accurate, and mechanism-based reaction barrier prediction
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9242013/
https://www.ncbi.nlm.nih.gov/pubmed/35872815
http://dx.doi.org/10.1039/d2sc02925a
work_keys_str_mv AT farrarelliothe machinelearningandsemiempiricalcalculationsasynergisticapproachtorapidaccurateandmechanismbasedreactionbarrierprediction
AT graysonmatthewn machinelearningandsemiempiricalcalculationsasynergisticapproachtorapidaccurateandmechanismbasedreactionbarrierprediction