Cargando…

Reaction-based machine learning representations for predicting the enantioselectivity of organocatalysts

Hundreds of catalytic methods are developed each year to meet the demand for high-purity chiral compounds. The computational design of enantioselective organocatalysts remains a significant challenge, as catalysts are typically discovered through experimental screening. Recent advances in combining...

Descripción completa

Detalles Bibliográficos
Autores principales: Gallarati, Simone, Fabregat, Raimon, Laplaza, Rubén, Bhattacharjee, Sinjini, Wodrich, Matthew D., Corminboeuf, Clemence
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society of Chemistry 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8153079/
https://www.ncbi.nlm.nih.gov/pubmed/34123316
http://dx.doi.org/10.1039/d1sc00482d
_version_ 1783698722462367744
author Gallarati, Simone
Fabregat, Raimon
Laplaza, Rubén
Bhattacharjee, Sinjini
Wodrich, Matthew D.
Corminboeuf, Clemence
author_facet Gallarati, Simone
Fabregat, Raimon
Laplaza, Rubén
Bhattacharjee, Sinjini
Wodrich, Matthew D.
Corminboeuf, Clemence
author_sort Gallarati, Simone
collection PubMed
description Hundreds of catalytic methods are developed each year to meet the demand for high-purity chiral compounds. The computational design of enantioselective organocatalysts remains a significant challenge, as catalysts are typically discovered through experimental screening. Recent advances in combining quantum chemical computations and machine learning (ML) hold great potential to propel the next leap forward in asymmetric catalysis. Within the context of quantum chemical machine learning (QML, or atomistic ML), the ML representations used to encode the three-dimensional structure of molecules and evaluate their similarity cannot easily capture the subtle energy differences that govern enantioselectivity. Here, we present a general strategy for improving molecular representations within an atomistic machine learning model to predict the DFT-computed enantiomeric excess of asymmetric propargylation organocatalysts solely from the structure of catalytic cycle intermediates. Mean absolute errors as low as 0.25 kcal mol(−1) were achieved in predictions of the activation energy with respect to DFT computations. By virtue of its design, this strategy is generalisable to other ML models, to experimental data and to any catalytic asymmetric reaction, enabling the rapid screening of structurally diverse organocatalysts from available structural information.
format Online
Article
Text
id pubmed-8153079
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher The Royal Society of Chemistry
record_format MEDLINE/PubMed
spelling pubmed-81530792021-06-11 Reaction-based machine learning representations for predicting the enantioselectivity of organocatalysts Gallarati, Simone Fabregat, Raimon Laplaza, Rubén Bhattacharjee, Sinjini Wodrich, Matthew D. Corminboeuf, Clemence Chem Sci Chemistry Hundreds of catalytic methods are developed each year to meet the demand for high-purity chiral compounds. The computational design of enantioselective organocatalysts remains a significant challenge, as catalysts are typically discovered through experimental screening. Recent advances in combining quantum chemical computations and machine learning (ML) hold great potential to propel the next leap forward in asymmetric catalysis. Within the context of quantum chemical machine learning (QML, or atomistic ML), the ML representations used to encode the three-dimensional structure of molecules and evaluate their similarity cannot easily capture the subtle energy differences that govern enantioselectivity. Here, we present a general strategy for improving molecular representations within an atomistic machine learning model to predict the DFT-computed enantiomeric excess of asymmetric propargylation organocatalysts solely from the structure of catalytic cycle intermediates. Mean absolute errors as low as 0.25 kcal mol(−1) were achieved in predictions of the activation energy with respect to DFT computations. By virtue of its design, this strategy is generalisable to other ML models, to experimental data and to any catalytic asymmetric reaction, enabling the rapid screening of structurally diverse organocatalysts from available structural information. The Royal Society of Chemistry 2021-04-03 /pmc/articles/PMC8153079/ /pubmed/34123316 http://dx.doi.org/10.1039/d1sc00482d Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by-nc/3.0/
spellingShingle Chemistry
Gallarati, Simone
Fabregat, Raimon
Laplaza, Rubén
Bhattacharjee, Sinjini
Wodrich, Matthew D.
Corminboeuf, Clemence
Reaction-based machine learning representations for predicting the enantioselectivity of organocatalysts
title Reaction-based machine learning representations for predicting the enantioselectivity of organocatalysts
title_full Reaction-based machine learning representations for predicting the enantioselectivity of organocatalysts
title_fullStr Reaction-based machine learning representations for predicting the enantioselectivity of organocatalysts
title_full_unstemmed Reaction-based machine learning representations for predicting the enantioselectivity of organocatalysts
title_short Reaction-based machine learning representations for predicting the enantioselectivity of organocatalysts
title_sort reaction-based machine learning representations for predicting the enantioselectivity of organocatalysts
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8153079/
https://www.ncbi.nlm.nih.gov/pubmed/34123316
http://dx.doi.org/10.1039/d1sc00482d
work_keys_str_mv AT gallaratisimone reactionbasedmachinelearningrepresentationsforpredictingtheenantioselectivityoforganocatalysts
AT fabregatraimon reactionbasedmachinelearningrepresentationsforpredictingtheenantioselectivityoforganocatalysts
AT laplazaruben reactionbasedmachinelearningrepresentationsforpredictingtheenantioselectivityoforganocatalysts
AT bhattacharjeesinjini reactionbasedmachinelearningrepresentationsforpredictingtheenantioselectivityoforganocatalysts
AT wodrichmatthewd reactionbasedmachinelearningrepresentationsforpredictingtheenantioselectivityoforganocatalysts
AT corminboeufclemence reactionbasedmachinelearningrepresentationsforpredictingtheenantioselectivityoforganocatalysts