Cargando…

Predicting glycosylation stereoselectivity using machine learning

Predicting the stereochemical outcome of chemical reactions is challenging in mechanistically ambiguous transformations. The stereoselectivity of glycosylation reactions is influenced by at least eleven factors across four chemical participants and temperature. A random forest algorithm was trained...

Descripción completa

Detalles Bibliográficos
Autores principales: Moon, Sooyeon, Chatterjee, Sourav, Seeberger, Peter H., Gilmore, Kerry
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society of Chemistry 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8179398/
https://www.ncbi.nlm.nih.gov/pubmed/34164060
http://dx.doi.org/10.1039/d0sc06222g
_version_ 1783703772095053824
author Moon, Sooyeon
Chatterjee, Sourav
Seeberger, Peter H.
Gilmore, Kerry
author_facet Moon, Sooyeon
Chatterjee, Sourav
Seeberger, Peter H.
Gilmore, Kerry
author_sort Moon, Sooyeon
collection PubMed
description Predicting the stereochemical outcome of chemical reactions is challenging in mechanistically ambiguous transformations. The stereoselectivity of glycosylation reactions is influenced by at least eleven factors across four chemical participants and temperature. A random forest algorithm was trained using a highly reproducible, concise dataset to accurately predict the stereoselective outcome of glycosylations. The steric and electronic contributions of all chemical reagents and solvents were quantified by quantum mechanical calculations. The trained model accurately predicts stereoselectivities for unseen nucleophiles, electrophiles, acid catalyst, and solvents across a wide temperature range (overall root mean square error 6.8%). All predictions were validated experimentally on a standardized microreactor platform. The model helped to identify novel ways to control glycosylation stereoselectivity and accurately predicts previously unknown means of stereocontrol. By quantifying the degree of influence of each variable, we begin to gain a better general understanding of the transformation, for example that environmental factors influence the stereoselectivity of glycosylations more than the coupling partners in this area of chemical space.
format Online
Article
Text
id pubmed-8179398
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher The Royal Society of Chemistry
record_format MEDLINE/PubMed
spelling pubmed-81793982021-06-22 Predicting glycosylation stereoselectivity using machine learning Moon, Sooyeon Chatterjee, Sourav Seeberger, Peter H. Gilmore, Kerry Chem Sci Chemistry Predicting the stereochemical outcome of chemical reactions is challenging in mechanistically ambiguous transformations. The stereoselectivity of glycosylation reactions is influenced by at least eleven factors across four chemical participants and temperature. A random forest algorithm was trained using a highly reproducible, concise dataset to accurately predict the stereoselective outcome of glycosylations. The steric and electronic contributions of all chemical reagents and solvents were quantified by quantum mechanical calculations. The trained model accurately predicts stereoselectivities for unseen nucleophiles, electrophiles, acid catalyst, and solvents across a wide temperature range (overall root mean square error 6.8%). All predictions were validated experimentally on a standardized microreactor platform. The model helped to identify novel ways to control glycosylation stereoselectivity and accurately predicts previously unknown means of stereocontrol. By quantifying the degree of influence of each variable, we begin to gain a better general understanding of the transformation, for example that environmental factors influence the stereoselectivity of glycosylations more than the coupling partners in this area of chemical space. The Royal Society of Chemistry 2020-12-26 /pmc/articles/PMC8179398/ /pubmed/34164060 http://dx.doi.org/10.1039/d0sc06222g Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by/3.0/
spellingShingle Chemistry
Moon, Sooyeon
Chatterjee, Sourav
Seeberger, Peter H.
Gilmore, Kerry
Predicting glycosylation stereoselectivity using machine learning
title Predicting glycosylation stereoselectivity using machine learning
title_full Predicting glycosylation stereoselectivity using machine learning
title_fullStr Predicting glycosylation stereoselectivity using machine learning
title_full_unstemmed Predicting glycosylation stereoselectivity using machine learning
title_short Predicting glycosylation stereoselectivity using machine learning
title_sort predicting glycosylation stereoselectivity using machine learning
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8179398/
https://www.ncbi.nlm.nih.gov/pubmed/34164060
http://dx.doi.org/10.1039/d0sc06222g
work_keys_str_mv AT moonsooyeon predictingglycosylationstereoselectivityusingmachinelearning
AT chatterjeesourav predictingglycosylationstereoselectivityusingmachinelearning
AT seebergerpeterh predictingglycosylationstereoselectivityusingmachinelearning
AT gilmorekerry predictingglycosylationstereoselectivityusingmachinelearning