Cargando…
Predicting glycosylation stereoselectivity using machine learning
Predicting the stereochemical outcome of chemical reactions is challenging in mechanistically ambiguous transformations. The stereoselectivity of glycosylation reactions is influenced by at least eleven factors across four chemical participants and temperature. A random forest algorithm was trained...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Royal Society of Chemistry
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8179398/ https://www.ncbi.nlm.nih.gov/pubmed/34164060 http://dx.doi.org/10.1039/d0sc06222g |
_version_ | 1783703772095053824 |
---|---|
author | Moon, Sooyeon Chatterjee, Sourav Seeberger, Peter H. Gilmore, Kerry |
author_facet | Moon, Sooyeon Chatterjee, Sourav Seeberger, Peter H. Gilmore, Kerry |
author_sort | Moon, Sooyeon |
collection | PubMed |
description | Predicting the stereochemical outcome of chemical reactions is challenging in mechanistically ambiguous transformations. The stereoselectivity of glycosylation reactions is influenced by at least eleven factors across four chemical participants and temperature. A random forest algorithm was trained using a highly reproducible, concise dataset to accurately predict the stereoselective outcome of glycosylations. The steric and electronic contributions of all chemical reagents and solvents were quantified by quantum mechanical calculations. The trained model accurately predicts stereoselectivities for unseen nucleophiles, electrophiles, acid catalyst, and solvents across a wide temperature range (overall root mean square error 6.8%). All predictions were validated experimentally on a standardized microreactor platform. The model helped to identify novel ways to control glycosylation stereoselectivity and accurately predicts previously unknown means of stereocontrol. By quantifying the degree of influence of each variable, we begin to gain a better general understanding of the transformation, for example that environmental factors influence the stereoselectivity of glycosylations more than the coupling partners in this area of chemical space. |
format | Online Article Text |
id | pubmed-8179398 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | The Royal Society of Chemistry |
record_format | MEDLINE/PubMed |
spelling | pubmed-81793982021-06-22 Predicting glycosylation stereoselectivity using machine learning Moon, Sooyeon Chatterjee, Sourav Seeberger, Peter H. Gilmore, Kerry Chem Sci Chemistry Predicting the stereochemical outcome of chemical reactions is challenging in mechanistically ambiguous transformations. The stereoselectivity of glycosylation reactions is influenced by at least eleven factors across four chemical participants and temperature. A random forest algorithm was trained using a highly reproducible, concise dataset to accurately predict the stereoselective outcome of glycosylations. The steric and electronic contributions of all chemical reagents and solvents were quantified by quantum mechanical calculations. The trained model accurately predicts stereoselectivities for unseen nucleophiles, electrophiles, acid catalyst, and solvents across a wide temperature range (overall root mean square error 6.8%). All predictions were validated experimentally on a standardized microreactor platform. The model helped to identify novel ways to control glycosylation stereoselectivity and accurately predicts previously unknown means of stereocontrol. By quantifying the degree of influence of each variable, we begin to gain a better general understanding of the transformation, for example that environmental factors influence the stereoselectivity of glycosylations more than the coupling partners in this area of chemical space. The Royal Society of Chemistry 2020-12-26 /pmc/articles/PMC8179398/ /pubmed/34164060 http://dx.doi.org/10.1039/d0sc06222g Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by/3.0/ |
spellingShingle | Chemistry Moon, Sooyeon Chatterjee, Sourav Seeberger, Peter H. Gilmore, Kerry Predicting glycosylation stereoselectivity using machine learning |
title | Predicting glycosylation stereoselectivity using machine learning |
title_full | Predicting glycosylation stereoselectivity using machine learning |
title_fullStr | Predicting glycosylation stereoselectivity using machine learning |
title_full_unstemmed | Predicting glycosylation stereoselectivity using machine learning |
title_short | Predicting glycosylation stereoselectivity using machine learning |
title_sort | predicting glycosylation stereoselectivity using machine learning |
topic | Chemistry |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8179398/ https://www.ncbi.nlm.nih.gov/pubmed/34164060 http://dx.doi.org/10.1039/d0sc06222g |
work_keys_str_mv | AT moonsooyeon predictingglycosylationstereoselectivityusingmachinelearning AT chatterjeesourav predictingglycosylationstereoselectivityusingmachinelearning AT seebergerpeterh predictingglycosylationstereoselectivityusingmachinelearning AT gilmorekerry predictingglycosylationstereoselectivityusingmachinelearning |