Cargando…

Using Machine Learning To Predict Suitable Conditions for Organic Reactions

[Image: see text] Reaction condition recommendation is an essential element for the realization of computer-assisted synthetic planning. Accurate suggestions of reaction conditions are required for experimental validation and can have a significant effect on the success or failure of an attempted tr...

Descripción completa

Detalles Bibliográficos
Autores principales: Gao, Hanyu, Struble, Thomas J., Coley, Connor W., Wang, Yuran, Green, William H., Jensen, Klavs F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2018
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6276053/
https://www.ncbi.nlm.nih.gov/pubmed/30555898
http://dx.doi.org/10.1021/acscentsci.8b00357
_version_ 1783377938582863872
author Gao, Hanyu
Struble, Thomas J.
Coley, Connor W.
Wang, Yuran
Green, William H.
Jensen, Klavs F.
author_facet Gao, Hanyu
Struble, Thomas J.
Coley, Connor W.
Wang, Yuran
Green, William H.
Jensen, Klavs F.
author_sort Gao, Hanyu
collection PubMed
description [Image: see text] Reaction condition recommendation is an essential element for the realization of computer-assisted synthetic planning. Accurate suggestions of reaction conditions are required for experimental validation and can have a significant effect on the success or failure of an attempted transformation. However, de novo condition recommendation remains a challenging and under-explored problem and relies heavily on chemists’ knowledge and experience. In this work, we develop a neural-network model to predict the chemical context (catalyst(s), solvent(s), reagent(s)), as well as the temperature most suitable for any particular organic reaction. Trained on ∼10 million examples from Reaxys, the model is able to propose conditions where a close match to the recorded catalyst, solvent, and reagent is found within the top-10 predictions 69.6% of the time, with top-10 accuracies for individual species reaching 80–90%. Temperature is accurately predicted within ±20 °C from the recorded temperature in 60–70% of test cases, with higher accuracy for cases with correct chemical context predictions. The utility of the model is illustrated through several examples spanning a range of common reaction classes. We also demonstrate that the model implicitly learns a continuous numerical embedding of solvent and reagent species that captures their functional similarity.
format Online
Article
Text
id pubmed-6276053
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-62760532018-12-15 Using Machine Learning To Predict Suitable Conditions for Organic Reactions Gao, Hanyu Struble, Thomas J. Coley, Connor W. Wang, Yuran Green, William H. Jensen, Klavs F. ACS Cent Sci [Image: see text] Reaction condition recommendation is an essential element for the realization of computer-assisted synthetic planning. Accurate suggestions of reaction conditions are required for experimental validation and can have a significant effect on the success or failure of an attempted transformation. However, de novo condition recommendation remains a challenging and under-explored problem and relies heavily on chemists’ knowledge and experience. In this work, we develop a neural-network model to predict the chemical context (catalyst(s), solvent(s), reagent(s)), as well as the temperature most suitable for any particular organic reaction. Trained on ∼10 million examples from Reaxys, the model is able to propose conditions where a close match to the recorded catalyst, solvent, and reagent is found within the top-10 predictions 69.6% of the time, with top-10 accuracies for individual species reaching 80–90%. Temperature is accurately predicted within ±20 °C from the recorded temperature in 60–70% of test cases, with higher accuracy for cases with correct chemical context predictions. The utility of the model is illustrated through several examples spanning a range of common reaction classes. We also demonstrate that the model implicitly learns a continuous numerical embedding of solvent and reagent species that captures their functional similarity. American Chemical Society 2018-11-16 2018-11-28 /pmc/articles/PMC6276053/ /pubmed/30555898 http://dx.doi.org/10.1021/acscentsci.8b00357 Text en Copyright © 2018 American Chemical Society This is an open access article published under an ACS AuthorChoice License (http://pubs.acs.org/page/policy/authorchoice_termsofuse.html) , which permits copying and redistribution of the article or any adaptations for non-commercial purposes.
spellingShingle Gao, Hanyu
Struble, Thomas J.
Coley, Connor W.
Wang, Yuran
Green, William H.
Jensen, Klavs F.
Using Machine Learning To Predict Suitable Conditions for Organic Reactions
title Using Machine Learning To Predict Suitable Conditions for Organic Reactions
title_full Using Machine Learning To Predict Suitable Conditions for Organic Reactions
title_fullStr Using Machine Learning To Predict Suitable Conditions for Organic Reactions
title_full_unstemmed Using Machine Learning To Predict Suitable Conditions for Organic Reactions
title_short Using Machine Learning To Predict Suitable Conditions for Organic Reactions
title_sort using machine learning to predict suitable conditions for organic reactions
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6276053/
https://www.ncbi.nlm.nih.gov/pubmed/30555898
http://dx.doi.org/10.1021/acscentsci.8b00357
work_keys_str_mv AT gaohanyu usingmachinelearningtopredictsuitableconditionsfororganicreactions
AT strublethomasj usingmachinelearningtopredictsuitableconditionsfororganicreactions
AT coleyconnorw usingmachinelearningtopredictsuitableconditionsfororganicreactions
AT wangyuran usingmachinelearningtopredictsuitableconditionsfororganicreactions
AT greenwilliamh usingmachinelearningtopredictsuitableconditionsfororganicreactions
AT jensenklavsf usingmachinelearningtopredictsuitableconditionsfororganicreactions