Cargando…
Using Machine Learning To Predict Suitable Conditions for Organic Reactions
[Image: see text] Reaction condition recommendation is an essential element for the realization of computer-assisted synthetic planning. Accurate suggestions of reaction conditions are required for experimental validation and can have a significant effect on the success or failure of an attempted tr...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2018
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6276053/ https://www.ncbi.nlm.nih.gov/pubmed/30555898 http://dx.doi.org/10.1021/acscentsci.8b00357 |
_version_ | 1783377938582863872 |
---|---|
author | Gao, Hanyu Struble, Thomas J. Coley, Connor W. Wang, Yuran Green, William H. Jensen, Klavs F. |
author_facet | Gao, Hanyu Struble, Thomas J. Coley, Connor W. Wang, Yuran Green, William H. Jensen, Klavs F. |
author_sort | Gao, Hanyu |
collection | PubMed |
description | [Image: see text] Reaction condition recommendation is an essential element for the realization of computer-assisted synthetic planning. Accurate suggestions of reaction conditions are required for experimental validation and can have a significant effect on the success or failure of an attempted transformation. However, de novo condition recommendation remains a challenging and under-explored problem and relies heavily on chemists’ knowledge and experience. In this work, we develop a neural-network model to predict the chemical context (catalyst(s), solvent(s), reagent(s)), as well as the temperature most suitable for any particular organic reaction. Trained on ∼10 million examples from Reaxys, the model is able to propose conditions where a close match to the recorded catalyst, solvent, and reagent is found within the top-10 predictions 69.6% of the time, with top-10 accuracies for individual species reaching 80–90%. Temperature is accurately predicted within ±20 °C from the recorded temperature in 60–70% of test cases, with higher accuracy for cases with correct chemical context predictions. The utility of the model is illustrated through several examples spanning a range of common reaction classes. We also demonstrate that the model implicitly learns a continuous numerical embedding of solvent and reagent species that captures their functional similarity. |
format | Online Article Text |
id | pubmed-6276053 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-62760532018-12-15 Using Machine Learning To Predict Suitable Conditions for Organic Reactions Gao, Hanyu Struble, Thomas J. Coley, Connor W. Wang, Yuran Green, William H. Jensen, Klavs F. ACS Cent Sci [Image: see text] Reaction condition recommendation is an essential element for the realization of computer-assisted synthetic planning. Accurate suggestions of reaction conditions are required for experimental validation and can have a significant effect on the success or failure of an attempted transformation. However, de novo condition recommendation remains a challenging and under-explored problem and relies heavily on chemists’ knowledge and experience. In this work, we develop a neural-network model to predict the chemical context (catalyst(s), solvent(s), reagent(s)), as well as the temperature most suitable for any particular organic reaction. Trained on ∼10 million examples from Reaxys, the model is able to propose conditions where a close match to the recorded catalyst, solvent, and reagent is found within the top-10 predictions 69.6% of the time, with top-10 accuracies for individual species reaching 80–90%. Temperature is accurately predicted within ±20 °C from the recorded temperature in 60–70% of test cases, with higher accuracy for cases with correct chemical context predictions. The utility of the model is illustrated through several examples spanning a range of common reaction classes. We also demonstrate that the model implicitly learns a continuous numerical embedding of solvent and reagent species that captures their functional similarity. American Chemical Society 2018-11-16 2018-11-28 /pmc/articles/PMC6276053/ /pubmed/30555898 http://dx.doi.org/10.1021/acscentsci.8b00357 Text en Copyright © 2018 American Chemical Society This is an open access article published under an ACS AuthorChoice License (http://pubs.acs.org/page/policy/authorchoice_termsofuse.html) , which permits copying and redistribution of the article or any adaptations for non-commercial purposes. |
spellingShingle | Gao, Hanyu Struble, Thomas J. Coley, Connor W. Wang, Yuran Green, William H. Jensen, Klavs F. Using Machine Learning To Predict Suitable Conditions for Organic Reactions |
title | Using Machine Learning To Predict Suitable Conditions
for Organic Reactions |
title_full | Using Machine Learning To Predict Suitable Conditions
for Organic Reactions |
title_fullStr | Using Machine Learning To Predict Suitable Conditions
for Organic Reactions |
title_full_unstemmed | Using Machine Learning To Predict Suitable Conditions
for Organic Reactions |
title_short | Using Machine Learning To Predict Suitable Conditions
for Organic Reactions |
title_sort | using machine learning to predict suitable conditions
for organic reactions |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6276053/ https://www.ncbi.nlm.nih.gov/pubmed/30555898 http://dx.doi.org/10.1021/acscentsci.8b00357 |
work_keys_str_mv | AT gaohanyu usingmachinelearningtopredictsuitableconditionsfororganicreactions AT strublethomasj usingmachinelearningtopredictsuitableconditionsfororganicreactions AT coleyconnorw usingmachinelearningtopredictsuitableconditionsfororganicreactions AT wangyuran usingmachinelearningtopredictsuitableconditionsfororganicreactions AT greenwilliamh usingmachinelearningtopredictsuitableconditionsfororganicreactions AT jensenklavsf usingmachinelearningtopredictsuitableconditionsfororganicreactions |