Cargando…
Prediction of the Chemical Context for Buchwald‐Hartwig Coupling Reactions
We present machine learning models for predicting the chemical context for Buchwald‐Hartwig coupling reactions, i. e., what chemicals to add to the reactants to give a productive reaction. Using reaction data from in‐house electronic lab notebooks, we train two models: one based on single‐label data...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9540548/ https://www.ncbi.nlm.nih.gov/pubmed/35122702 http://dx.doi.org/10.1002/minf.202100294 |
_version_ | 1784803731356778496 |
---|---|
author | Genheden, Samuel Mårdh, Agnes Lahti, Gustav Engkvist, Ola Olsson, Simon Kogej, Thierry |
author_facet | Genheden, Samuel Mårdh, Agnes Lahti, Gustav Engkvist, Ola Olsson, Simon Kogej, Thierry |
author_sort | Genheden, Samuel |
collection | PubMed |
description | We present machine learning models for predicting the chemical context for Buchwald‐Hartwig coupling reactions, i. e., what chemicals to add to the reactants to give a productive reaction. Using reaction data from in‐house electronic lab notebooks, we train two models: one based on single‐label data and one based on multi‐label data. Both models show excellent top‐3 accuracy of approximately 90 %, which suggests strong predictivity. Furthermore, there seems to be an advantage of including multi‐label data because the multi‐label model shows higher accuracy and better sensitivity for the individual contexts than the single‐label model. Although the models are performant, we also show that such models need to be re‐trained periodically as there is a strong temporal characteristic to the usage of different contexts. Therefore, a model trained on historical data will decrease in usefulness with time as newer and better contexts emerge and replace older ones. We hypothesize that such significant transitions in the context‐usage will likely affect any model predicting chemical contexts trained on historical data. Consequently, training context prediction models warrants careful planning of what data is used for training and how often the model needs to be re‐trained. |
format | Online Article Text |
id | pubmed-9540548 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-95405482022-10-14 Prediction of the Chemical Context for Buchwald‐Hartwig Coupling Reactions Genheden, Samuel Mårdh, Agnes Lahti, Gustav Engkvist, Ola Olsson, Simon Kogej, Thierry Mol Inform Research Articles We present machine learning models for predicting the chemical context for Buchwald‐Hartwig coupling reactions, i. e., what chemicals to add to the reactants to give a productive reaction. Using reaction data from in‐house electronic lab notebooks, we train two models: one based on single‐label data and one based on multi‐label data. Both models show excellent top‐3 accuracy of approximately 90 %, which suggests strong predictivity. Furthermore, there seems to be an advantage of including multi‐label data because the multi‐label model shows higher accuracy and better sensitivity for the individual contexts than the single‐label model. Although the models are performant, we also show that such models need to be re‐trained periodically as there is a strong temporal characteristic to the usage of different contexts. Therefore, a model trained on historical data will decrease in usefulness with time as newer and better contexts emerge and replace older ones. We hypothesize that such significant transitions in the context‐usage will likely affect any model predicting chemical contexts trained on historical data. Consequently, training context prediction models warrants careful planning of what data is used for training and how often the model needs to be re‐trained. John Wiley and Sons Inc. 2022-02-22 2022-08 /pmc/articles/PMC9540548/ /pubmed/35122702 http://dx.doi.org/10.1002/minf.202100294 Text en © 2022 The Authors. Molecular Informatics published by Wiley-VCH GmbH https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Articles Genheden, Samuel Mårdh, Agnes Lahti, Gustav Engkvist, Ola Olsson, Simon Kogej, Thierry Prediction of the Chemical Context for Buchwald‐Hartwig Coupling Reactions |
title | Prediction of the Chemical Context for Buchwald‐Hartwig Coupling Reactions |
title_full | Prediction of the Chemical Context for Buchwald‐Hartwig Coupling Reactions |
title_fullStr | Prediction of the Chemical Context for Buchwald‐Hartwig Coupling Reactions |
title_full_unstemmed | Prediction of the Chemical Context for Buchwald‐Hartwig Coupling Reactions |
title_short | Prediction of the Chemical Context for Buchwald‐Hartwig Coupling Reactions |
title_sort | prediction of the chemical context for buchwald‐hartwig coupling reactions |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9540548/ https://www.ncbi.nlm.nih.gov/pubmed/35122702 http://dx.doi.org/10.1002/minf.202100294 |
work_keys_str_mv | AT genhedensamuel predictionofthechemicalcontextforbuchwaldhartwigcouplingreactions AT mardhagnes predictionofthechemicalcontextforbuchwaldhartwigcouplingreactions AT lahtigustav predictionofthechemicalcontextforbuchwaldhartwigcouplingreactions AT engkvistola predictionofthechemicalcontextforbuchwaldhartwigcouplingreactions AT olssonsimon predictionofthechemicalcontextforbuchwaldhartwigcouplingreactions AT kogejthierry predictionofthechemicalcontextforbuchwaldhartwigcouplingreactions |