Cargando…

Prediction of the Chemical Context for Buchwald‐Hartwig Coupling Reactions

We present machine learning models for predicting the chemical context for Buchwald‐Hartwig coupling reactions, i. e., what chemicals to add to the reactants to give a productive reaction. Using reaction data from in‐house electronic lab notebooks, we train two models: one based on single‐label data...

Descripción completa

Detalles Bibliográficos
Autores principales: Genheden, Samuel, Mårdh, Agnes, Lahti, Gustav, Engkvist, Ola, Olsson, Simon, Kogej, Thierry
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9540548/
https://www.ncbi.nlm.nih.gov/pubmed/35122702
http://dx.doi.org/10.1002/minf.202100294
_version_ 1784803731356778496
author Genheden, Samuel
Mårdh, Agnes
Lahti, Gustav
Engkvist, Ola
Olsson, Simon
Kogej, Thierry
author_facet Genheden, Samuel
Mårdh, Agnes
Lahti, Gustav
Engkvist, Ola
Olsson, Simon
Kogej, Thierry
author_sort Genheden, Samuel
collection PubMed
description We present machine learning models for predicting the chemical context for Buchwald‐Hartwig coupling reactions, i. e., what chemicals to add to the reactants to give a productive reaction. Using reaction data from in‐house electronic lab notebooks, we train two models: one based on single‐label data and one based on multi‐label data. Both models show excellent top‐3 accuracy of approximately 90 %, which suggests strong predictivity. Furthermore, there seems to be an advantage of including multi‐label data because the multi‐label model shows higher accuracy and better sensitivity for the individual contexts than the single‐label model. Although the models are performant, we also show that such models need to be re‐trained periodically as there is a strong temporal characteristic to the usage of different contexts. Therefore, a model trained on historical data will decrease in usefulness with time as newer and better contexts emerge and replace older ones. We hypothesize that such significant transitions in the context‐usage will likely affect any model predicting chemical contexts trained on historical data. Consequently, training context prediction models warrants careful planning of what data is used for training and how often the model needs to be re‐trained.
format Online
Article
Text
id pubmed-9540548
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-95405482022-10-14 Prediction of the Chemical Context for Buchwald‐Hartwig Coupling Reactions Genheden, Samuel Mårdh, Agnes Lahti, Gustav Engkvist, Ola Olsson, Simon Kogej, Thierry Mol Inform Research Articles We present machine learning models for predicting the chemical context for Buchwald‐Hartwig coupling reactions, i. e., what chemicals to add to the reactants to give a productive reaction. Using reaction data from in‐house electronic lab notebooks, we train two models: one based on single‐label data and one based on multi‐label data. Both models show excellent top‐3 accuracy of approximately 90 %, which suggests strong predictivity. Furthermore, there seems to be an advantage of including multi‐label data because the multi‐label model shows higher accuracy and better sensitivity for the individual contexts than the single‐label model. Although the models are performant, we also show that such models need to be re‐trained periodically as there is a strong temporal characteristic to the usage of different contexts. Therefore, a model trained on historical data will decrease in usefulness with time as newer and better contexts emerge and replace older ones. We hypothesize that such significant transitions in the context‐usage will likely affect any model predicting chemical contexts trained on historical data. Consequently, training context prediction models warrants careful planning of what data is used for training and how often the model needs to be re‐trained. John Wiley and Sons Inc. 2022-02-22 2022-08 /pmc/articles/PMC9540548/ /pubmed/35122702 http://dx.doi.org/10.1002/minf.202100294 Text en © 2022 The Authors. Molecular Informatics published by Wiley-VCH GmbH https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Genheden, Samuel
Mårdh, Agnes
Lahti, Gustav
Engkvist, Ola
Olsson, Simon
Kogej, Thierry
Prediction of the Chemical Context for Buchwald‐Hartwig Coupling Reactions
title Prediction of the Chemical Context for Buchwald‐Hartwig Coupling Reactions
title_full Prediction of the Chemical Context for Buchwald‐Hartwig Coupling Reactions
title_fullStr Prediction of the Chemical Context for Buchwald‐Hartwig Coupling Reactions
title_full_unstemmed Prediction of the Chemical Context for Buchwald‐Hartwig Coupling Reactions
title_short Prediction of the Chemical Context for Buchwald‐Hartwig Coupling Reactions
title_sort prediction of the chemical context for buchwald‐hartwig coupling reactions
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9540548/
https://www.ncbi.nlm.nih.gov/pubmed/35122702
http://dx.doi.org/10.1002/minf.202100294
work_keys_str_mv AT genhedensamuel predictionofthechemicalcontextforbuchwaldhartwigcouplingreactions
AT mardhagnes predictionofthechemicalcontextforbuchwaldhartwigcouplingreactions
AT lahtigustav predictionofthechemicalcontextforbuchwaldhartwigcouplingreactions
AT engkvistola predictionofthechemicalcontextforbuchwaldhartwigcouplingreactions
AT olssonsimon predictionofthechemicalcontextforbuchwaldhartwigcouplingreactions
AT kogejthierry predictionofthechemicalcontextforbuchwaldhartwigcouplingreactions