Cargando…

Reagent prediction with a molecular transformer improves reaction data quality

Automated synthesis planning is key for efficient generative chemistry. Since reactions of given reactants may yield different products depending on conditions such as the chemical context imposed by specific reagents, computer-aided synthesis planning should benefit from recommendations of reaction...

Descripción completa

Detalles Bibliográficos
Autores principales: Andronov, Mikhail, Voinarovska, Varvara, Andronova, Natalia, Wand, Michael, Clevert, Djork-Arné, Schmidhuber, Jürgen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society of Chemistry 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10034139/
https://www.ncbi.nlm.nih.gov/pubmed/36970100
http://dx.doi.org/10.1039/d2sc06798f
_version_ 1784911145936617472
author Andronov, Mikhail
Voinarovska, Varvara
Andronova, Natalia
Wand, Michael
Clevert, Djork-Arné
Schmidhuber, Jürgen
author_facet Andronov, Mikhail
Voinarovska, Varvara
Andronova, Natalia
Wand, Michael
Clevert, Djork-Arné
Schmidhuber, Jürgen
author_sort Andronov, Mikhail
collection PubMed
description Automated synthesis planning is key for efficient generative chemistry. Since reactions of given reactants may yield different products depending on conditions such as the chemical context imposed by specific reagents, computer-aided synthesis planning should benefit from recommendations of reaction conditions. Traditional synthesis planning software, however, typically proposes reactions without specifying such conditions, relying on human organic chemists who know the conditions to carry out suggested reactions. In particular, reagent prediction for arbitrary reactions, a crucial aspect of condition recommendation, has been largely overlooked in cheminformatics until recently. Here we employ the Molecular Transformer, a state-of-the-art model for reaction prediction and single-step retrosynthesis, to tackle this problem. We train the model on the US patents dataset (USPTO) and test it on Reaxys to demonstrate its out-of-distribution generalization capabilities. Our reagent prediction model also improves the quality of product prediction: the Molecular Transformer is able to substitute the reagents in the noisy USPTO data with reagents that enable product prediction models to outperform those trained on plain USPTO. This makes it possible to improve upon the state-of-the-art in reaction product prediction on the USPTO MIT benchmark.
format Online
Article
Text
id pubmed-10034139
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher The Royal Society of Chemistry
record_format MEDLINE/PubMed
spelling pubmed-100341392023-03-24 Reagent prediction with a molecular transformer improves reaction data quality Andronov, Mikhail Voinarovska, Varvara Andronova, Natalia Wand, Michael Clevert, Djork-Arné Schmidhuber, Jürgen Chem Sci Chemistry Automated synthesis planning is key for efficient generative chemistry. Since reactions of given reactants may yield different products depending on conditions such as the chemical context imposed by specific reagents, computer-aided synthesis planning should benefit from recommendations of reaction conditions. Traditional synthesis planning software, however, typically proposes reactions without specifying such conditions, relying on human organic chemists who know the conditions to carry out suggested reactions. In particular, reagent prediction for arbitrary reactions, a crucial aspect of condition recommendation, has been largely overlooked in cheminformatics until recently. Here we employ the Molecular Transformer, a state-of-the-art model for reaction prediction and single-step retrosynthesis, to tackle this problem. We train the model on the US patents dataset (USPTO) and test it on Reaxys to demonstrate its out-of-distribution generalization capabilities. Our reagent prediction model also improves the quality of product prediction: the Molecular Transformer is able to substitute the reagents in the noisy USPTO data with reagents that enable product prediction models to outperform those trained on plain USPTO. This makes it possible to improve upon the state-of-the-art in reaction product prediction on the USPTO MIT benchmark. The Royal Society of Chemistry 2023-03-01 /pmc/articles/PMC10034139/ /pubmed/36970100 http://dx.doi.org/10.1039/d2sc06798f Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by-nc/3.0/
spellingShingle Chemistry
Andronov, Mikhail
Voinarovska, Varvara
Andronova, Natalia
Wand, Michael
Clevert, Djork-Arné
Schmidhuber, Jürgen
Reagent prediction with a molecular transformer improves reaction data quality
title Reagent prediction with a molecular transformer improves reaction data quality
title_full Reagent prediction with a molecular transformer improves reaction data quality
title_fullStr Reagent prediction with a molecular transformer improves reaction data quality
title_full_unstemmed Reagent prediction with a molecular transformer improves reaction data quality
title_short Reagent prediction with a molecular transformer improves reaction data quality
title_sort reagent prediction with a molecular transformer improves reaction data quality
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10034139/
https://www.ncbi.nlm.nih.gov/pubmed/36970100
http://dx.doi.org/10.1039/d2sc06798f
work_keys_str_mv AT andronovmikhail reagentpredictionwithamoleculartransformerimprovesreactiondataquality
AT voinarovskavarvara reagentpredictionwithamoleculartransformerimprovesreactiondataquality
AT andronovanatalia reagentpredictionwithamoleculartransformerimprovesreactiondataquality
AT wandmichael reagentpredictionwithamoleculartransformerimprovesreactiondataquality
AT clevertdjorkarne reagentpredictionwithamoleculartransformerimprovesreactiondataquality
AT schmidhuberjurgen reagentpredictionwithamoleculartransformerimprovesreactiondataquality