Cargando…
Reagent prediction with a molecular transformer improves reaction data quality
Automated synthesis planning is key for efficient generative chemistry. Since reactions of given reactants may yield different products depending on conditions such as the chemical context imposed by specific reagents, computer-aided synthesis planning should benefit from recommendations of reaction...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Royal Society of Chemistry
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10034139/ https://www.ncbi.nlm.nih.gov/pubmed/36970100 http://dx.doi.org/10.1039/d2sc06798f |
_version_ | 1784911145936617472 |
---|---|
author | Andronov, Mikhail Voinarovska, Varvara Andronova, Natalia Wand, Michael Clevert, Djork-Arné Schmidhuber, Jürgen |
author_facet | Andronov, Mikhail Voinarovska, Varvara Andronova, Natalia Wand, Michael Clevert, Djork-Arné Schmidhuber, Jürgen |
author_sort | Andronov, Mikhail |
collection | PubMed |
description | Automated synthesis planning is key for efficient generative chemistry. Since reactions of given reactants may yield different products depending on conditions such as the chemical context imposed by specific reagents, computer-aided synthesis planning should benefit from recommendations of reaction conditions. Traditional synthesis planning software, however, typically proposes reactions without specifying such conditions, relying on human organic chemists who know the conditions to carry out suggested reactions. In particular, reagent prediction for arbitrary reactions, a crucial aspect of condition recommendation, has been largely overlooked in cheminformatics until recently. Here we employ the Molecular Transformer, a state-of-the-art model for reaction prediction and single-step retrosynthesis, to tackle this problem. We train the model on the US patents dataset (USPTO) and test it on Reaxys to demonstrate its out-of-distribution generalization capabilities. Our reagent prediction model also improves the quality of product prediction: the Molecular Transformer is able to substitute the reagents in the noisy USPTO data with reagents that enable product prediction models to outperform those trained on plain USPTO. This makes it possible to improve upon the state-of-the-art in reaction product prediction on the USPTO MIT benchmark. |
format | Online Article Text |
id | pubmed-10034139 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | The Royal Society of Chemistry |
record_format | MEDLINE/PubMed |
spelling | pubmed-100341392023-03-24 Reagent prediction with a molecular transformer improves reaction data quality Andronov, Mikhail Voinarovska, Varvara Andronova, Natalia Wand, Michael Clevert, Djork-Arné Schmidhuber, Jürgen Chem Sci Chemistry Automated synthesis planning is key for efficient generative chemistry. Since reactions of given reactants may yield different products depending on conditions such as the chemical context imposed by specific reagents, computer-aided synthesis planning should benefit from recommendations of reaction conditions. Traditional synthesis planning software, however, typically proposes reactions without specifying such conditions, relying on human organic chemists who know the conditions to carry out suggested reactions. In particular, reagent prediction for arbitrary reactions, a crucial aspect of condition recommendation, has been largely overlooked in cheminformatics until recently. Here we employ the Molecular Transformer, a state-of-the-art model for reaction prediction and single-step retrosynthesis, to tackle this problem. We train the model on the US patents dataset (USPTO) and test it on Reaxys to demonstrate its out-of-distribution generalization capabilities. Our reagent prediction model also improves the quality of product prediction: the Molecular Transformer is able to substitute the reagents in the noisy USPTO data with reagents that enable product prediction models to outperform those trained on plain USPTO. This makes it possible to improve upon the state-of-the-art in reaction product prediction on the USPTO MIT benchmark. The Royal Society of Chemistry 2023-03-01 /pmc/articles/PMC10034139/ /pubmed/36970100 http://dx.doi.org/10.1039/d2sc06798f Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by-nc/3.0/ |
spellingShingle | Chemistry Andronov, Mikhail Voinarovska, Varvara Andronova, Natalia Wand, Michael Clevert, Djork-Arné Schmidhuber, Jürgen Reagent prediction with a molecular transformer improves reaction data quality |
title | Reagent prediction with a molecular transformer improves reaction data quality |
title_full | Reagent prediction with a molecular transformer improves reaction data quality |
title_fullStr | Reagent prediction with a molecular transformer improves reaction data quality |
title_full_unstemmed | Reagent prediction with a molecular transformer improves reaction data quality |
title_short | Reagent prediction with a molecular transformer improves reaction data quality |
title_sort | reagent prediction with a molecular transformer improves reaction data quality |
topic | Chemistry |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10034139/ https://www.ncbi.nlm.nih.gov/pubmed/36970100 http://dx.doi.org/10.1039/d2sc06798f |
work_keys_str_mv | AT andronovmikhail reagentpredictionwithamoleculartransformerimprovesreactiondataquality AT voinarovskavarvara reagentpredictionwithamoleculartransformerimprovesreactiondataquality AT andronovanatalia reagentpredictionwithamoleculartransformerimprovesreactiondataquality AT wandmichael reagentpredictionwithamoleculartransformerimprovesreactiondataquality AT clevertdjorkarne reagentpredictionwithamoleculartransformerimprovesreactiondataquality AT schmidhuberjurgen reagentpredictionwithamoleculartransformerimprovesreactiondataquality |