Cargando…
Predicting retrosynthetic pathways using transformer-based models and a hyper-graph exploration strategy
We present an extension of our Molecular Transformer model combined with a hyper-graph exploration strategy for automatic retrosynthesis route planning without human intervention. The single-step retrosynthetic model sets a new state of the art for predicting reactants as well as reagents, solvents...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Royal Society of Chemistry
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8152799/ https://www.ncbi.nlm.nih.gov/pubmed/34122839 http://dx.doi.org/10.1039/c9sc05704h |
_version_ | 1783698671595945984 |
---|---|
author | Schwaller, Philippe Petraglia, Riccardo Zullo, Valerio Nair, Vishnu H. Haeuselmann, Rico Andreas Pisoni, Riccardo Bekas, Costas Iuliano, Anna Laino, Teodoro |
author_facet | Schwaller, Philippe Petraglia, Riccardo Zullo, Valerio Nair, Vishnu H. Haeuselmann, Rico Andreas Pisoni, Riccardo Bekas, Costas Iuliano, Anna Laino, Teodoro |
author_sort | Schwaller, Philippe |
collection | PubMed |
description | We present an extension of our Molecular Transformer model combined with a hyper-graph exploration strategy for automatic retrosynthesis route planning without human intervention. The single-step retrosynthetic model sets a new state of the art for predicting reactants as well as reagents, solvents and catalysts for each retrosynthetic step. We introduce four metrics (coverage, class diversity, round-trip accuracy and Jensen–Shannon divergence) to evaluate the single-step retrosynthetic models, using the forward prediction and a reaction classification model always based on the transformer architecture. The hypergraph is constructed on the fly, and the nodes are filtered and further expanded based on a Bayesian-like probability. We critically assessed the end-to-end framework with several retrosynthesis examples from literature and academic exams. Overall, the frameworks have an excellent performance with few weaknesses related to the training data. The use of the introduced metrics opens up the possibility to optimize entire retrosynthetic frameworks by focusing on the performance of the single-step model only. |
format | Online Article Text |
id | pubmed-8152799 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | The Royal Society of Chemistry |
record_format | MEDLINE/PubMed |
spelling | pubmed-81527992021-06-11 Predicting retrosynthetic pathways using transformer-based models and a hyper-graph exploration strategy Schwaller, Philippe Petraglia, Riccardo Zullo, Valerio Nair, Vishnu H. Haeuselmann, Rico Andreas Pisoni, Riccardo Bekas, Costas Iuliano, Anna Laino, Teodoro Chem Sci Chemistry We present an extension of our Molecular Transformer model combined with a hyper-graph exploration strategy for automatic retrosynthesis route planning without human intervention. The single-step retrosynthetic model sets a new state of the art for predicting reactants as well as reagents, solvents and catalysts for each retrosynthetic step. We introduce four metrics (coverage, class diversity, round-trip accuracy and Jensen–Shannon divergence) to evaluate the single-step retrosynthetic models, using the forward prediction and a reaction classification model always based on the transformer architecture. The hypergraph is constructed on the fly, and the nodes are filtered and further expanded based on a Bayesian-like probability. We critically assessed the end-to-end framework with several retrosynthesis examples from literature and academic exams. Overall, the frameworks have an excellent performance with few weaknesses related to the training data. The use of the introduced metrics opens up the possibility to optimize entire retrosynthetic frameworks by focusing on the performance of the single-step model only. The Royal Society of Chemistry 2020-03-03 /pmc/articles/PMC8152799/ /pubmed/34122839 http://dx.doi.org/10.1039/c9sc05704h Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by-nc/3.0/ |
spellingShingle | Chemistry Schwaller, Philippe Petraglia, Riccardo Zullo, Valerio Nair, Vishnu H. Haeuselmann, Rico Andreas Pisoni, Riccardo Bekas, Costas Iuliano, Anna Laino, Teodoro Predicting retrosynthetic pathways using transformer-based models and a hyper-graph exploration strategy |
title | Predicting retrosynthetic pathways using transformer-based models and a hyper-graph exploration strategy |
title_full | Predicting retrosynthetic pathways using transformer-based models and a hyper-graph exploration strategy |
title_fullStr | Predicting retrosynthetic pathways using transformer-based models and a hyper-graph exploration strategy |
title_full_unstemmed | Predicting retrosynthetic pathways using transformer-based models and a hyper-graph exploration strategy |
title_short | Predicting retrosynthetic pathways using transformer-based models and a hyper-graph exploration strategy |
title_sort | predicting retrosynthetic pathways using transformer-based models and a hyper-graph exploration strategy |
topic | Chemistry |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8152799/ https://www.ncbi.nlm.nih.gov/pubmed/34122839 http://dx.doi.org/10.1039/c9sc05704h |
work_keys_str_mv | AT schwallerphilippe predictingretrosyntheticpathwaysusingtransformerbasedmodelsandahypergraphexplorationstrategy AT petragliariccardo predictingretrosyntheticpathwaysusingtransformerbasedmodelsandahypergraphexplorationstrategy AT zullovalerio predictingretrosyntheticpathwaysusingtransformerbasedmodelsandahypergraphexplorationstrategy AT nairvishnuh predictingretrosyntheticpathwaysusingtransformerbasedmodelsandahypergraphexplorationstrategy AT haeuselmannricoandreas predictingretrosyntheticpathwaysusingtransformerbasedmodelsandahypergraphexplorationstrategy AT pisoniriccardo predictingretrosyntheticpathwaysusingtransformerbasedmodelsandahypergraphexplorationstrategy AT bekascostas predictingretrosyntheticpathwaysusingtransformerbasedmodelsandahypergraphexplorationstrategy AT iulianoanna predictingretrosyntheticpathwaysusingtransformerbasedmodelsandahypergraphexplorationstrategy AT lainoteodoro predictingretrosyntheticpathwaysusingtransformerbasedmodelsandahypergraphexplorationstrategy |