Cargando…

Predicting retrosynthetic pathways using transformer-based models and a hyper-graph exploration strategy

We present an extension of our Molecular Transformer model combined with a hyper-graph exploration strategy for automatic retrosynthesis route planning without human intervention. The single-step retrosynthetic model sets a new state of the art for predicting reactants as well as reagents, solvents...

Descripción completa

Detalles Bibliográficos
Autores principales: Schwaller, Philippe, Petraglia, Riccardo, Zullo, Valerio, Nair, Vishnu H., Haeuselmann, Rico Andreas, Pisoni, Riccardo, Bekas, Costas, Iuliano, Anna, Laino, Teodoro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society of Chemistry 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8152799/
https://www.ncbi.nlm.nih.gov/pubmed/34122839
http://dx.doi.org/10.1039/c9sc05704h
_version_ 1783698671595945984
author Schwaller, Philippe
Petraglia, Riccardo
Zullo, Valerio
Nair, Vishnu H.
Haeuselmann, Rico Andreas
Pisoni, Riccardo
Bekas, Costas
Iuliano, Anna
Laino, Teodoro
author_facet Schwaller, Philippe
Petraglia, Riccardo
Zullo, Valerio
Nair, Vishnu H.
Haeuselmann, Rico Andreas
Pisoni, Riccardo
Bekas, Costas
Iuliano, Anna
Laino, Teodoro
author_sort Schwaller, Philippe
collection PubMed
description We present an extension of our Molecular Transformer model combined with a hyper-graph exploration strategy for automatic retrosynthesis route planning without human intervention. The single-step retrosynthetic model sets a new state of the art for predicting reactants as well as reagents, solvents and catalysts for each retrosynthetic step. We introduce four metrics (coverage, class diversity, round-trip accuracy and Jensen–Shannon divergence) to evaluate the single-step retrosynthetic models, using the forward prediction and a reaction classification model always based on the transformer architecture. The hypergraph is constructed on the fly, and the nodes are filtered and further expanded based on a Bayesian-like probability. We critically assessed the end-to-end framework with several retrosynthesis examples from literature and academic exams. Overall, the frameworks have an excellent performance with few weaknesses related to the training data. The use of the introduced metrics opens up the possibility to optimize entire retrosynthetic frameworks by focusing on the performance of the single-step model only.
format Online
Article
Text
id pubmed-8152799
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher The Royal Society of Chemistry
record_format MEDLINE/PubMed
spelling pubmed-81527992021-06-11 Predicting retrosynthetic pathways using transformer-based models and a hyper-graph exploration strategy Schwaller, Philippe Petraglia, Riccardo Zullo, Valerio Nair, Vishnu H. Haeuselmann, Rico Andreas Pisoni, Riccardo Bekas, Costas Iuliano, Anna Laino, Teodoro Chem Sci Chemistry We present an extension of our Molecular Transformer model combined with a hyper-graph exploration strategy for automatic retrosynthesis route planning without human intervention. The single-step retrosynthetic model sets a new state of the art for predicting reactants as well as reagents, solvents and catalysts for each retrosynthetic step. We introduce four metrics (coverage, class diversity, round-trip accuracy and Jensen–Shannon divergence) to evaluate the single-step retrosynthetic models, using the forward prediction and a reaction classification model always based on the transformer architecture. The hypergraph is constructed on the fly, and the nodes are filtered and further expanded based on a Bayesian-like probability. We critically assessed the end-to-end framework with several retrosynthesis examples from literature and academic exams. Overall, the frameworks have an excellent performance with few weaknesses related to the training data. The use of the introduced metrics opens up the possibility to optimize entire retrosynthetic frameworks by focusing on the performance of the single-step model only. The Royal Society of Chemistry 2020-03-03 /pmc/articles/PMC8152799/ /pubmed/34122839 http://dx.doi.org/10.1039/c9sc05704h Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by-nc/3.0/
spellingShingle Chemistry
Schwaller, Philippe
Petraglia, Riccardo
Zullo, Valerio
Nair, Vishnu H.
Haeuselmann, Rico Andreas
Pisoni, Riccardo
Bekas, Costas
Iuliano, Anna
Laino, Teodoro
Predicting retrosynthetic pathways using transformer-based models and a hyper-graph exploration strategy
title Predicting retrosynthetic pathways using transformer-based models and a hyper-graph exploration strategy
title_full Predicting retrosynthetic pathways using transformer-based models and a hyper-graph exploration strategy
title_fullStr Predicting retrosynthetic pathways using transformer-based models and a hyper-graph exploration strategy
title_full_unstemmed Predicting retrosynthetic pathways using transformer-based models and a hyper-graph exploration strategy
title_short Predicting retrosynthetic pathways using transformer-based models and a hyper-graph exploration strategy
title_sort predicting retrosynthetic pathways using transformer-based models and a hyper-graph exploration strategy
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8152799/
https://www.ncbi.nlm.nih.gov/pubmed/34122839
http://dx.doi.org/10.1039/c9sc05704h
work_keys_str_mv AT schwallerphilippe predictingretrosyntheticpathwaysusingtransformerbasedmodelsandahypergraphexplorationstrategy
AT petragliariccardo predictingretrosyntheticpathwaysusingtransformerbasedmodelsandahypergraphexplorationstrategy
AT zullovalerio predictingretrosyntheticpathwaysusingtransformerbasedmodelsandahypergraphexplorationstrategy
AT nairvishnuh predictingretrosyntheticpathwaysusingtransformerbasedmodelsandahypergraphexplorationstrategy
AT haeuselmannricoandreas predictingretrosyntheticpathwaysusingtransformerbasedmodelsandahypergraphexplorationstrategy
AT pisoniriccardo predictingretrosyntheticpathwaysusingtransformerbasedmodelsandahypergraphexplorationstrategy
AT bekascostas predictingretrosyntheticpathwaysusingtransformerbasedmodelsandahypergraphexplorationstrategy
AT iulianoanna predictingretrosyntheticpathwaysusingtransformerbasedmodelsandahypergraphexplorationstrategy
AT lainoteodoro predictingretrosyntheticpathwaysusingtransformerbasedmodelsandahypergraphexplorationstrategy