Cargando…

Improving the performance of models for one-step retrosynthesis through re-ranking

ABSTRACT: Retrosynthesis is at the core of organic chemistry. Recently, the rapid growth of artificial intelligence (AI) has spurred a variety of novel machine learning approaches for data-driven synthesis planning. These methods learn complex patterns from reaction databases in order to predict, fo...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lin, Min Htoo, Tu, Zhengkai, Coley, Connor W.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer International Publishing 2022
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8922884/ https://www.ncbi.nlm.nih.gov/pubmed/35292121 http://dx.doi.org/10.1186/s13321-022-00594-8

_version_	1784669584368861184
author	Lin, Min Htoo Tu, Zhengkai Coley, Connor W.
author_facet	Lin, Min Htoo Tu, Zhengkai Coley, Connor W.
author_sort	Lin, Min Htoo
collection	PubMed
description	ABSTRACT: Retrosynthesis is at the core of organic chemistry. Recently, the rapid growth of artificial intelligence (AI) has spurred a variety of novel machine learning approaches for data-driven synthesis planning. These methods learn complex patterns from reaction databases in order to predict, for a given product, sets of reactants that can be used to synthesise that product. However, their performance as measured by the top-N accuracy in matching published reaction precedents still leaves room for improvement. This work aims to enhance these models by learning to re-rank their reactant predictions. Specifically, we design and train an energy-based model to re-rank, for each product, the published reaction as the top suggestion and the remaining reactant predictions as lower-ranked. We show that re-ranking can improve one-step models significantly using the standard USPTO-50k benchmark dataset, such as RetroSim, a similarity-based method, from 35.7 to 51.8% top-1 accuracy and NeuralSym, a deep learning method, from 45.7 to 51.3%, and also that re-ranking the union of two models’ suggestions can lead to better performance than either alone. However, the state-of-the-art top-1 accuracy is not improved by this method. GRAPHICAL ABSTRACT: [Image: see text] SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-022-00594-8.
format	Online Article Text
id	pubmed-8922884
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Springer International Publishing
record_format	MEDLINE/PubMed
spelling	pubmed-89228842022-03-23 Improving the performance of models for one-step retrosynthesis through re-ranking Lin, Min Htoo Tu, Zhengkai Coley, Connor W. J Cheminform Research Article ABSTRACT: Retrosynthesis is at the core of organic chemistry. Recently, the rapid growth of artificial intelligence (AI) has spurred a variety of novel machine learning approaches for data-driven synthesis planning. These methods learn complex patterns from reaction databases in order to predict, for a given product, sets of reactants that can be used to synthesise that product. However, their performance as measured by the top-N accuracy in matching published reaction precedents still leaves room for improvement. This work aims to enhance these models by learning to re-rank their reactant predictions. Specifically, we design and train an energy-based model to re-rank, for each product, the published reaction as the top suggestion and the remaining reactant predictions as lower-ranked. We show that re-ranking can improve one-step models significantly using the standard USPTO-50k benchmark dataset, such as RetroSim, a similarity-based method, from 35.7 to 51.8% top-1 accuracy and NeuralSym, a deep learning method, from 45.7 to 51.3%, and also that re-ranking the union of two models’ suggestions can lead to better performance than either alone. However, the state-of-the-art top-1 accuracy is not improved by this method. GRAPHICAL ABSTRACT: [Image: see text] SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-022-00594-8. Springer International Publishing 2022-03-15 /pmc/articles/PMC8922884/ /pubmed/35292121 http://dx.doi.org/10.1186/s13321-022-00594-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Research Article Lin, Min Htoo Tu, Zhengkai Coley, Connor W. Improving the performance of models for one-step retrosynthesis through re-ranking
title	Improving the performance of models for one-step retrosynthesis through re-ranking
title_full	Improving the performance of models for one-step retrosynthesis through re-ranking
title_fullStr	Improving the performance of models for one-step retrosynthesis through re-ranking
title_full_unstemmed	Improving the performance of models for one-step retrosynthesis through re-ranking
title_short	Improving the performance of models for one-step retrosynthesis through re-ranking
title_sort	improving the performance of models for one-step retrosynthesis through re-ranking
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8922884/ https://www.ncbi.nlm.nih.gov/pubmed/35292121 http://dx.doi.org/10.1186/s13321-022-00594-8
work_keys_str_mv	AT linminhtoo improvingtheperformanceofmodelsforonestepretrosynthesisthroughreranking AT tuzhengkai improvingtheperformanceofmodelsforonestepretrosynthesisthroughreranking AT coleyconnorw improvingtheperformanceofmodelsforonestepretrosynthesisthroughreranking

Improving the performance of models for one-step retrosynthesis through re-ranking

Ejemplares similares