Cargando…

Reaction classification and yield prediction using the differential reaction fingerprint DRFP

Predicting the nature and outcome of reactions using computational methods is a crucial tool to accelerate chemical research. The recent application of deep learning-based learned fingerprints to reaction classification and reaction yield prediction has shown an impressive increase in performance co...

Descripción completa

Detalles Bibliográficos
Autores principales: Probst, Daniel, Schwaller, Philippe, Reymond, Jean-Louis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: RSC 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8996827/
https://www.ncbi.nlm.nih.gov/pubmed/35515081
http://dx.doi.org/10.1039/d1dd00006c
_version_ 1784684561390632960
author Probst, Daniel
Schwaller, Philippe
Reymond, Jean-Louis
author_facet Probst, Daniel
Schwaller, Philippe
Reymond, Jean-Louis
author_sort Probst, Daniel
collection PubMed
description Predicting the nature and outcome of reactions using computational methods is a crucial tool to accelerate chemical research. The recent application of deep learning-based learned fingerprints to reaction classification and reaction yield prediction has shown an impressive increase in performance compared to previous methods such as DFT- and structure-based fingerprints. However, learned fingerprints require large training data sets, are inherently biased, and are based on complex deep learning architectures. Here we present the differential reaction fingerprint DRFP. The DRFP algorithm takes a reaction SMILES as an input and creates a binary fingerprint based on the symmetric difference of two sets containing the circular molecular n-grams generated from the molecules listed left and right from the reaction arrow, respectively, without the need for distinguishing between reactants and reagents. We show that DRFP performs better than DFT-based fingerprints in reaction yield prediction and other structure-based fingerprints in reaction classification, reaching the performance of state-of-the-art learned fingerprints in both tasks while being data-independent.
format Online
Article
Text
id pubmed-8996827
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher RSC
record_format MEDLINE/PubMed
spelling pubmed-89968272022-05-03 Reaction classification and yield prediction using the differential reaction fingerprint DRFP Probst, Daniel Schwaller, Philippe Reymond, Jean-Louis Digit Discov Chemistry Predicting the nature and outcome of reactions using computational methods is a crucial tool to accelerate chemical research. The recent application of deep learning-based learned fingerprints to reaction classification and reaction yield prediction has shown an impressive increase in performance compared to previous methods such as DFT- and structure-based fingerprints. However, learned fingerprints require large training data sets, are inherently biased, and are based on complex deep learning architectures. Here we present the differential reaction fingerprint DRFP. The DRFP algorithm takes a reaction SMILES as an input and creates a binary fingerprint based on the symmetric difference of two sets containing the circular molecular n-grams generated from the molecules listed left and right from the reaction arrow, respectively, without the need for distinguishing between reactants and reagents. We show that DRFP performs better than DFT-based fingerprints in reaction yield prediction and other structure-based fingerprints in reaction classification, reaching the performance of state-of-the-art learned fingerprints in both tasks while being data-independent. RSC 2022-01-21 /pmc/articles/PMC8996827/ /pubmed/35515081 http://dx.doi.org/10.1039/d1dd00006c Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by/3.0/
spellingShingle Chemistry
Probst, Daniel
Schwaller, Philippe
Reymond, Jean-Louis
Reaction classification and yield prediction using the differential reaction fingerprint DRFP
title Reaction classification and yield prediction using the differential reaction fingerprint DRFP
title_full Reaction classification and yield prediction using the differential reaction fingerprint DRFP
title_fullStr Reaction classification and yield prediction using the differential reaction fingerprint DRFP
title_full_unstemmed Reaction classification and yield prediction using the differential reaction fingerprint DRFP
title_short Reaction classification and yield prediction using the differential reaction fingerprint DRFP
title_sort reaction classification and yield prediction using the differential reaction fingerprint drfp
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8996827/
https://www.ncbi.nlm.nih.gov/pubmed/35515081
http://dx.doi.org/10.1039/d1dd00006c
work_keys_str_mv AT probstdaniel reactionclassificationandyieldpredictionusingthedifferentialreactionfingerprintdrfp
AT schwallerphilippe reactionclassificationandyieldpredictionusingthedifferentialreactionfingerprintdrfp
AT reymondjeanlouis reactionclassificationandyieldpredictionusingthedifferentialreactionfingerprintdrfp