Cargando…

Retrosynthetic Reaction Prediction Using Neural Sequence-to-Sequence Models

[Image: see text] We describe a fully data driven model that learns to perform a retrosynthetic reaction prediction task, which is treated as a sequence-to-sequence mapping problem. The end-to-end trained model has an encoder–decoder architecture that consists of two recurrent neural networks, which...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Bowen, Ramsundar, Bharath, Kawthekar, Prasad, Shi, Jade, Gomes, Joseph, Luu Nguyen, Quang, Ho, Stephen, Sloane, Jack, Wender, Paul, Pande, Vijay
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2017
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5658761/
https://www.ncbi.nlm.nih.gov/pubmed/29104927
http://dx.doi.org/10.1021/acscentsci.7b00303
_version_ 1783274068836876288
author Liu, Bowen
Ramsundar, Bharath
Kawthekar, Prasad
Shi, Jade
Gomes, Joseph
Luu Nguyen, Quang
Ho, Stephen
Sloane, Jack
Wender, Paul
Pande, Vijay
author_facet Liu, Bowen
Ramsundar, Bharath
Kawthekar, Prasad
Shi, Jade
Gomes, Joseph
Luu Nguyen, Quang
Ho, Stephen
Sloane, Jack
Wender, Paul
Pande, Vijay
author_sort Liu, Bowen
collection PubMed
description [Image: see text] We describe a fully data driven model that learns to perform a retrosynthetic reaction prediction task, which is treated as a sequence-to-sequence mapping problem. The end-to-end trained model has an encoder–decoder architecture that consists of two recurrent neural networks, which has previously shown great success in solving other sequence-to-sequence prediction tasks such as machine translation. The model is trained on 50,000 experimental reaction examples from the United States patent literature, which span 10 broad reaction types that are commonly used by medicinal chemists. We find that our model performs comparably with a rule-based expert system baseline model, and also overcomes certain limitations associated with rule-based expert systems and with any machine learning approach that contains a rule-based expert system component. Our model provides an important first step toward solving the challenging problem of computational retrosynthetic analysis.
format Online
Article
Text
id pubmed-5658761
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-56587612017-11-04 Retrosynthetic Reaction Prediction Using Neural Sequence-to-Sequence Models Liu, Bowen Ramsundar, Bharath Kawthekar, Prasad Shi, Jade Gomes, Joseph Luu Nguyen, Quang Ho, Stephen Sloane, Jack Wender, Paul Pande, Vijay ACS Cent Sci [Image: see text] We describe a fully data driven model that learns to perform a retrosynthetic reaction prediction task, which is treated as a sequence-to-sequence mapping problem. The end-to-end trained model has an encoder–decoder architecture that consists of two recurrent neural networks, which has previously shown great success in solving other sequence-to-sequence prediction tasks such as machine translation. The model is trained on 50,000 experimental reaction examples from the United States patent literature, which span 10 broad reaction types that are commonly used by medicinal chemists. We find that our model performs comparably with a rule-based expert system baseline model, and also overcomes certain limitations associated with rule-based expert systems and with any machine learning approach that contains a rule-based expert system component. Our model provides an important first step toward solving the challenging problem of computational retrosynthetic analysis. American Chemical Society 2017-09-05 2017-10-25 /pmc/articles/PMC5658761/ /pubmed/29104927 http://dx.doi.org/10.1021/acscentsci.7b00303 Text en Copyright © 2017 American Chemical Society This is an open access article published under an ACS AuthorChoice License (http://pubs.acs.org/page/policy/authorchoice_termsofuse.html) , which permits copying and redistribution of the article or any adaptations for non-commercial purposes.
spellingShingle Liu, Bowen
Ramsundar, Bharath
Kawthekar, Prasad
Shi, Jade
Gomes, Joseph
Luu Nguyen, Quang
Ho, Stephen
Sloane, Jack
Wender, Paul
Pande, Vijay
Retrosynthetic Reaction Prediction Using Neural Sequence-to-Sequence Models
title Retrosynthetic Reaction Prediction Using Neural Sequence-to-Sequence Models
title_full Retrosynthetic Reaction Prediction Using Neural Sequence-to-Sequence Models
title_fullStr Retrosynthetic Reaction Prediction Using Neural Sequence-to-Sequence Models
title_full_unstemmed Retrosynthetic Reaction Prediction Using Neural Sequence-to-Sequence Models
title_short Retrosynthetic Reaction Prediction Using Neural Sequence-to-Sequence Models
title_sort retrosynthetic reaction prediction using neural sequence-to-sequence models
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5658761/
https://www.ncbi.nlm.nih.gov/pubmed/29104927
http://dx.doi.org/10.1021/acscentsci.7b00303
work_keys_str_mv AT liubowen retrosyntheticreactionpredictionusingneuralsequencetosequencemodels
AT ramsundarbharath retrosyntheticreactionpredictionusingneuralsequencetosequencemodels
AT kawthekarprasad retrosyntheticreactionpredictionusingneuralsequencetosequencemodels
AT shijade retrosyntheticreactionpredictionusingneuralsequencetosequencemodels
AT gomesjoseph retrosyntheticreactionpredictionusingneuralsequencetosequencemodels
AT luunguyenquang retrosyntheticreactionpredictionusingneuralsequencetosequencemodels
AT hostephen retrosyntheticreactionpredictionusingneuralsequencetosequencemodels
AT sloanejack retrosyntheticreactionpredictionusingneuralsequencetosequencemodels
AT wenderpaul retrosyntheticreactionpredictionusingneuralsequencetosequencemodels
AT pandevijay retrosyntheticreactionpredictionusingneuralsequencetosequencemodels