Cargando…

Generating Focused Molecule Libraries for Drug Discovery with Recurrent Neural Networks

[Image: see text] In de novo drug design, computational strategies are used to generate novel molecules with good affinity to the desired biological target. In this work, we show that recurrent neural networks can be trained as generative models for molecular structures, similar to statistical langu...

Descripción completa

Detalles Bibliográficos
Autores principales: Segler, Marwin H. S., Kogej, Thierry, Tyrchan, Christian, Waller, Mark P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2017
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5785775/
https://www.ncbi.nlm.nih.gov/pubmed/29392184
http://dx.doi.org/10.1021/acscentsci.7b00512
_version_ 1783295670291005440
author Segler, Marwin H. S.
Kogej, Thierry
Tyrchan, Christian
Waller, Mark P.
author_facet Segler, Marwin H. S.
Kogej, Thierry
Tyrchan, Christian
Waller, Mark P.
author_sort Segler, Marwin H. S.
collection PubMed
description [Image: see text] In de novo drug design, computational strategies are used to generate novel molecules with good affinity to the desired biological target. In this work, we show that recurrent neural networks can be trained as generative models for molecular structures, similar to statistical language models in natural language processing. We demonstrate that the properties of the generated molecules correlate very well with the properties of the molecules used to train the model. In order to enrich libraries with molecules active toward a given biological target, we propose to fine-tune the model with small sets of molecules, which are known to be active against that target. Against Staphylococcus aureus, the model reproduced 14% of 6051 hold-out test molecules that medicinal chemists designed, whereas against Plasmodium falciparum (Malaria), it reproduced 28% of 1240 test molecules. When coupled with a scoring function, our model can perform the complete de novo drug design cycle to generate large sets of novel molecules for drug discovery.
format Online
Article
Text
id pubmed-5785775
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-57857752018-02-01 Generating Focused Molecule Libraries for Drug Discovery with Recurrent Neural Networks Segler, Marwin H. S. Kogej, Thierry Tyrchan, Christian Waller, Mark P. ACS Cent Sci [Image: see text] In de novo drug design, computational strategies are used to generate novel molecules with good affinity to the desired biological target. In this work, we show that recurrent neural networks can be trained as generative models for molecular structures, similar to statistical language models in natural language processing. We demonstrate that the properties of the generated molecules correlate very well with the properties of the molecules used to train the model. In order to enrich libraries with molecules active toward a given biological target, we propose to fine-tune the model with small sets of molecules, which are known to be active against that target. Against Staphylococcus aureus, the model reproduced 14% of 6051 hold-out test molecules that medicinal chemists designed, whereas against Plasmodium falciparum (Malaria), it reproduced 28% of 1240 test molecules. When coupled with a scoring function, our model can perform the complete de novo drug design cycle to generate large sets of novel molecules for drug discovery. American Chemical Society 2017-12-28 2018-01-24 /pmc/articles/PMC5785775/ /pubmed/29392184 http://dx.doi.org/10.1021/acscentsci.7b00512 Text en Copyright © 2017 American Chemical Society This is an open access article published under an ACS AuthorChoice License (http://pubs.acs.org/page/policy/authorchoice_termsofuse.html) , which permits copying and redistribution of the article or any adaptations for non-commercial purposes.
spellingShingle Segler, Marwin H. S.
Kogej, Thierry
Tyrchan, Christian
Waller, Mark P.
Generating Focused Molecule Libraries for Drug Discovery with Recurrent Neural Networks
title Generating Focused Molecule Libraries for Drug Discovery with Recurrent Neural Networks
title_full Generating Focused Molecule Libraries for Drug Discovery with Recurrent Neural Networks
title_fullStr Generating Focused Molecule Libraries for Drug Discovery with Recurrent Neural Networks
title_full_unstemmed Generating Focused Molecule Libraries for Drug Discovery with Recurrent Neural Networks
title_short Generating Focused Molecule Libraries for Drug Discovery with Recurrent Neural Networks
title_sort generating focused molecule libraries for drug discovery with recurrent neural networks
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5785775/
https://www.ncbi.nlm.nih.gov/pubmed/29392184
http://dx.doi.org/10.1021/acscentsci.7b00512
work_keys_str_mv AT seglermarwinhs generatingfocusedmoleculelibrariesfordrugdiscoverywithrecurrentneuralnetworks
AT kogejthierry generatingfocusedmoleculelibrariesfordrugdiscoverywithrecurrentneuralnetworks
AT tyrchanchristian generatingfocusedmoleculelibrariesfordrugdiscoverywithrecurrentneuralnetworks
AT wallermarkp generatingfocusedmoleculelibrariesfordrugdiscoverywithrecurrentneuralnetworks