Cargando…

Deep scaffold hopping with multimodal transformer neural networks

Scaffold hopping is a central task of modern medicinal chemistry for rational drug design, which aims to design molecules of novel scaffolds sharing similar target biological activities toward known hit molecules. Traditionally, scaffolding hopping depends on searching databases of available compoun...

Descripción completa

Detalles Bibliográficos
Autores principales: Zheng, Shuangjia, Lei, Zengrong, Ai, Haitao, Chen, Hongming, Deng, Daiguo, Yang, Yuedong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8590293/
https://www.ncbi.nlm.nih.gov/pubmed/34774103
http://dx.doi.org/10.1186/s13321-021-00565-5
_version_ 1784598928629432320
author Zheng, Shuangjia
Lei, Zengrong
Ai, Haitao
Chen, Hongming
Deng, Daiguo
Yang, Yuedong
author_facet Zheng, Shuangjia
Lei, Zengrong
Ai, Haitao
Chen, Hongming
Deng, Daiguo
Yang, Yuedong
author_sort Zheng, Shuangjia
collection PubMed
description Scaffold hopping is a central task of modern medicinal chemistry for rational drug design, which aims to design molecules of novel scaffolds sharing similar target biological activities toward known hit molecules. Traditionally, scaffolding hopping depends on searching databases of available compounds that can't exploit vast chemical space. In this study, we have re-formulated this task as a supervised molecule-to-molecule translation to generate hopped molecules novel in 2D structure but similar in 3D structure, as inspired by the fact that candidate compounds bind with their targets through 3D conformations. To efficiently train the model, we curated over 50 thousand pairs of molecules with increased bioactivity, similar 3D structure, but different 2D structure from public bioactivity database, which spanned 40 kinases commonly investigated by medicinal chemists. Moreover, we have designed a multimodal molecular transformer architecture by integrating molecular 3D conformer through a spatial graph neural network and protein sequence information through Transformer. The trained DeepHop model was shown able to generate around 70% molecules having improved bioactivity together with high 3D similarity but low 2D scaffold similarity to the template molecules. This ratio was 1.9 times higher than other state-of-the-art deep learning methods and rule- and virtual screening-based methods. Furthermore, we demonstrated that the model could generalize to new target proteins through fine-tuning with a small set of active compounds. Case studies have also shown the advantages and usefulness of DeepHop in practical scaffold hopping scenarios. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-021-00565-5.
format Online
Article
Text
id pubmed-8590293
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-85902932021-11-15 Deep scaffold hopping with multimodal transformer neural networks Zheng, Shuangjia Lei, Zengrong Ai, Haitao Chen, Hongming Deng, Daiguo Yang, Yuedong J Cheminform Research Article Scaffold hopping is a central task of modern medicinal chemistry for rational drug design, which aims to design molecules of novel scaffolds sharing similar target biological activities toward known hit molecules. Traditionally, scaffolding hopping depends on searching databases of available compounds that can't exploit vast chemical space. In this study, we have re-formulated this task as a supervised molecule-to-molecule translation to generate hopped molecules novel in 2D structure but similar in 3D structure, as inspired by the fact that candidate compounds bind with their targets through 3D conformations. To efficiently train the model, we curated over 50 thousand pairs of molecules with increased bioactivity, similar 3D structure, but different 2D structure from public bioactivity database, which spanned 40 kinases commonly investigated by medicinal chemists. Moreover, we have designed a multimodal molecular transformer architecture by integrating molecular 3D conformer through a spatial graph neural network and protein sequence information through Transformer. The trained DeepHop model was shown able to generate around 70% molecules having improved bioactivity together with high 3D similarity but low 2D scaffold similarity to the template molecules. This ratio was 1.9 times higher than other state-of-the-art deep learning methods and rule- and virtual screening-based methods. Furthermore, we demonstrated that the model could generalize to new target proteins through fine-tuning with a small set of active compounds. Case studies have also shown the advantages and usefulness of DeepHop in practical scaffold hopping scenarios. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-021-00565-5. Springer International Publishing 2021-11-13 /pmc/articles/PMC8590293/ /pubmed/34774103 http://dx.doi.org/10.1186/s13321-021-00565-5 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Zheng, Shuangjia
Lei, Zengrong
Ai, Haitao
Chen, Hongming
Deng, Daiguo
Yang, Yuedong
Deep scaffold hopping with multimodal transformer neural networks
title Deep scaffold hopping with multimodal transformer neural networks
title_full Deep scaffold hopping with multimodal transformer neural networks
title_fullStr Deep scaffold hopping with multimodal transformer neural networks
title_full_unstemmed Deep scaffold hopping with multimodal transformer neural networks
title_short Deep scaffold hopping with multimodal transformer neural networks
title_sort deep scaffold hopping with multimodal transformer neural networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8590293/
https://www.ncbi.nlm.nih.gov/pubmed/34774103
http://dx.doi.org/10.1186/s13321-021-00565-5
work_keys_str_mv AT zhengshuangjia deepscaffoldhoppingwithmultimodaltransformerneuralnetworks
AT leizengrong deepscaffoldhoppingwithmultimodaltransformerneuralnetworks
AT aihaitao deepscaffoldhoppingwithmultimodaltransformerneuralnetworks
AT chenhongming deepscaffoldhoppingwithmultimodaltransformerneuralnetworks
AT dengdaiguo deepscaffoldhoppingwithmultimodaltransformerneuralnetworks
AT yangyuedong deepscaffoldhoppingwithmultimodaltransformerneuralnetworks