
CProMG: controllable protein-oriented molecule generation with desired binding affinity and drug-like properties

MOTIVATION: Deep learning-based molecule generation has become a new paradigm of de novo molecule design because it enables fast and directed exploration of the vast chemical space. However, it remains an open problem to generate molecules that bind to specific proteins with high binding affinities w...


Bibliographic Details
Main authors: Li, Jia-Ning, Yang, Guang, Zhao, Peng-Cheng, Wei, Xue-Xin, Shi, Jian-Yu
Format: Online Article Text
Language: English
Published: Oxford University Press 2023
Subjects: Macromolecular Sequence, Structure, and Function
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10311289/
https://www.ncbi.nlm.nih.gov/pubmed/37387157
http://dx.doi.org/10.1093/bioinformatics/btad222
_version_ 1785066711004741632
author Li, Jia-Ning
Yang, Guang
Zhao, Peng-Cheng
Wei, Xue-Xin
Shi, Jian-Yu
collection PubMed
description MOTIVATION: Deep learning-based molecule generation has become a new paradigm of de novo molecule design because it enables fast and directed exploration of the vast chemical space. However, it remains an open problem to generate molecules that bind to specific proteins with high binding affinities while possessing desired drug-like physicochemical properties. RESULTS: To address these issues, we develop a novel framework for controllable protein-oriented molecule generation, named CProMG, which contains a 3D protein embedding module, a dual-view protein encoder, a molecule embedding module, and a novel drug-like molecule decoder. Fusing the hierarchical views of proteins, it significantly enhances the representation of protein binding pockets by associating amino acid residues with their constituent atoms. By jointly embedding molecule sequences, their drug-like properties, and their binding affinities with respect to proteins, it autoregressively generates novel molecules with specific properties in a controllable manner by measuring the proximity of molecule tokens to protein residues and atoms. A comparison with state-of-the-art deep generative methods demonstrates the superiority of CProMG. Furthermore, the progressive control of properties demonstrates the effectiveness of CProMG in controlling binding affinity and drug-like properties. The ablation studies then reveal how each of its crucial components contributes to the model, including the hierarchical protein views, the Laplacian position encoding, and property control. Finally, a case study with respect to a protein illustrates the novelty of CProMG and its ability to capture crucial interactions between protein pockets and molecules. It is anticipated that this work can boost de novo molecule design. AVAILABILITY AND IMPLEMENTATION: The code and data underlying this article are freely available at https://github.com/lijianing0902/CProMG.
format Online
Article
Text
id pubmed-10311289
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-10311289 2023-07-01 CProMG: controllable protein-oriented molecule generation with desired binding affinity and drug-like properties Li, Jia-Ning Yang, Guang Zhao, Peng-Cheng Wei, Xue-Xin Shi, Jian-Yu Bioinformatics Macromolecular Sequence, Structure, and Function MOTIVATION: Deep learning-based molecule generation has become a new paradigm of de novo molecule design because it enables fast and directed exploration of the vast chemical space. However, it remains an open problem to generate molecules that bind to specific proteins with high binding affinities while possessing desired drug-like physicochemical properties. RESULTS: To address these issues, we develop a novel framework for controllable protein-oriented molecule generation, named CProMG, which contains a 3D protein embedding module, a dual-view protein encoder, a molecule embedding module, and a novel drug-like molecule decoder. Fusing the hierarchical views of proteins, it significantly enhances the representation of protein binding pockets by associating amino acid residues with their constituent atoms. By jointly embedding molecule sequences, their drug-like properties, and their binding affinities with respect to proteins, it autoregressively generates novel molecules with specific properties in a controllable manner by measuring the proximity of molecule tokens to protein residues and atoms. A comparison with state-of-the-art deep generative methods demonstrates the superiority of CProMG. Furthermore, the progressive control of properties demonstrates the effectiveness of CProMG in controlling binding affinity and drug-like properties. The ablation studies then reveal how each of its crucial components contributes to the model, including the hierarchical protein views, the Laplacian position encoding, and property control. Finally, a case study with respect to a protein illustrates the novelty of CProMG and its ability to capture crucial interactions between protein pockets and molecules. It is anticipated that this work can boost de novo molecule design. AVAILABILITY AND IMPLEMENTATION: The code and data underlying this article are freely available at https://github.com/lijianing0902/CProMG. Oxford University Press 2023-06-30 /pmc/articles/PMC10311289/ /pubmed/37387157 http://dx.doi.org/10.1093/bioinformatics/btad222 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
title CProMG: controllable protein-oriented molecule generation with desired binding affinity and drug-like properties
topic Macromolecular Sequence, Structure, and Function
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10311289/
https://www.ncbi.nlm.nih.gov/pubmed/37387157
http://dx.doi.org/10.1093/bioinformatics/btad222