Cargando…
CProMG: controllable protein-oriented molecule generation with desired binding affinity and drug-like properties
MOTIVATION: Deep learning-based molecule generation becomes a new paradigm of de novo molecule design since it enables fast and directional exploration in the vast chemical space. However, it is still an open issue to generate molecules, which bind to specific proteins with high-binding affinities w...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10311289/ https://www.ncbi.nlm.nih.gov/pubmed/37387157 http://dx.doi.org/10.1093/bioinformatics/btad222 |
_version_ | 1785066711004741632 |
---|---|
author | Li, Jia-Ning Yang, Guang Zhao, Peng-Cheng Wei, Xue-Xin Shi, Jian-Yu |
author_facet | Li, Jia-Ning Yang, Guang Zhao, Peng-Cheng Wei, Xue-Xin Shi, Jian-Yu |
author_sort | Li, Jia-Ning |
collection | PubMed |
description | MOTIVATION: Deep learning-based molecule generation becomes a new paradigm of de novo molecule design since it enables fast and directional exploration in the vast chemical space. However, it is still an open issue to generate molecules, which bind to specific proteins with high-binding affinities while owning desired drug-like physicochemical properties. RESULTS: To address these issues, we elaborate a novel framework for controllable protein-oriented molecule generation, named CProMG, which contains a 3D protein embedding module, a dual-view protein encoder, a molecule embedding module, and a novel drug-like molecule decoder. Based on fusing the hierarchical views of proteins, it enhances the representation of protein binding pockets significantly by associating amino acid residues with their comprising atoms. Through jointly embedding molecule sequences, their drug-like properties, and binding affinities w.r.t. proteins, it autoregressively generates novel molecules having specific properties in a controllable manner by measuring the proximity of molecule tokens to protein residues and atoms. The comparison with state-of-the-art deep generative methods demonstrates the superiority of our CProMG. Furthermore, the progressive control of properties demonstrates the effectiveness of CProMG when controlling binding affinity and drug-like properties. After that, the ablation studies reveal how its crucial components contribute to the model respectively, including hierarchical protein views, Laplacian position encoding as well as property control. Last, a case study w.r.t. protein illustrates the novelty of CProMG and the ability to capture crucial interactions between protein pockets and molecules. It’s anticipated that this work can boost de novo molecule design. AVAILABILITY AND IMPLEMENTATION: The code and data underlying this article are freely available at https://github.com/lijianing0902/CProMG. |
format | Online Article Text |
id | pubmed-10311289 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-103112892023-07-01 CProMG: controllable protein-oriented molecule generation with desired binding affinity and drug-like properties Li, Jia-Ning Yang, Guang Zhao, Peng-Cheng Wei, Xue-Xin Shi, Jian-Yu Bioinformatics Macromolecular Sequence, Structure, and Function MOTIVATION: Deep learning-based molecule generation becomes a new paradigm of de novo molecule design since it enables fast and directional exploration in the vast chemical space. However, it is still an open issue to generate molecules, which bind to specific proteins with high-binding affinities while owning desired drug-like physicochemical properties. RESULTS: To address these issues, we elaborate a novel framework for controllable protein-oriented molecule generation, named CProMG, which contains a 3D protein embedding module, a dual-view protein encoder, a molecule embedding module, and a novel drug-like molecule decoder. Based on fusing the hierarchical views of proteins, it enhances the representation of protein binding pockets significantly by associating amino acid residues with their comprising atoms. Through jointly embedding molecule sequences, their drug-like properties, and binding affinities w.r.t. proteins, it autoregressively generates novel molecules having specific properties in a controllable manner by measuring the proximity of molecule tokens to protein residues and atoms. The comparison with state-of-the-art deep generative methods demonstrates the superiority of our CProMG. Furthermore, the progressive control of properties demonstrates the effectiveness of CProMG when controlling binding affinity and drug-like properties. After that, the ablation studies reveal how its crucial components contribute to the model respectively, including hierarchical protein views, Laplacian position encoding as well as property control. Last, a case study w.r.t. protein illustrates the novelty of CProMG and the ability to capture crucial interactions between protein pockets and molecules. It’s anticipated that this work can boost de novo molecule design. AVAILABILITY AND IMPLEMENTATION: The code and data underlying this article are freely available at https://github.com/lijianing0902/CProMG. Oxford University Press 2023-06-30 /pmc/articles/PMC10311289/ /pubmed/37387157 http://dx.doi.org/10.1093/bioinformatics/btad222 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Macromolecular Sequence, Structure, and Function Li, Jia-Ning Yang, Guang Zhao, Peng-Cheng Wei, Xue-Xin Shi, Jian-Yu CProMG: controllable protein-oriented molecule generation with desired binding affinity and drug-like properties |
title | CProMG: controllable protein-oriented molecule generation with desired binding affinity and drug-like properties |
title_full | CProMG: controllable protein-oriented molecule generation with desired binding affinity and drug-like properties |
title_fullStr | CProMG: controllable protein-oriented molecule generation with desired binding affinity and drug-like properties |
title_full_unstemmed | CProMG: controllable protein-oriented molecule generation with desired binding affinity and drug-like properties |
title_short | CProMG: controllable protein-oriented molecule generation with desired binding affinity and drug-like properties |
title_sort | cpromg: controllable protein-oriented molecule generation with desired binding affinity and drug-like properties |
topic | Macromolecular Sequence, Structure, and Function |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10311289/ https://www.ncbi.nlm.nih.gov/pubmed/37387157 http://dx.doi.org/10.1093/bioinformatics/btad222 |
work_keys_str_mv | AT lijianing cpromgcontrollableproteinorientedmoleculegenerationwithdesiredbindingaffinityanddruglikeproperties AT yangguang cpromgcontrollableproteinorientedmoleculegenerationwithdesiredbindingaffinityanddruglikeproperties AT zhaopengcheng cpromgcontrollableproteinorientedmoleculegenerationwithdesiredbindingaffinityanddruglikeproperties AT weixuexin cpromgcontrollableproteinorientedmoleculegenerationwithdesiredbindingaffinityanddruglikeproperties AT shijianyu cpromgcontrollableproteinorientedmoleculegenerationwithdesiredbindingaffinityanddruglikeproperties |