Cargando…

An equivariant generative framework for molecular graph-structure Co-design

Designing molecules with desirable physiochemical properties and functionalities is a long-standing challenge in chemistry, material science, and drug discovery. Recently, machine learning-based generative models have emerged as promising approaches for de novo molecule design. However, further refi...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Zaixi, Liu, Qi, Lee, Chee-Kong, Hsieh, Chang-Yu, Chen, Enhong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society of Chemistry 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10411624/
https://www.ncbi.nlm.nih.gov/pubmed/37564414
http://dx.doi.org/10.1039/d3sc02538a
_version_ 1785086705715380224
author Zhang, Zaixi
Liu, Qi
Lee, Chee-Kong
Hsieh, Chang-Yu
Chen, Enhong
author_facet Zhang, Zaixi
Liu, Qi
Lee, Chee-Kong
Hsieh, Chang-Yu
Chen, Enhong
author_sort Zhang, Zaixi
collection PubMed
description Designing molecules with desirable physiochemical properties and functionalities is a long-standing challenge in chemistry, material science, and drug discovery. Recently, machine learning-based generative models have emerged as promising approaches for de novo molecule design. However, further refinement of methodology is highly desired as most existing methods lack unified modeling of 2D topology and 3D geometry information and fail to effectively learn the structure–property relationship for molecule design. Here we present MolCode, a roto-translation equivariant generative framework for molecular graph-structure Co-design. In MolCode, 3D geometric information empowers the molecular 2D graph generation, which in turn helps guide the prediction of molecular 3D structure. Extensive experimental results show that MolCode outperforms previous methods on a series of challenging tasks including de novo molecule design, targeted molecule discovery, and structure-based drug design. Particularly, MolCode not only consistently generates valid (99.95% validity) and diverse (98.75% uniqueness) molecular graphs/structures with desirable properties, but also generates drug-like molecules with high affinity to target proteins (61.8% high affinity ratio), which demonstrates MolCode's potential applications in material design and drug discovery. Our extensive investigation reveals that the 2D topology and 3D geometry contain intrinsically complementary information in molecule design, and provide new insights into machine learning-based molecule representation and generation.
format Online
Article
Text
id pubmed-10411624
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher The Royal Society of Chemistry
record_format MEDLINE/PubMed
spelling pubmed-104116242023-08-10 An equivariant generative framework for molecular graph-structure Co-design Zhang, Zaixi Liu, Qi Lee, Chee-Kong Hsieh, Chang-Yu Chen, Enhong Chem Sci Chemistry Designing molecules with desirable physiochemical properties and functionalities is a long-standing challenge in chemistry, material science, and drug discovery. Recently, machine learning-based generative models have emerged as promising approaches for de novo molecule design. However, further refinement of methodology is highly desired as most existing methods lack unified modeling of 2D topology and 3D geometry information and fail to effectively learn the structure–property relationship for molecule design. Here we present MolCode, a roto-translation equivariant generative framework for molecular graph-structure Co-design. In MolCode, 3D geometric information empowers the molecular 2D graph generation, which in turn helps guide the prediction of molecular 3D structure. Extensive experimental results show that MolCode outperforms previous methods on a series of challenging tasks including de novo molecule design, targeted molecule discovery, and structure-based drug design. Particularly, MolCode not only consistently generates valid (99.95% validity) and diverse (98.75% uniqueness) molecular graphs/structures with desirable properties, but also generates drug-like molecules with high affinity to target proteins (61.8% high affinity ratio), which demonstrates MolCode's potential applications in material design and drug discovery. Our extensive investigation reveals that the 2D topology and 3D geometry contain intrinsically complementary information in molecule design, and provide new insights into machine learning-based molecule representation and generation. The Royal Society of Chemistry 2023-07-19 /pmc/articles/PMC10411624/ /pubmed/37564414 http://dx.doi.org/10.1039/d3sc02538a Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by-nc/3.0/
spellingShingle Chemistry
Zhang, Zaixi
Liu, Qi
Lee, Chee-Kong
Hsieh, Chang-Yu
Chen, Enhong
An equivariant generative framework for molecular graph-structure Co-design
title An equivariant generative framework for molecular graph-structure Co-design
title_full An equivariant generative framework for molecular graph-structure Co-design
title_fullStr An equivariant generative framework for molecular graph-structure Co-design
title_full_unstemmed An equivariant generative framework for molecular graph-structure Co-design
title_short An equivariant generative framework for molecular graph-structure Co-design
title_sort equivariant generative framework for molecular graph-structure co-design
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10411624/
https://www.ncbi.nlm.nih.gov/pubmed/37564414
http://dx.doi.org/10.1039/d3sc02538a
work_keys_str_mv AT zhangzaixi anequivariantgenerativeframeworkformoleculargraphstructurecodesign
AT liuqi anequivariantgenerativeframeworkformoleculargraphstructurecodesign
AT leecheekong anequivariantgenerativeframeworkformoleculargraphstructurecodesign
AT hsiehchangyu anequivariantgenerativeframeworkformoleculargraphstructurecodesign
AT chenenhong anequivariantgenerativeframeworkformoleculargraphstructurecodesign
AT zhangzaixi equivariantgenerativeframeworkformoleculargraphstructurecodesign
AT liuqi equivariantgenerativeframeworkformoleculargraphstructurecodesign
AT leecheekong equivariantgenerativeframeworkformoleculargraphstructurecodesign
AT hsiehchangyu equivariantgenerativeframeworkformoleculargraphstructurecodesign
AT chenenhong equivariantgenerativeframeworkformoleculargraphstructurecodesign