Cargando…

CMMS-GCL: cross-modality metabolic stability prediction with graph contrastive learning

MOTIVATION: Metabolic stability plays a crucial role in the early stages of drug discovery and development. Accurately modeling and predicting molecular metabolic stability has great potential for the efficient screening of drug candidates as well as the optimization of lead compounds. Considering w...

Descripción completa

Detalles Bibliográficos
Autores principales: Du, Bing-Xue, Long, Yahui, Li, Xiaoli, Wu, Min, Shi, Jian-Yu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10457661/
https://www.ncbi.nlm.nih.gov/pubmed/37572298
http://dx.doi.org/10.1093/bioinformatics/btad503
_version_ 1785096979805634560
author Du, Bing-Xue
Long, Yahui
Li, Xiaoli
Wu, Min
Shi, Jian-Yu
author_facet Du, Bing-Xue
Long, Yahui
Li, Xiaoli
Wu, Min
Shi, Jian-Yu
author_sort Du, Bing-Xue
collection PubMed
description MOTIVATION: Metabolic stability plays a crucial role in the early stages of drug discovery and development. Accurately modeling and predicting molecular metabolic stability has great potential for the efficient screening of drug candidates as well as the optimization of lead compounds. Considering wet-lab experiment is time-consuming, laborious, and expensive, in silico prediction of metabolic stability is an alternative choice. However, few computational methods have been developed to address this task. In addition, it remains a significant challenge to explain key functional groups determining metabolic stability. RESULTS: To address these issues, we develop a novel cross-modality graph contrastive learning model named CMMS-GCL for predicting the metabolic stability of drug candidates. In our framework, we design deep learning methods to extract features for molecules from two modality data, i.e. SMILES sequence and molecule graph. In particular, for the sequence data, we design a multihead attention BiGRU-based encoder to preserve the context of symbols to learn sequence representations of molecules. For the graph data, we propose a graph contrastive learning-based encoder to learn structure representations by effectively capturing the consistencies between local and global structures. We further exploit fully connected neural networks to combine the sequence and structure representations for model training. Extensive experimental results on two datasets demonstrate that our CMMS-GCL consistently outperforms seven state-of-the-art methods. Furthermore, a collection of case studies on sequence data and statistical analyses of the graph structure module strengthens the validation of the interpretability of crucial functional groups recognized by CMMS-GCL. Overall, CMMS-GCL can serve as an effective and interpretable tool for predicting metabolic stability, identifying critical functional groups, and thus facilitating the drug discovery process and lead compound optimization. AVAILABILITY AND IMPLEMENTATION: The code and data underlying this article are freely available at https://github.com/dubingxue/CMMS-GCL.
format Online
Article
Text
id pubmed-10457661
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-104576612023-08-27 CMMS-GCL: cross-modality metabolic stability prediction with graph contrastive learning Du, Bing-Xue Long, Yahui Li, Xiaoli Wu, Min Shi, Jian-Yu Bioinformatics Original Paper MOTIVATION: Metabolic stability plays a crucial role in the early stages of drug discovery and development. Accurately modeling and predicting molecular metabolic stability has great potential for the efficient screening of drug candidates as well as the optimization of lead compounds. Considering wet-lab experiment is time-consuming, laborious, and expensive, in silico prediction of metabolic stability is an alternative choice. However, few computational methods have been developed to address this task. In addition, it remains a significant challenge to explain key functional groups determining metabolic stability. RESULTS: To address these issues, we develop a novel cross-modality graph contrastive learning model named CMMS-GCL for predicting the metabolic stability of drug candidates. In our framework, we design deep learning methods to extract features for molecules from two modality data, i.e. SMILES sequence and molecule graph. In particular, for the sequence data, we design a multihead attention BiGRU-based encoder to preserve the context of symbols to learn sequence representations of molecules. For the graph data, we propose a graph contrastive learning-based encoder to learn structure representations by effectively capturing the consistencies between local and global structures. We further exploit fully connected neural networks to combine the sequence and structure representations for model training. Extensive experimental results on two datasets demonstrate that our CMMS-GCL consistently outperforms seven state-of-the-art methods. Furthermore, a collection of case studies on sequence data and statistical analyses of the graph structure module strengthens the validation of the interpretability of crucial functional groups recognized by CMMS-GCL. Overall, CMMS-GCL can serve as an effective and interpretable tool for predicting metabolic stability, identifying critical functional groups, and thus facilitating the drug discovery process and lead compound optimization. AVAILABILITY AND IMPLEMENTATION: The code and data underlying this article are freely available at https://github.com/dubingxue/CMMS-GCL. Oxford University Press 2023-08-12 /pmc/articles/PMC10457661/ /pubmed/37572298 http://dx.doi.org/10.1093/bioinformatics/btad503 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Du, Bing-Xue
Long, Yahui
Li, Xiaoli
Wu, Min
Shi, Jian-Yu
CMMS-GCL: cross-modality metabolic stability prediction with graph contrastive learning
title CMMS-GCL: cross-modality metabolic stability prediction with graph contrastive learning
title_full CMMS-GCL: cross-modality metabolic stability prediction with graph contrastive learning
title_fullStr CMMS-GCL: cross-modality metabolic stability prediction with graph contrastive learning
title_full_unstemmed CMMS-GCL: cross-modality metabolic stability prediction with graph contrastive learning
title_short CMMS-GCL: cross-modality metabolic stability prediction with graph contrastive learning
title_sort cmms-gcl: cross-modality metabolic stability prediction with graph contrastive learning
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10457661/
https://www.ncbi.nlm.nih.gov/pubmed/37572298
http://dx.doi.org/10.1093/bioinformatics/btad503
work_keys_str_mv AT dubingxue cmmsgclcrossmodalitymetabolicstabilitypredictionwithgraphcontrastivelearning
AT longyahui cmmsgclcrossmodalitymetabolicstabilitypredictionwithgraphcontrastivelearning
AT lixiaoli cmmsgclcrossmodalitymetabolicstabilitypredictionwithgraphcontrastivelearning
AT wumin cmmsgclcrossmodalitymetabolicstabilitypredictionwithgraphcontrastivelearning
AT shijianyu cmmsgclcrossmodalitymetabolicstabilitypredictionwithgraphcontrastivelearning