Cargando…

EmbedDTI: Enhancing the Molecular Representations via Sequence Embedding and Graph Convolutional Network for the Prediction of Drug-Target Interaction

The identification of drug-target interaction (DTI) plays a key role in drug discovery and development. Benefitting from large-scale drug databases and verified DTI relationships, a lot of machine-learning methods have been developed to predict DTIs. However, due to the difficulty in extracting usef...

Descripción completa

Detalles Bibliográficos
Autores principales: Jin, Yuan, Lu, Jiarui, Shi, Runhan, Yang, Yang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8698792/
https://www.ncbi.nlm.nih.gov/pubmed/34944427
http://dx.doi.org/10.3390/biom11121783
_version_ 1784620362249535488
author Jin, Yuan
Lu, Jiarui
Shi, Runhan
Yang, Yang
author_facet Jin, Yuan
Lu, Jiarui
Shi, Runhan
Yang, Yang
author_sort Jin, Yuan
collection PubMed
description The identification of drug-target interaction (DTI) plays a key role in drug discovery and development. Benefitting from large-scale drug databases and verified DTI relationships, a lot of machine-learning methods have been developed to predict DTIs. However, due to the difficulty in extracting useful information from molecules, the performance of these methods is limited by the representation of drugs and target proteins. This study proposes a new model called EmbedDTI to enhance the representation of both drugs and target proteins, and improve the performance of DTI prediction. For protein sequences, we leverage language modeling for pretraining the feature embeddings of amino acids and feed them to a convolutional neural network model for further representation learning. For drugs, we build two levels of graphs to represent compound structural information, namely the atom graph and substructure graph, and adopt graph convolutional network with an attention module to learn the embedding vectors for the graphs. We compare EmbedDTI with the existing DTI predictors on two benchmark datasets. The experimental results show that EmbedDTI outperforms the state-of-the-art models, and the attention module can identify the components crucial for DTIs in compounds.
format Online
Article
Text
id pubmed-8698792
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-86987922021-12-24 EmbedDTI: Enhancing the Molecular Representations via Sequence Embedding and Graph Convolutional Network for the Prediction of Drug-Target Interaction Jin, Yuan Lu, Jiarui Shi, Runhan Yang, Yang Biomolecules Article The identification of drug-target interaction (DTI) plays a key role in drug discovery and development. Benefitting from large-scale drug databases and verified DTI relationships, a lot of machine-learning methods have been developed to predict DTIs. However, due to the difficulty in extracting useful information from molecules, the performance of these methods is limited by the representation of drugs and target proteins. This study proposes a new model called EmbedDTI to enhance the representation of both drugs and target proteins, and improve the performance of DTI prediction. For protein sequences, we leverage language modeling for pretraining the feature embeddings of amino acids and feed them to a convolutional neural network model for further representation learning. For drugs, we build two levels of graphs to represent compound structural information, namely the atom graph and substructure graph, and adopt graph convolutional network with an attention module to learn the embedding vectors for the graphs. We compare EmbedDTI with the existing DTI predictors on two benchmark datasets. The experimental results show that EmbedDTI outperforms the state-of-the-art models, and the attention module can identify the components crucial for DTIs in compounds. MDPI 2021-11-29 /pmc/articles/PMC8698792/ /pubmed/34944427 http://dx.doi.org/10.3390/biom11121783 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Jin, Yuan
Lu, Jiarui
Shi, Runhan
Yang, Yang
EmbedDTI: Enhancing the Molecular Representations via Sequence Embedding and Graph Convolutional Network for the Prediction of Drug-Target Interaction
title EmbedDTI: Enhancing the Molecular Representations via Sequence Embedding and Graph Convolutional Network for the Prediction of Drug-Target Interaction
title_full EmbedDTI: Enhancing the Molecular Representations via Sequence Embedding and Graph Convolutional Network for the Prediction of Drug-Target Interaction
title_fullStr EmbedDTI: Enhancing the Molecular Representations via Sequence Embedding and Graph Convolutional Network for the Prediction of Drug-Target Interaction
title_full_unstemmed EmbedDTI: Enhancing the Molecular Representations via Sequence Embedding and Graph Convolutional Network for the Prediction of Drug-Target Interaction
title_short EmbedDTI: Enhancing the Molecular Representations via Sequence Embedding and Graph Convolutional Network for the Prediction of Drug-Target Interaction
title_sort embeddti: enhancing the molecular representations via sequence embedding and graph convolutional network for the prediction of drug-target interaction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8698792/
https://www.ncbi.nlm.nih.gov/pubmed/34944427
http://dx.doi.org/10.3390/biom11121783
work_keys_str_mv AT jinyuan embeddtienhancingthemolecularrepresentationsviasequenceembeddingandgraphconvolutionalnetworkforthepredictionofdrugtargetinteraction
AT lujiarui embeddtienhancingthemolecularrepresentationsviasequenceembeddingandgraphconvolutionalnetworkforthepredictionofdrugtargetinteraction
AT shirunhan embeddtienhancingthemolecularrepresentationsviasequenceembeddingandgraphconvolutionalnetworkforthepredictionofdrugtargetinteraction
AT yangyang embeddtienhancingthemolecularrepresentationsviasequenceembeddingandgraphconvolutionalnetworkforthepredictionofdrugtargetinteraction