Cargando…

FragDPI: a novel drug-protein interaction prediction model based on fragment understanding and unified coding

Prediction of drug-protein binding is critical for virtual drug screening. Many deep learning methods have been proposed to predict the drug-protein binding based on protein sequences and drug representation sequences. However, most existing methods extract features from protein and drug sequences s...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Zhihui, Liu, Juan, Zhu, Xuekai, Yang, Feng, Zhang, Qiang, Shah, Hayat Ali
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Higher Education Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9745276/
https://www.ncbi.nlm.nih.gov/pubmed/36532946
http://dx.doi.org/10.1007/s11704-022-2163-9
_version_ 1784849114443284480
author Yang, Zhihui
Liu, Juan
Zhu, Xuekai
Yang, Feng
Zhang, Qiang
Shah, Hayat Ali
author_facet Yang, Zhihui
Liu, Juan
Zhu, Xuekai
Yang, Feng
Zhang, Qiang
Shah, Hayat Ali
author_sort Yang, Zhihui
collection PubMed
description Prediction of drug-protein binding is critical for virtual drug screening. Many deep learning methods have been proposed to predict the drug-protein binding based on protein sequences and drug representation sequences. However, most existing methods extract features from protein and drug sequences separately. As a result, they can not learn the features characterizing the drug-protein interactions. In addition, the existing methods encode the protein (drug) sequence usually based on the assumption that each amino acid (atom) has the same contribution to the binding, ignoring different impacts of different amino acids (atoms) on the binding. However, the event of drug-protein binding usually occurs between conserved residue fragments in the protein sequence and atom fragments of the drug molecule. Therefore, a more comprehensive encoding strategy is required to extract information from the conserved fragments. In this paper, we propose a novel model, named FragDPI, to predict the drug-protein binding affinity. Unlike other methods, we encode the sequences based on the conserved fragments and encode the protein and drug into a unified vector. Moreover, we adopt a novel two-step training strategy to train FragDPI. The pre-training step is to learn the interactions between different fragments using unsupervised learning. The fine-tuning step is for predicting the binding affinities using supervised learning. The experiment results have illustrated the superiority of FragDPI. ELECTRONIC SUPPLEMENTARY MATERIAL: Supplementary material is available for this article at 10.1007/s11704-022-2163-9 and is accessible for authorized users.
format Online
Article
Text
id pubmed-9745276
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Higher Education Press
record_format MEDLINE/PubMed
spelling pubmed-97452762022-12-13 FragDPI: a novel drug-protein interaction prediction model based on fragment understanding and unified coding Yang, Zhihui Liu, Juan Zhu, Xuekai Yang, Feng Zhang, Qiang Shah, Hayat Ali Front Comput Sci Research Article Prediction of drug-protein binding is critical for virtual drug screening. Many deep learning methods have been proposed to predict the drug-protein binding based on protein sequences and drug representation sequences. However, most existing methods extract features from protein and drug sequences separately. As a result, they can not learn the features characterizing the drug-protein interactions. In addition, the existing methods encode the protein (drug) sequence usually based on the assumption that each amino acid (atom) has the same contribution to the binding, ignoring different impacts of different amino acids (atoms) on the binding. However, the event of drug-protein binding usually occurs between conserved residue fragments in the protein sequence and atom fragments of the drug molecule. Therefore, a more comprehensive encoding strategy is required to extract information from the conserved fragments. In this paper, we propose a novel model, named FragDPI, to predict the drug-protein binding affinity. Unlike other methods, we encode the sequences based on the conserved fragments and encode the protein and drug into a unified vector. Moreover, we adopt a novel two-step training strategy to train FragDPI. The pre-training step is to learn the interactions between different fragments using unsupervised learning. The fine-tuning step is for predicting the binding affinities using supervised learning. The experiment results have illustrated the superiority of FragDPI. ELECTRONIC SUPPLEMENTARY MATERIAL: Supplementary material is available for this article at 10.1007/s11704-022-2163-9 and is accessible for authorized users. Higher Education Press 2022-12-13 2023 /pmc/articles/PMC9745276/ /pubmed/36532946 http://dx.doi.org/10.1007/s11704-022-2163-9 Text en © Higher Education Press 2022 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Research Article
Yang, Zhihui
Liu, Juan
Zhu, Xuekai
Yang, Feng
Zhang, Qiang
Shah, Hayat Ali
FragDPI: a novel drug-protein interaction prediction model based on fragment understanding and unified coding
title FragDPI: a novel drug-protein interaction prediction model based on fragment understanding and unified coding
title_full FragDPI: a novel drug-protein interaction prediction model based on fragment understanding and unified coding
title_fullStr FragDPI: a novel drug-protein interaction prediction model based on fragment understanding and unified coding
title_full_unstemmed FragDPI: a novel drug-protein interaction prediction model based on fragment understanding and unified coding
title_short FragDPI: a novel drug-protein interaction prediction model based on fragment understanding and unified coding
title_sort fragdpi: a novel drug-protein interaction prediction model based on fragment understanding and unified coding
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9745276/
https://www.ncbi.nlm.nih.gov/pubmed/36532946
http://dx.doi.org/10.1007/s11704-022-2163-9
work_keys_str_mv AT yangzhihui fragdpianoveldrugproteininteractionpredictionmodelbasedonfragmentunderstandingandunifiedcoding
AT liujuan fragdpianoveldrugproteininteractionpredictionmodelbasedonfragmentunderstandingandunifiedcoding
AT zhuxuekai fragdpianoveldrugproteininteractionpredictionmodelbasedonfragmentunderstandingandunifiedcoding
AT yangfeng fragdpianoveldrugproteininteractionpredictionmodelbasedonfragmentunderstandingandunifiedcoding
AT zhangqiang fragdpianoveldrugproteininteractionpredictionmodelbasedonfragmentunderstandingandunifiedcoding
AT shahhayatali fragdpianoveldrugproteininteractionpredictionmodelbasedonfragmentunderstandingandunifiedcoding