Cargando…

An Ensemble Learning-Based Method for Inferring Drug-Target Interactions Combining Protein Sequences and Drug Fingerprints

Identifying the interactions of the drug-target is central to the cognate areas including drug discovery and drug reposition. Although the high-throughput biotechnologies have made tremendous progress, the indispensable clinical trials remain to be expensive, laborious, and intricate. Therefore, a c...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Zheng-Yang, Huang, Wen-Zhun, Zhan, Xin-Ke, Pan, Jie, Huang, Yu-An, Zhang, Shan-Wen, Yu, Chang-Qing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8093043/
https://www.ncbi.nlm.nih.gov/pubmed/33987446
http://dx.doi.org/10.1155/2021/9933873
_version_ 1783687731400933376
author Zhao, Zheng-Yang
Huang, Wen-Zhun
Zhan, Xin-Ke
Pan, Jie
Huang, Yu-An
Zhang, Shan-Wen
Yu, Chang-Qing
author_facet Zhao, Zheng-Yang
Huang, Wen-Zhun
Zhan, Xin-Ke
Pan, Jie
Huang, Yu-An
Zhang, Shan-Wen
Yu, Chang-Qing
author_sort Zhao, Zheng-Yang
collection PubMed
description Identifying the interactions of the drug-target is central to the cognate areas including drug discovery and drug reposition. Although the high-throughput biotechnologies have made tremendous progress, the indispensable clinical trials remain to be expensive, laborious, and intricate. Therefore, a convenient and reliable computer-aided method has become the focus on inferring drug-target interactions (DTIs). In this research, we propose a novel computational model integrating a pyramid histogram of oriented gradients (PHOG), Position-Specific Scoring Matrix (PSSM), and rotation forest (RF) classifier for identifying DTIs. Specifically, protein primary sequences are first converted into PSSMs to describe the potential biological evolution information. After that, PHOG is employed to mine the highly representative features of PSSM from multiple pyramid levels, and the complete describers of drug-target pairs are generated by combining the molecular substructure fingerprints and PHOG features. Finally, we feed the complete describers into the RF classifier for effective prediction. The experiments of 5-fold Cross-Validations (CV) yield mean accuracies of 88.96%, 86.37%, 82.88%, and 76.92% on four golden standard data sets (enzyme, ion channel, G protein-coupled receptors (GPCRs), and nuclear receptor, respectively). Moreover, the paper also conducts the state-of-art light gradient boosting machine (LGBM) and support vector machine (SVM) to further verify the performance of the proposed model. The experimental outcomes substantiate that the established model is feasible and reliable to predict DTIs. There is an excellent prospect that our model is capable of predicting DTIs as an efficient tool on a large scale.
format Online
Article
Text
id pubmed-8093043
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-80930432021-05-12 An Ensemble Learning-Based Method for Inferring Drug-Target Interactions Combining Protein Sequences and Drug Fingerprints Zhao, Zheng-Yang Huang, Wen-Zhun Zhan, Xin-Ke Pan, Jie Huang, Yu-An Zhang, Shan-Wen Yu, Chang-Qing Biomed Res Int Research Article Identifying the interactions of the drug-target is central to the cognate areas including drug discovery and drug reposition. Although the high-throughput biotechnologies have made tremendous progress, the indispensable clinical trials remain to be expensive, laborious, and intricate. Therefore, a convenient and reliable computer-aided method has become the focus on inferring drug-target interactions (DTIs). In this research, we propose a novel computational model integrating a pyramid histogram of oriented gradients (PHOG), Position-Specific Scoring Matrix (PSSM), and rotation forest (RF) classifier for identifying DTIs. Specifically, protein primary sequences are first converted into PSSMs to describe the potential biological evolution information. After that, PHOG is employed to mine the highly representative features of PSSM from multiple pyramid levels, and the complete describers of drug-target pairs are generated by combining the molecular substructure fingerprints and PHOG features. Finally, we feed the complete describers into the RF classifier for effective prediction. The experiments of 5-fold Cross-Validations (CV) yield mean accuracies of 88.96%, 86.37%, 82.88%, and 76.92% on four golden standard data sets (enzyme, ion channel, G protein-coupled receptors (GPCRs), and nuclear receptor, respectively). Moreover, the paper also conducts the state-of-art light gradient boosting machine (LGBM) and support vector machine (SVM) to further verify the performance of the proposed model. The experimental outcomes substantiate that the established model is feasible and reliable to predict DTIs. There is an excellent prospect that our model is capable of predicting DTIs as an efficient tool on a large scale. Hindawi 2021-04-24 /pmc/articles/PMC8093043/ /pubmed/33987446 http://dx.doi.org/10.1155/2021/9933873 Text en Copyright © 2021 Zheng-Yang Zhao et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhao, Zheng-Yang
Huang, Wen-Zhun
Zhan, Xin-Ke
Pan, Jie
Huang, Yu-An
Zhang, Shan-Wen
Yu, Chang-Qing
An Ensemble Learning-Based Method for Inferring Drug-Target Interactions Combining Protein Sequences and Drug Fingerprints
title An Ensemble Learning-Based Method for Inferring Drug-Target Interactions Combining Protein Sequences and Drug Fingerprints
title_full An Ensemble Learning-Based Method for Inferring Drug-Target Interactions Combining Protein Sequences and Drug Fingerprints
title_fullStr An Ensemble Learning-Based Method for Inferring Drug-Target Interactions Combining Protein Sequences and Drug Fingerprints
title_full_unstemmed An Ensemble Learning-Based Method for Inferring Drug-Target Interactions Combining Protein Sequences and Drug Fingerprints
title_short An Ensemble Learning-Based Method for Inferring Drug-Target Interactions Combining Protein Sequences and Drug Fingerprints
title_sort ensemble learning-based method for inferring drug-target interactions combining protein sequences and drug fingerprints
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8093043/
https://www.ncbi.nlm.nih.gov/pubmed/33987446
http://dx.doi.org/10.1155/2021/9933873
work_keys_str_mv AT zhaozhengyang anensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints
AT huangwenzhun anensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints
AT zhanxinke anensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints
AT panjie anensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints
AT huangyuan anensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints
AT zhangshanwen anensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints
AT yuchangqing anensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints
AT zhaozhengyang ensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints
AT huangwenzhun ensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints
AT zhanxinke ensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints
AT panjie ensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints
AT huangyuan ensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints
AT zhangshanwen ensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints
AT yuchangqing ensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints