Cargando…
An Ensemble Learning-Based Method for Inferring Drug-Target Interactions Combining Protein Sequences and Drug Fingerprints
Identifying the interactions of the drug-target is central to the cognate areas including drug discovery and drug reposition. Although the high-throughput biotechnologies have made tremendous progress, the indispensable clinical trials remain to be expensive, laborious, and intricate. Therefore, a c...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8093043/ https://www.ncbi.nlm.nih.gov/pubmed/33987446 http://dx.doi.org/10.1155/2021/9933873 |
_version_ | 1783687731400933376 |
---|---|
author | Zhao, Zheng-Yang Huang, Wen-Zhun Zhan, Xin-Ke Pan, Jie Huang, Yu-An Zhang, Shan-Wen Yu, Chang-Qing |
author_facet | Zhao, Zheng-Yang Huang, Wen-Zhun Zhan, Xin-Ke Pan, Jie Huang, Yu-An Zhang, Shan-Wen Yu, Chang-Qing |
author_sort | Zhao, Zheng-Yang |
collection | PubMed |
description | Identifying the interactions of the drug-target is central to the cognate areas including drug discovery and drug reposition. Although the high-throughput biotechnologies have made tremendous progress, the indispensable clinical trials remain to be expensive, laborious, and intricate. Therefore, a convenient and reliable computer-aided method has become the focus on inferring drug-target interactions (DTIs). In this research, we propose a novel computational model integrating a pyramid histogram of oriented gradients (PHOG), Position-Specific Scoring Matrix (PSSM), and rotation forest (RF) classifier for identifying DTIs. Specifically, protein primary sequences are first converted into PSSMs to describe the potential biological evolution information. After that, PHOG is employed to mine the highly representative features of PSSM from multiple pyramid levels, and the complete describers of drug-target pairs are generated by combining the molecular substructure fingerprints and PHOG features. Finally, we feed the complete describers into the RF classifier for effective prediction. The experiments of 5-fold Cross-Validations (CV) yield mean accuracies of 88.96%, 86.37%, 82.88%, and 76.92% on four golden standard data sets (enzyme, ion channel, G protein-coupled receptors (GPCRs), and nuclear receptor, respectively). Moreover, the paper also conducts the state-of-art light gradient boosting machine (LGBM) and support vector machine (SVM) to further verify the performance of the proposed model. The experimental outcomes substantiate that the established model is feasible and reliable to predict DTIs. There is an excellent prospect that our model is capable of predicting DTIs as an efficient tool on a large scale. |
format | Online Article Text |
id | pubmed-8093043 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-80930432021-05-12 An Ensemble Learning-Based Method for Inferring Drug-Target Interactions Combining Protein Sequences and Drug Fingerprints Zhao, Zheng-Yang Huang, Wen-Zhun Zhan, Xin-Ke Pan, Jie Huang, Yu-An Zhang, Shan-Wen Yu, Chang-Qing Biomed Res Int Research Article Identifying the interactions of the drug-target is central to the cognate areas including drug discovery and drug reposition. Although the high-throughput biotechnologies have made tremendous progress, the indispensable clinical trials remain to be expensive, laborious, and intricate. Therefore, a convenient and reliable computer-aided method has become the focus on inferring drug-target interactions (DTIs). In this research, we propose a novel computational model integrating a pyramid histogram of oriented gradients (PHOG), Position-Specific Scoring Matrix (PSSM), and rotation forest (RF) classifier for identifying DTIs. Specifically, protein primary sequences are first converted into PSSMs to describe the potential biological evolution information. After that, PHOG is employed to mine the highly representative features of PSSM from multiple pyramid levels, and the complete describers of drug-target pairs are generated by combining the molecular substructure fingerprints and PHOG features. Finally, we feed the complete describers into the RF classifier for effective prediction. The experiments of 5-fold Cross-Validations (CV) yield mean accuracies of 88.96%, 86.37%, 82.88%, and 76.92% on four golden standard data sets (enzyme, ion channel, G protein-coupled receptors (GPCRs), and nuclear receptor, respectively). Moreover, the paper also conducts the state-of-art light gradient boosting machine (LGBM) and support vector machine (SVM) to further verify the performance of the proposed model. The experimental outcomes substantiate that the established model is feasible and reliable to predict DTIs. There is an excellent prospect that our model is capable of predicting DTIs as an efficient tool on a large scale. Hindawi 2021-04-24 /pmc/articles/PMC8093043/ /pubmed/33987446 http://dx.doi.org/10.1155/2021/9933873 Text en Copyright © 2021 Zheng-Yang Zhao et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Zhao, Zheng-Yang Huang, Wen-Zhun Zhan, Xin-Ke Pan, Jie Huang, Yu-An Zhang, Shan-Wen Yu, Chang-Qing An Ensemble Learning-Based Method for Inferring Drug-Target Interactions Combining Protein Sequences and Drug Fingerprints |
title | An Ensemble Learning-Based Method for Inferring Drug-Target Interactions Combining Protein Sequences and Drug Fingerprints |
title_full | An Ensemble Learning-Based Method for Inferring Drug-Target Interactions Combining Protein Sequences and Drug Fingerprints |
title_fullStr | An Ensemble Learning-Based Method for Inferring Drug-Target Interactions Combining Protein Sequences and Drug Fingerprints |
title_full_unstemmed | An Ensemble Learning-Based Method for Inferring Drug-Target Interactions Combining Protein Sequences and Drug Fingerprints |
title_short | An Ensemble Learning-Based Method for Inferring Drug-Target Interactions Combining Protein Sequences and Drug Fingerprints |
title_sort | ensemble learning-based method for inferring drug-target interactions combining protein sequences and drug fingerprints |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8093043/ https://www.ncbi.nlm.nih.gov/pubmed/33987446 http://dx.doi.org/10.1155/2021/9933873 |
work_keys_str_mv | AT zhaozhengyang anensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints AT huangwenzhun anensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints AT zhanxinke anensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints AT panjie anensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints AT huangyuan anensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints AT zhangshanwen anensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints AT yuchangqing anensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints AT zhaozhengyang ensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints AT huangwenzhun ensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints AT zhanxinke ensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints AT panjie ensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints AT huangyuan ensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints AT zhangshanwen ensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints AT yuchangqing ensemblelearningbasedmethodforinferringdrugtargetinteractionscombiningproteinsequencesanddrugfingerprints |