Cargando…

Gradient Boosting Decision Tree-Based Method for Predicting Interactions Between Target Genes and Drugs

Determining the target genes that interact with drugs—drug–target interactions—plays an important role in drug discovery. Identification of drug–target interactions through biological experiments is time consuming, laborious, and costly. Therefore, using computational approaches to predict candidate...

Descripción completa

Detalles Bibliográficos
Autores principales: Xuan, Ping, Sun, Chang, Zhang, Tiangang, Ye, Yilin, Shen, Tonghui, Dong, Yihua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6555260/
https://www.ncbi.nlm.nih.gov/pubmed/31214240
http://dx.doi.org/10.3389/fgene.2019.00459
_version_ 1783425122322874368
author Xuan, Ping
Sun, Chang
Zhang, Tiangang
Ye, Yilin
Shen, Tonghui
Dong, Yihua
author_facet Xuan, Ping
Sun, Chang
Zhang, Tiangang
Ye, Yilin
Shen, Tonghui
Dong, Yihua
author_sort Xuan, Ping
collection PubMed
description Determining the target genes that interact with drugs—drug–target interactions—plays an important role in drug discovery. Identification of drug–target interactions through biological experiments is time consuming, laborious, and costly. Therefore, using computational approaches to predict candidate targets is a good way to reduce the cost of wet-lab experiments. However, the known interactions (positive samples) and the unknown interactions (negative samples) display a serious class imbalance, which has an adverse effect on the accuracy of the prediction results. To mitigate the impact of class imbalance and completely exploit the negative samples, we proposed a new method, named DTIGBDT, based on gradient boosting decision trees, for predicting candidate drug–target interactions. We constructed a drug–target heterogeneous network that contains the drug similarities based on the chemical structures of drugs, the target similarities based on target sequences, and the known drug–target interactions. The topological information of the network was captured by random walks to update the similarities between drugs or targets. The paths between drugs and targets could be divided into multiple categories, and the features of each category of paths were extracted. We constructed a prediction model based on gradient boosting decision trees. The model establishes multiple decision trees with the extracted features and obtains the interaction scores between drugs and targets. DTIGBDT is a method of ensemble learning, and it effectively reduces the impact of class imbalance. The experimental results indicate that DTIGBDT outperforms several state-of-the-art methods for drug–target interaction prediction. In addition, case studies on Quetiapine, Clozapine, Olanzapine, Aripiprazole, and Ziprasidone demonstrate the ability of DTIGBDT to discover potential drug–target interactions.
format Online
Article
Text
id pubmed-6555260
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-65552602019-06-18 Gradient Boosting Decision Tree-Based Method for Predicting Interactions Between Target Genes and Drugs Xuan, Ping Sun, Chang Zhang, Tiangang Ye, Yilin Shen, Tonghui Dong, Yihua Front Genet Genetics Determining the target genes that interact with drugs—drug–target interactions—plays an important role in drug discovery. Identification of drug–target interactions through biological experiments is time consuming, laborious, and costly. Therefore, using computational approaches to predict candidate targets is a good way to reduce the cost of wet-lab experiments. However, the known interactions (positive samples) and the unknown interactions (negative samples) display a serious class imbalance, which has an adverse effect on the accuracy of the prediction results. To mitigate the impact of class imbalance and completely exploit the negative samples, we proposed a new method, named DTIGBDT, based on gradient boosting decision trees, for predicting candidate drug–target interactions. We constructed a drug–target heterogeneous network that contains the drug similarities based on the chemical structures of drugs, the target similarities based on target sequences, and the known drug–target interactions. The topological information of the network was captured by random walks to update the similarities between drugs or targets. The paths between drugs and targets could be divided into multiple categories, and the features of each category of paths were extracted. We constructed a prediction model based on gradient boosting decision trees. The model establishes multiple decision trees with the extracted features and obtains the interaction scores between drugs and targets. DTIGBDT is a method of ensemble learning, and it effectively reduces the impact of class imbalance. The experimental results indicate that DTIGBDT outperforms several state-of-the-art methods for drug–target interaction prediction. In addition, case studies on Quetiapine, Clozapine, Olanzapine, Aripiprazole, and Ziprasidone demonstrate the ability of DTIGBDT to discover potential drug–target interactions. Frontiers Media S.A. 2019-05-31 /pmc/articles/PMC6555260/ /pubmed/31214240 http://dx.doi.org/10.3389/fgene.2019.00459 Text en Copyright © 2019 Xuan, Sun, Zhang, Ye, Shen and Dong. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Xuan, Ping
Sun, Chang
Zhang, Tiangang
Ye, Yilin
Shen, Tonghui
Dong, Yihua
Gradient Boosting Decision Tree-Based Method for Predicting Interactions Between Target Genes and Drugs
title Gradient Boosting Decision Tree-Based Method for Predicting Interactions Between Target Genes and Drugs
title_full Gradient Boosting Decision Tree-Based Method for Predicting Interactions Between Target Genes and Drugs
title_fullStr Gradient Boosting Decision Tree-Based Method for Predicting Interactions Between Target Genes and Drugs
title_full_unstemmed Gradient Boosting Decision Tree-Based Method for Predicting Interactions Between Target Genes and Drugs
title_short Gradient Boosting Decision Tree-Based Method for Predicting Interactions Between Target Genes and Drugs
title_sort gradient boosting decision tree-based method for predicting interactions between target genes and drugs
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6555260/
https://www.ncbi.nlm.nih.gov/pubmed/31214240
http://dx.doi.org/10.3389/fgene.2019.00459
work_keys_str_mv AT xuanping gradientboostingdecisiontreebasedmethodforpredictinginteractionsbetweentargetgenesanddrugs
AT sunchang gradientboostingdecisiontreebasedmethodforpredictinginteractionsbetweentargetgenesanddrugs
AT zhangtiangang gradientboostingdecisiontreebasedmethodforpredictinginteractionsbetweentargetgenesanddrugs
AT yeyilin gradientboostingdecisiontreebasedmethodforpredictinginteractionsbetweentargetgenesanddrugs
AT shentonghui gradientboostingdecisiontreebasedmethodforpredictinginteractionsbetweentargetgenesanddrugs
AT dongyihua gradientboostingdecisiontreebasedmethodforpredictinginteractionsbetweentargetgenesanddrugs