Cargando…
Gradient Boosting Decision Tree-Based Method for Predicting Interactions Between Target Genes and Drugs
Determining the target genes that interact with drugs—drug–target interactions—plays an important role in drug discovery. Identification of drug–target interactions through biological experiments is time consuming, laborious, and costly. Therefore, using computational approaches to predict candidate...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6555260/ https://www.ncbi.nlm.nih.gov/pubmed/31214240 http://dx.doi.org/10.3389/fgene.2019.00459 |
_version_ | 1783425122322874368 |
---|---|
author | Xuan, Ping Sun, Chang Zhang, Tiangang Ye, Yilin Shen, Tonghui Dong, Yihua |
author_facet | Xuan, Ping Sun, Chang Zhang, Tiangang Ye, Yilin Shen, Tonghui Dong, Yihua |
author_sort | Xuan, Ping |
collection | PubMed |
description | Determining the target genes that interact with drugs—drug–target interactions—plays an important role in drug discovery. Identification of drug–target interactions through biological experiments is time consuming, laborious, and costly. Therefore, using computational approaches to predict candidate targets is a good way to reduce the cost of wet-lab experiments. However, the known interactions (positive samples) and the unknown interactions (negative samples) display a serious class imbalance, which has an adverse effect on the accuracy of the prediction results. To mitigate the impact of class imbalance and completely exploit the negative samples, we proposed a new method, named DTIGBDT, based on gradient boosting decision trees, for predicting candidate drug–target interactions. We constructed a drug–target heterogeneous network that contains the drug similarities based on the chemical structures of drugs, the target similarities based on target sequences, and the known drug–target interactions. The topological information of the network was captured by random walks to update the similarities between drugs or targets. The paths between drugs and targets could be divided into multiple categories, and the features of each category of paths were extracted. We constructed a prediction model based on gradient boosting decision trees. The model establishes multiple decision trees with the extracted features and obtains the interaction scores between drugs and targets. DTIGBDT is a method of ensemble learning, and it effectively reduces the impact of class imbalance. The experimental results indicate that DTIGBDT outperforms several state-of-the-art methods for drug–target interaction prediction. In addition, case studies on Quetiapine, Clozapine, Olanzapine, Aripiprazole, and Ziprasidone demonstrate the ability of DTIGBDT to discover potential drug–target interactions. |
format | Online Article Text |
id | pubmed-6555260 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-65552602019-06-18 Gradient Boosting Decision Tree-Based Method for Predicting Interactions Between Target Genes and Drugs Xuan, Ping Sun, Chang Zhang, Tiangang Ye, Yilin Shen, Tonghui Dong, Yihua Front Genet Genetics Determining the target genes that interact with drugs—drug–target interactions—plays an important role in drug discovery. Identification of drug–target interactions through biological experiments is time consuming, laborious, and costly. Therefore, using computational approaches to predict candidate targets is a good way to reduce the cost of wet-lab experiments. However, the known interactions (positive samples) and the unknown interactions (negative samples) display a serious class imbalance, which has an adverse effect on the accuracy of the prediction results. To mitigate the impact of class imbalance and completely exploit the negative samples, we proposed a new method, named DTIGBDT, based on gradient boosting decision trees, for predicting candidate drug–target interactions. We constructed a drug–target heterogeneous network that contains the drug similarities based on the chemical structures of drugs, the target similarities based on target sequences, and the known drug–target interactions. The topological information of the network was captured by random walks to update the similarities between drugs or targets. The paths between drugs and targets could be divided into multiple categories, and the features of each category of paths were extracted. We constructed a prediction model based on gradient boosting decision trees. The model establishes multiple decision trees with the extracted features and obtains the interaction scores between drugs and targets. DTIGBDT is a method of ensemble learning, and it effectively reduces the impact of class imbalance. The experimental results indicate that DTIGBDT outperforms several state-of-the-art methods for drug–target interaction prediction. In addition, case studies on Quetiapine, Clozapine, Olanzapine, Aripiprazole, and Ziprasidone demonstrate the ability of DTIGBDT to discover potential drug–target interactions. Frontiers Media S.A. 2019-05-31 /pmc/articles/PMC6555260/ /pubmed/31214240 http://dx.doi.org/10.3389/fgene.2019.00459 Text en Copyright © 2019 Xuan, Sun, Zhang, Ye, Shen and Dong. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Xuan, Ping Sun, Chang Zhang, Tiangang Ye, Yilin Shen, Tonghui Dong, Yihua Gradient Boosting Decision Tree-Based Method for Predicting Interactions Between Target Genes and Drugs |
title | Gradient Boosting Decision Tree-Based Method for Predicting Interactions Between Target Genes and Drugs |
title_full | Gradient Boosting Decision Tree-Based Method for Predicting Interactions Between Target Genes and Drugs |
title_fullStr | Gradient Boosting Decision Tree-Based Method for Predicting Interactions Between Target Genes and Drugs |
title_full_unstemmed | Gradient Boosting Decision Tree-Based Method for Predicting Interactions Between Target Genes and Drugs |
title_short | Gradient Boosting Decision Tree-Based Method for Predicting Interactions Between Target Genes and Drugs |
title_sort | gradient boosting decision tree-based method for predicting interactions between target genes and drugs |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6555260/ https://www.ncbi.nlm.nih.gov/pubmed/31214240 http://dx.doi.org/10.3389/fgene.2019.00459 |
work_keys_str_mv | AT xuanping gradientboostingdecisiontreebasedmethodforpredictinginteractionsbetweentargetgenesanddrugs AT sunchang gradientboostingdecisiontreebasedmethodforpredictinginteractionsbetweentargetgenesanddrugs AT zhangtiangang gradientboostingdecisiontreebasedmethodforpredictinginteractionsbetweentargetgenesanddrugs AT yeyilin gradientboostingdecisiontreebasedmethodforpredictinginteractionsbetweentargetgenesanddrugs AT shentonghui gradientboostingdecisiontreebasedmethodforpredictinginteractionsbetweentargetgenesanddrugs AT dongyihua gradientboostingdecisiontreebasedmethodforpredictinginteractionsbetweentargetgenesanddrugs |