Cargando…

Predicting existing targets for new drugs base on strategies for missing interactions

BACKGROUND: There has been paid more and more attention to supervised classification models in the area of predicting drug-target interactions (DTIs). However, in terms of classification, unavoidable missing DTIs in data would cause three issues which have not yet been addressed appropriately by for...

Descripción completa

Detalles Bibliográficos
Autores principales: Shi, Jian-Yu, Li, Jia-Xin, Lu, Hui-Meng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5009565/
https://www.ncbi.nlm.nih.gov/pubmed/27585458
http://dx.doi.org/10.1186/s12859-016-1118-2
_version_ 1782451537397678080
author Shi, Jian-Yu
Li, Jia-Xin
Lu, Hui-Meng
author_facet Shi, Jian-Yu
Li, Jia-Xin
Lu, Hui-Meng
author_sort Shi, Jian-Yu
collection PubMed
description BACKGROUND: There has been paid more and more attention to supervised classification models in the area of predicting drug-target interactions (DTIs). However, in terms of classification, unavoidable missing DTIs in data would cause three issues which have not yet been addressed appropriately by former approaches. Directly labeled as negatives (non-DTIs), missing DTIs increase the confusion of positives (DTIs) and negatives, aggravate the imbalance between few positives and many negatives, and are usually discriminated as highly-scored false positives, which influence the existing measures sharply. RESULTS: Under the framework of local classification model (LCM), this work focuses on the scenario of predicting how possibly a new drug interacts with known targets. To address the first two issues, two strategies, Spy and Super-target, are introduced accordingly and further integrated to form a two-layer LCM. In the bottom layer, Spy-based local classifiers for protein targets are built by positives, as well as reliable negatives identified among unlabeled drug-target pairs. In the top layer, regular local classifiers specific to super-targets are built with more positives generated by grouping similar targets and their interactions. Furthermore, to handle the third issue, an additional performance measure, Coverage, is presented for assessing DTI prediction. The experiments based on benchmark datasets are finally performed under five-fold cross validation of drugs to evaluate this approach. The main findings are concluded as follows. (1) Both two individual strategies and their combination are effective to missing DTIs, and the combination wins the best. (2) Having the advantages of less confusing decision boundary at the bottom layer and less biased decision boundary at the top layer, our two-layer LCM outperforms two former approaches. (3) Coverage is more robust to missing interactions than other measures and is able to evaluate how far one needs to go down the list of targets to cover all the proper targets of a drug. CONCLUSIONS: Proposing two strategies and one performance measure, this work has addressed the issues derived from missing interactions, which cause confusing and biased decision boundaries in classifiers, as well as the inappropriate measure of predicting performance, in the scenario of predicting interactions between new drugs and known targets.
format Online
Article
Text
id pubmed-5009565
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-50095652016-09-08 Predicting existing targets for new drugs base on strategies for missing interactions Shi, Jian-Yu Li, Jia-Xin Lu, Hui-Meng BMC Bioinformatics Research BACKGROUND: There has been paid more and more attention to supervised classification models in the area of predicting drug-target interactions (DTIs). However, in terms of classification, unavoidable missing DTIs in data would cause three issues which have not yet been addressed appropriately by former approaches. Directly labeled as negatives (non-DTIs), missing DTIs increase the confusion of positives (DTIs) and negatives, aggravate the imbalance between few positives and many negatives, and are usually discriminated as highly-scored false positives, which influence the existing measures sharply. RESULTS: Under the framework of local classification model (LCM), this work focuses on the scenario of predicting how possibly a new drug interacts with known targets. To address the first two issues, two strategies, Spy and Super-target, are introduced accordingly and further integrated to form a two-layer LCM. In the bottom layer, Spy-based local classifiers for protein targets are built by positives, as well as reliable negatives identified among unlabeled drug-target pairs. In the top layer, regular local classifiers specific to super-targets are built with more positives generated by grouping similar targets and their interactions. Furthermore, to handle the third issue, an additional performance measure, Coverage, is presented for assessing DTI prediction. The experiments based on benchmark datasets are finally performed under five-fold cross validation of drugs to evaluate this approach. The main findings are concluded as follows. (1) Both two individual strategies and their combination are effective to missing DTIs, and the combination wins the best. (2) Having the advantages of less confusing decision boundary at the bottom layer and less biased decision boundary at the top layer, our two-layer LCM outperforms two former approaches. (3) Coverage is more robust to missing interactions than other measures and is able to evaluate how far one needs to go down the list of targets to cover all the proper targets of a drug. CONCLUSIONS: Proposing two strategies and one performance measure, this work has addressed the issues derived from missing interactions, which cause confusing and biased decision boundaries in classifiers, as well as the inappropriate measure of predicting performance, in the scenario of predicting interactions between new drugs and known targets. BioMed Central 2016-08-31 /pmc/articles/PMC5009565/ /pubmed/27585458 http://dx.doi.org/10.1186/s12859-016-1118-2 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Shi, Jian-Yu
Li, Jia-Xin
Lu, Hui-Meng
Predicting existing targets for new drugs base on strategies for missing interactions
title Predicting existing targets for new drugs base on strategies for missing interactions
title_full Predicting existing targets for new drugs base on strategies for missing interactions
title_fullStr Predicting existing targets for new drugs base on strategies for missing interactions
title_full_unstemmed Predicting existing targets for new drugs base on strategies for missing interactions
title_short Predicting existing targets for new drugs base on strategies for missing interactions
title_sort predicting existing targets for new drugs base on strategies for missing interactions
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5009565/
https://www.ncbi.nlm.nih.gov/pubmed/27585458
http://dx.doi.org/10.1186/s12859-016-1118-2
work_keys_str_mv AT shijianyu predictingexistingtargetsfornewdrugsbaseonstrategiesformissinginteractions
AT lijiaxin predictingexistingtargetsfornewdrugsbaseonstrategiesformissinginteractions
AT luhuimeng predictingexistingtargetsfornewdrugsbaseonstrategiesformissinginteractions