Cargando…
Predicting existing targets for new drugs base on strategies for missing interactions
BACKGROUND: There has been paid more and more attention to supervised classification models in the area of predicting drug-target interactions (DTIs). However, in terms of classification, unavoidable missing DTIs in data would cause three issues which have not yet been addressed appropriately by for...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5009565/ https://www.ncbi.nlm.nih.gov/pubmed/27585458 http://dx.doi.org/10.1186/s12859-016-1118-2 |
_version_ | 1782451537397678080 |
---|---|
author | Shi, Jian-Yu Li, Jia-Xin Lu, Hui-Meng |
author_facet | Shi, Jian-Yu Li, Jia-Xin Lu, Hui-Meng |
author_sort | Shi, Jian-Yu |
collection | PubMed |
description | BACKGROUND: There has been paid more and more attention to supervised classification models in the area of predicting drug-target interactions (DTIs). However, in terms of classification, unavoidable missing DTIs in data would cause three issues which have not yet been addressed appropriately by former approaches. Directly labeled as negatives (non-DTIs), missing DTIs increase the confusion of positives (DTIs) and negatives, aggravate the imbalance between few positives and many negatives, and are usually discriminated as highly-scored false positives, which influence the existing measures sharply. RESULTS: Under the framework of local classification model (LCM), this work focuses on the scenario of predicting how possibly a new drug interacts with known targets. To address the first two issues, two strategies, Spy and Super-target, are introduced accordingly and further integrated to form a two-layer LCM. In the bottom layer, Spy-based local classifiers for protein targets are built by positives, as well as reliable negatives identified among unlabeled drug-target pairs. In the top layer, regular local classifiers specific to super-targets are built with more positives generated by grouping similar targets and their interactions. Furthermore, to handle the third issue, an additional performance measure, Coverage, is presented for assessing DTI prediction. The experiments based on benchmark datasets are finally performed under five-fold cross validation of drugs to evaluate this approach. The main findings are concluded as follows. (1) Both two individual strategies and their combination are effective to missing DTIs, and the combination wins the best. (2) Having the advantages of less confusing decision boundary at the bottom layer and less biased decision boundary at the top layer, our two-layer LCM outperforms two former approaches. (3) Coverage is more robust to missing interactions than other measures and is able to evaluate how far one needs to go down the list of targets to cover all the proper targets of a drug. CONCLUSIONS: Proposing two strategies and one performance measure, this work has addressed the issues derived from missing interactions, which cause confusing and biased decision boundaries in classifiers, as well as the inappropriate measure of predicting performance, in the scenario of predicting interactions between new drugs and known targets. |
format | Online Article Text |
id | pubmed-5009565 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-50095652016-09-08 Predicting existing targets for new drugs base on strategies for missing interactions Shi, Jian-Yu Li, Jia-Xin Lu, Hui-Meng BMC Bioinformatics Research BACKGROUND: There has been paid more and more attention to supervised classification models in the area of predicting drug-target interactions (DTIs). However, in terms of classification, unavoidable missing DTIs in data would cause three issues which have not yet been addressed appropriately by former approaches. Directly labeled as negatives (non-DTIs), missing DTIs increase the confusion of positives (DTIs) and negatives, aggravate the imbalance between few positives and many negatives, and are usually discriminated as highly-scored false positives, which influence the existing measures sharply. RESULTS: Under the framework of local classification model (LCM), this work focuses on the scenario of predicting how possibly a new drug interacts with known targets. To address the first two issues, two strategies, Spy and Super-target, are introduced accordingly and further integrated to form a two-layer LCM. In the bottom layer, Spy-based local classifiers for protein targets are built by positives, as well as reliable negatives identified among unlabeled drug-target pairs. In the top layer, regular local classifiers specific to super-targets are built with more positives generated by grouping similar targets and their interactions. Furthermore, to handle the third issue, an additional performance measure, Coverage, is presented for assessing DTI prediction. The experiments based on benchmark datasets are finally performed under five-fold cross validation of drugs to evaluate this approach. The main findings are concluded as follows. (1) Both two individual strategies and their combination are effective to missing DTIs, and the combination wins the best. (2) Having the advantages of less confusing decision boundary at the bottom layer and less biased decision boundary at the top layer, our two-layer LCM outperforms two former approaches. (3) Coverage is more robust to missing interactions than other measures and is able to evaluate how far one needs to go down the list of targets to cover all the proper targets of a drug. CONCLUSIONS: Proposing two strategies and one performance measure, this work has addressed the issues derived from missing interactions, which cause confusing and biased decision boundaries in classifiers, as well as the inappropriate measure of predicting performance, in the scenario of predicting interactions between new drugs and known targets. BioMed Central 2016-08-31 /pmc/articles/PMC5009565/ /pubmed/27585458 http://dx.doi.org/10.1186/s12859-016-1118-2 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Shi, Jian-Yu Li, Jia-Xin Lu, Hui-Meng Predicting existing targets for new drugs base on strategies for missing interactions |
title | Predicting existing targets for new drugs base on strategies for missing interactions |
title_full | Predicting existing targets for new drugs base on strategies for missing interactions |
title_fullStr | Predicting existing targets for new drugs base on strategies for missing interactions |
title_full_unstemmed | Predicting existing targets for new drugs base on strategies for missing interactions |
title_short | Predicting existing targets for new drugs base on strategies for missing interactions |
title_sort | predicting existing targets for new drugs base on strategies for missing interactions |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5009565/ https://www.ncbi.nlm.nih.gov/pubmed/27585458 http://dx.doi.org/10.1186/s12859-016-1118-2 |
work_keys_str_mv | AT shijianyu predictingexistingtargetsfornewdrugsbaseonstrategiesformissinginteractions AT lijiaxin predictingexistingtargetsfornewdrugsbaseonstrategiesformissinginteractions AT luhuimeng predictingexistingtargetsfornewdrugsbaseonstrategiesformissinginteractions |