Cargando…

Predicting existing targets for new drugs base on strategies for missing interactions

BACKGROUND: There has been paid more and more attention to supervised classification models in the area of predicting drug-target interactions (DTIs). However, in terms of classification, unavoidable missing DTIs in data would cause three issues which have not yet been addressed appropriately by for...

Descripción completa

Detalles Bibliográficos
Autores principales:	Shi, Jian-Yu, Li, Jia-Xin, Lu, Hui-Meng
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2016
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5009565/ https://www.ncbi.nlm.nih.gov/pubmed/27585458 http://dx.doi.org/10.1186/s12859-016-1118-2

_version_	1782451537397678080
author	Shi, Jian-Yu Li, Jia-Xin Lu, Hui-Meng
author_facet	Shi, Jian-Yu Li, Jia-Xin Lu, Hui-Meng
author_sort	Shi, Jian-Yu
collection	PubMed
description	BACKGROUND: There has been paid more and more attention to supervised classification models in the area of predicting drug-target interactions (DTIs). However, in terms of classification, unavoidable missing DTIs in data would cause three issues which have not yet been addressed appropriately by former approaches. Directly labeled as negatives (non-DTIs), missing DTIs increase the confusion of positives (DTIs) and negatives, aggravate the imbalance between few positives and many negatives, and are usually discriminated as highly-scored false positives, which influence the existing measures sharply. RESULTS: Under the framework of local classification model (LCM), this work focuses on the scenario of predicting how possibly a new drug interacts with known targets. To address the first two issues, two strategies, Spy and Super-target, are introduced accordingly and further integrated to form a two-layer LCM. In the bottom layer, Spy-based local classifiers for protein targets are built by positives, as well as reliable negatives identified among unlabeled drug-target pairs. In the top layer, regular local classifiers specific to super-targets are built with more positives generated by grouping similar targets and their interactions. Furthermore, to handle the third issue, an additional performance measure, Coverage, is presented for assessing DTI prediction. The experiments based on benchmark datasets are finally performed under five-fold cross validation of drugs to evaluate this approach. The main findings are concluded as follows. (1) Both two individual strategies and their combination are effective to missing DTIs, and the combination wins the best. (2) Having the advantages of less confusing decision boundary at the bottom layer and less biased decision boundary at the top layer, our two-layer LCM outperforms two former approaches. (3) Coverage is more robust to missing interactions than other measures and is able to evaluate how far one needs to go down the list of targets to cover all the proper targets of a drug. CONCLUSIONS: Proposing two strategies and one performance measure, this work has addressed the issues derived from missing interactions, which cause confusing and biased decision boundaries in classifiers, as well as the inappropriate measure of predicting performance, in the scenario of predicting interactions between new drugs and known targets.
format	Online Article Text
id	pubmed-5009565
institution	National Center for Biotechnology Information
language	English
publishDate	2016
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-50095652016-09-08 Predicting existing targets for new drugs base on strategies for missing interactions Shi, Jian-Yu Li, Jia-Xin Lu, Hui-Meng BMC Bioinformatics Research BACKGROUND: There has been paid more and more attention to supervised classification models in the area of predicting drug-target interactions (DTIs). However, in terms of classification, unavoidable missing DTIs in data would cause three issues which have not yet been addressed appropriately by former approaches. Directly labeled as negatives (non-DTIs), missing DTIs increase the confusion of positives (DTIs) and negatives, aggravate the imbalance between few positives and many negatives, and are usually discriminated as highly-scored false positives, which influence the existing measures sharply. RESULTS: Under the framework of local classification model (LCM), this work focuses on the scenario of predicting how possibly a new drug interacts with known targets. To address the first two issues, two strategies, Spy and Super-target, are introduced accordingly and further integrated to form a two-layer LCM. In the bottom layer, Spy-based local classifiers for protein targets are built by positives, as well as reliable negatives identified among unlabeled drug-target pairs. In the top layer, regular local classifiers specific to super-targets are built with more positives generated by grouping similar targets and their interactions. Furthermore, to handle the third issue, an additional performance measure, Coverage, is presented for assessing DTI prediction. The experiments based on benchmark datasets are finally performed under five-fold cross validation of drugs to evaluate this approach. The main findings are concluded as follows. (1) Both two individual strategies and their combination are effective to missing DTIs, and the combination wins the best. (2) Having the advantages of less confusing decision boundary at the bottom layer and less biased decision boundary at the top layer, our two-layer LCM outperforms two former approaches. (3) Coverage is more robust to missing interactions than other measures and is able to evaluate how far one needs to go down the list of targets to cover all the proper targets of a drug. CONCLUSIONS: Proposing two strategies and one performance measure, this work has addressed the issues derived from missing interactions, which cause confusing and biased decision boundaries in classifiers, as well as the inappropriate measure of predicting performance, in the scenario of predicting interactions between new drugs and known targets. BioMed Central 2016-08-31 /pmc/articles/PMC5009565/ /pubmed/27585458 http://dx.doi.org/10.1186/s12859-016-1118-2 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Shi, Jian-Yu Li, Jia-Xin Lu, Hui-Meng Predicting existing targets for new drugs base on strategies for missing interactions
title	Predicting existing targets for new drugs base on strategies for missing interactions
title_full	Predicting existing targets for new drugs base on strategies for missing interactions
title_fullStr	Predicting existing targets for new drugs base on strategies for missing interactions
title_full_unstemmed	Predicting existing targets for new drugs base on strategies for missing interactions
title_short	Predicting existing targets for new drugs base on strategies for missing interactions
title_sort	predicting existing targets for new drugs base on strategies for missing interactions
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5009565/ https://www.ncbi.nlm.nih.gov/pubmed/27585458 http://dx.doi.org/10.1186/s12859-016-1118-2
work_keys_str_mv	AT shijianyu predictingexistingtargetsfornewdrugsbaseonstrategiesformissinginteractions AT lijiaxin predictingexistingtargetsfornewdrugsbaseonstrategiesformissinginteractions AT luhuimeng predictingexistingtargetsfornewdrugsbaseonstrategiesformissinginteractions

Predicting existing targets for new drugs base on strategies for missing interactions

Ejemplares similares