Cargando…
DDI-PULearn: a positive-unlabeled learning method for large-scale prediction of drug-drug interactions
BACKGROUND: Drug-drug interactions (DDIs) are a major concern in patients’ medication. It’s unfeasible to identify all potential DDIs using experimental methods which are time-consuming and expensive. Computational methods provide an effective strategy, however, facing challenges due to the lack of...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6929327/ https://www.ncbi.nlm.nih.gov/pubmed/31870276 http://dx.doi.org/10.1186/s12859-019-3214-6 |
_version_ | 1783482678379544576 |
---|---|
author | Zheng, Yi Peng, Hui Zhang, Xiaocai Zhao, Zhixun Gao, Xiaoying Li, Jinyan |
author_facet | Zheng, Yi Peng, Hui Zhang, Xiaocai Zhao, Zhixun Gao, Xiaoying Li, Jinyan |
author_sort | Zheng, Yi |
collection | PubMed |
description | BACKGROUND: Drug-drug interactions (DDIs) are a major concern in patients’ medication. It’s unfeasible to identify all potential DDIs using experimental methods which are time-consuming and expensive. Computational methods provide an effective strategy, however, facing challenges due to the lack of experimentally verified negative samples. RESULTS: To address this problem, we propose a novel positive-unlabeled learning method named DDI-PULearn for large-scale drug-drug-interaction predictions. DDI-PULearn first generates seeds of reliable negatives via OCSVM (one-class support vector machine) under a high-recall constraint and via the cosine-similarity based KNN (k-nearest neighbors) as well. Then trained with all the labeled positives (i.e., the validated DDIs) and the generated seed negatives, DDI-PULearn employs an iterative SVM to identify a set of entire reliable negatives from the unlabeled samples (i.e., the unobserved DDIs). Following that, DDI-PULearn represents all the labeled positives and the identified negatives as vectors of abundant drug properties by a similarity-based method. Finally, DDI-PULearn transforms these vectors into a lower-dimensional space via PCA (principal component analysis) and utilizes the compressed vectors as input for binary classifications. The performance of DDI-PULearn is evaluated on simulative prediction for 149,878 possible interactions between 548 drugs, comparing with two baseline methods and five state-of-the-art methods. Related experiment results show that the proposed method for the representation of DDIs characterizes them accurately. DDI-PULearn achieves superior performance owing to the identified reliable negatives, outperforming all other methods significantly. In addition, the predicted novel DDIs suggest that DDI-PULearn is capable to identify novel DDIs. CONCLUSIONS: The results demonstrate that positive-unlabeled learning paves a new way to tackle the problem caused by the lack of experimentally verified negatives in the computational prediction of DDIs. |
format | Online Article Text |
id | pubmed-6929327 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-69293272019-12-30 DDI-PULearn: a positive-unlabeled learning method for large-scale prediction of drug-drug interactions Zheng, Yi Peng, Hui Zhang, Xiaocai Zhao, Zhixun Gao, Xiaoying Li, Jinyan BMC Bioinformatics Research BACKGROUND: Drug-drug interactions (DDIs) are a major concern in patients’ medication. It’s unfeasible to identify all potential DDIs using experimental methods which are time-consuming and expensive. Computational methods provide an effective strategy, however, facing challenges due to the lack of experimentally verified negative samples. RESULTS: To address this problem, we propose a novel positive-unlabeled learning method named DDI-PULearn for large-scale drug-drug-interaction predictions. DDI-PULearn first generates seeds of reliable negatives via OCSVM (one-class support vector machine) under a high-recall constraint and via the cosine-similarity based KNN (k-nearest neighbors) as well. Then trained with all the labeled positives (i.e., the validated DDIs) and the generated seed negatives, DDI-PULearn employs an iterative SVM to identify a set of entire reliable negatives from the unlabeled samples (i.e., the unobserved DDIs). Following that, DDI-PULearn represents all the labeled positives and the identified negatives as vectors of abundant drug properties by a similarity-based method. Finally, DDI-PULearn transforms these vectors into a lower-dimensional space via PCA (principal component analysis) and utilizes the compressed vectors as input for binary classifications. The performance of DDI-PULearn is evaluated on simulative prediction for 149,878 possible interactions between 548 drugs, comparing with two baseline methods and five state-of-the-art methods. Related experiment results show that the proposed method for the representation of DDIs characterizes them accurately. DDI-PULearn achieves superior performance owing to the identified reliable negatives, outperforming all other methods significantly. In addition, the predicted novel DDIs suggest that DDI-PULearn is capable to identify novel DDIs. CONCLUSIONS: The results demonstrate that positive-unlabeled learning paves a new way to tackle the problem caused by the lack of experimentally verified negatives in the computational prediction of DDIs. BioMed Central 2019-12-24 /pmc/articles/PMC6929327/ /pubmed/31870276 http://dx.doi.org/10.1186/s12859-019-3214-6 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Zheng, Yi Peng, Hui Zhang, Xiaocai Zhao, Zhixun Gao, Xiaoying Li, Jinyan DDI-PULearn: a positive-unlabeled learning method for large-scale prediction of drug-drug interactions |
title | DDI-PULearn: a positive-unlabeled learning method for large-scale prediction of drug-drug interactions |
title_full | DDI-PULearn: a positive-unlabeled learning method for large-scale prediction of drug-drug interactions |
title_fullStr | DDI-PULearn: a positive-unlabeled learning method for large-scale prediction of drug-drug interactions |
title_full_unstemmed | DDI-PULearn: a positive-unlabeled learning method for large-scale prediction of drug-drug interactions |
title_short | DDI-PULearn: a positive-unlabeled learning method for large-scale prediction of drug-drug interactions |
title_sort | ddi-pulearn: a positive-unlabeled learning method for large-scale prediction of drug-drug interactions |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6929327/ https://www.ncbi.nlm.nih.gov/pubmed/31870276 http://dx.doi.org/10.1186/s12859-019-3214-6 |
work_keys_str_mv | AT zhengyi ddipulearnapositiveunlabeledlearningmethodforlargescalepredictionofdrugdruginteractions AT penghui ddipulearnapositiveunlabeledlearningmethodforlargescalepredictionofdrugdruginteractions AT zhangxiaocai ddipulearnapositiveunlabeledlearningmethodforlargescalepredictionofdrugdruginteractions AT zhaozhixun ddipulearnapositiveunlabeledlearningmethodforlargescalepredictionofdrugdruginteractions AT gaoxiaoying ddipulearnapositiveunlabeledlearningmethodforlargescalepredictionofdrugdruginteractions AT lijinyan ddipulearnapositiveunlabeledlearningmethodforlargescalepredictionofdrugdruginteractions |