Cargando…

DDI-PULearn: a positive-unlabeled learning method for large-scale prediction of drug-drug interactions

BACKGROUND: Drug-drug interactions (DDIs) are a major concern in patients’ medication. It’s unfeasible to identify all potential DDIs using experimental methods which are time-consuming and expensive. Computational methods provide an effective strategy, however, facing challenges due to the lack of...

Descripción completa

Detalles Bibliográficos
Autores principales: Zheng, Yi, Peng, Hui, Zhang, Xiaocai, Zhao, Zhixun, Gao, Xiaoying, Li, Jinyan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6929327/
https://www.ncbi.nlm.nih.gov/pubmed/31870276
http://dx.doi.org/10.1186/s12859-019-3214-6
_version_ 1783482678379544576
author Zheng, Yi
Peng, Hui
Zhang, Xiaocai
Zhao, Zhixun
Gao, Xiaoying
Li, Jinyan
author_facet Zheng, Yi
Peng, Hui
Zhang, Xiaocai
Zhao, Zhixun
Gao, Xiaoying
Li, Jinyan
author_sort Zheng, Yi
collection PubMed
description BACKGROUND: Drug-drug interactions (DDIs) are a major concern in patients’ medication. It’s unfeasible to identify all potential DDIs using experimental methods which are time-consuming and expensive. Computational methods provide an effective strategy, however, facing challenges due to the lack of experimentally verified negative samples. RESULTS: To address this problem, we propose a novel positive-unlabeled learning method named DDI-PULearn for large-scale drug-drug-interaction predictions. DDI-PULearn first generates seeds of reliable negatives via OCSVM (one-class support vector machine) under a high-recall constraint and via the cosine-similarity based KNN (k-nearest neighbors) as well. Then trained with all the labeled positives (i.e., the validated DDIs) and the generated seed negatives, DDI-PULearn employs an iterative SVM to identify a set of entire reliable negatives from the unlabeled samples (i.e., the unobserved DDIs). Following that, DDI-PULearn represents all the labeled positives and the identified negatives as vectors of abundant drug properties by a similarity-based method. Finally, DDI-PULearn transforms these vectors into a lower-dimensional space via PCA (principal component analysis) and utilizes the compressed vectors as input for binary classifications. The performance of DDI-PULearn is evaluated on simulative prediction for 149,878 possible interactions between 548 drugs, comparing with two baseline methods and five state-of-the-art methods. Related experiment results show that the proposed method for the representation of DDIs characterizes them accurately. DDI-PULearn achieves superior performance owing to the identified reliable negatives, outperforming all other methods significantly. In addition, the predicted novel DDIs suggest that DDI-PULearn is capable to identify novel DDIs. CONCLUSIONS: The results demonstrate that positive-unlabeled learning paves a new way to tackle the problem caused by the lack of experimentally verified negatives in the computational prediction of DDIs.
format Online
Article
Text
id pubmed-6929327
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-69293272019-12-30 DDI-PULearn: a positive-unlabeled learning method for large-scale prediction of drug-drug interactions Zheng, Yi Peng, Hui Zhang, Xiaocai Zhao, Zhixun Gao, Xiaoying Li, Jinyan BMC Bioinformatics Research BACKGROUND: Drug-drug interactions (DDIs) are a major concern in patients’ medication. It’s unfeasible to identify all potential DDIs using experimental methods which are time-consuming and expensive. Computational methods provide an effective strategy, however, facing challenges due to the lack of experimentally verified negative samples. RESULTS: To address this problem, we propose a novel positive-unlabeled learning method named DDI-PULearn for large-scale drug-drug-interaction predictions. DDI-PULearn first generates seeds of reliable negatives via OCSVM (one-class support vector machine) under a high-recall constraint and via the cosine-similarity based KNN (k-nearest neighbors) as well. Then trained with all the labeled positives (i.e., the validated DDIs) and the generated seed negatives, DDI-PULearn employs an iterative SVM to identify a set of entire reliable negatives from the unlabeled samples (i.e., the unobserved DDIs). Following that, DDI-PULearn represents all the labeled positives and the identified negatives as vectors of abundant drug properties by a similarity-based method. Finally, DDI-PULearn transforms these vectors into a lower-dimensional space via PCA (principal component analysis) and utilizes the compressed vectors as input for binary classifications. The performance of DDI-PULearn is evaluated on simulative prediction for 149,878 possible interactions between 548 drugs, comparing with two baseline methods and five state-of-the-art methods. Related experiment results show that the proposed method for the representation of DDIs characterizes them accurately. DDI-PULearn achieves superior performance owing to the identified reliable negatives, outperforming all other methods significantly. In addition, the predicted novel DDIs suggest that DDI-PULearn is capable to identify novel DDIs. CONCLUSIONS: The results demonstrate that positive-unlabeled learning paves a new way to tackle the problem caused by the lack of experimentally verified negatives in the computational prediction of DDIs. BioMed Central 2019-12-24 /pmc/articles/PMC6929327/ /pubmed/31870276 http://dx.doi.org/10.1186/s12859-019-3214-6 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Zheng, Yi
Peng, Hui
Zhang, Xiaocai
Zhao, Zhixun
Gao, Xiaoying
Li, Jinyan
DDI-PULearn: a positive-unlabeled learning method for large-scale prediction of drug-drug interactions
title DDI-PULearn: a positive-unlabeled learning method for large-scale prediction of drug-drug interactions
title_full DDI-PULearn: a positive-unlabeled learning method for large-scale prediction of drug-drug interactions
title_fullStr DDI-PULearn: a positive-unlabeled learning method for large-scale prediction of drug-drug interactions
title_full_unstemmed DDI-PULearn: a positive-unlabeled learning method for large-scale prediction of drug-drug interactions
title_short DDI-PULearn: a positive-unlabeled learning method for large-scale prediction of drug-drug interactions
title_sort ddi-pulearn: a positive-unlabeled learning method for large-scale prediction of drug-drug interactions
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6929327/
https://www.ncbi.nlm.nih.gov/pubmed/31870276
http://dx.doi.org/10.1186/s12859-019-3214-6
work_keys_str_mv AT zhengyi ddipulearnapositiveunlabeledlearningmethodforlargescalepredictionofdrugdruginteractions
AT penghui ddipulearnapositiveunlabeledlearningmethodforlargescalepredictionofdrugdruginteractions
AT zhangxiaocai ddipulearnapositiveunlabeledlearningmethodforlargescalepredictionofdrugdruginteractions
AT zhaozhixun ddipulearnapositiveunlabeledlearningmethodforlargescalepredictionofdrugdruginteractions
AT gaoxiaoying ddipulearnapositiveunlabeledlearningmethodforlargescalepredictionofdrugdruginteractions
AT lijinyan ddipulearnapositiveunlabeledlearningmethodforlargescalepredictionofdrugdruginteractions