Cargando…
Adaptive diffusion kernel learning from biological networks for protein function prediction
BACKGROUND: Machine-learning tools have gained considerable attention during the last few years for analyzing biological networks for protein function prediction. Kernel methods are suitable for learning from graph-based data such as biological networks, as they only require the abstraction of the s...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2409449/ https://www.ncbi.nlm.nih.gov/pubmed/18366736 http://dx.doi.org/10.1186/1471-2105-9-162 |
_version_ | 1782155767476912128 |
---|---|
author | Sun, Liang Ji, Shuiwang Ye, Jieping |
author_facet | Sun, Liang Ji, Shuiwang Ye, Jieping |
author_sort | Sun, Liang |
collection | PubMed |
description | BACKGROUND: Machine-learning tools have gained considerable attention during the last few years for analyzing biological networks for protein function prediction. Kernel methods are suitable for learning from graph-based data such as biological networks, as they only require the abstraction of the similarities between objects into the kernel matrix. One key issue in kernel methods is the selection of a good kernel function. Diffusion kernels, the discretization of the familiar Gaussian kernel of Euclidean space, are commonly used for graph-based data. RESULTS: In this paper, we address the issue of learning an optimal diffusion kernel, in the form of a convex combination of a set of pre-specified kernels constructed from biological networks, for protein function prediction. Most prior work on this kernel learning task focus on variants of the loss function based on Support Vector Machines (SVM). Their extensions to other loss functions such as the one based on Kullback-Leibler (KL) divergence, which is more suitable for mining biological networks, lead to expensive optimization problems. By exploiting the special structure of the diffusion kernel, we show that this KL divergence based kernel learning problem can be formulated as a simple optimization problem, which can then be solved efficiently. It is further extended to the multi-task case where we predict multiple functions of a protein simultaneously. We evaluate the efficiency and effectiveness of the proposed algorithms using two benchmark data sets. CONCLUSION: Results show that the performance of linearly combined diffusion kernel is better than every single candidate diffusion kernel. When the number of tasks is large, the algorithms based on multiple tasks are favored due to their competitive recognition performance and small computational costs. |
format | Text |
id | pubmed-2409449 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-24094492008-06-04 Adaptive diffusion kernel learning from biological networks for protein function prediction Sun, Liang Ji, Shuiwang Ye, Jieping BMC Bioinformatics Research Article BACKGROUND: Machine-learning tools have gained considerable attention during the last few years for analyzing biological networks for protein function prediction. Kernel methods are suitable for learning from graph-based data such as biological networks, as they only require the abstraction of the similarities between objects into the kernel matrix. One key issue in kernel methods is the selection of a good kernel function. Diffusion kernels, the discretization of the familiar Gaussian kernel of Euclidean space, are commonly used for graph-based data. RESULTS: In this paper, we address the issue of learning an optimal diffusion kernel, in the form of a convex combination of a set of pre-specified kernels constructed from biological networks, for protein function prediction. Most prior work on this kernel learning task focus on variants of the loss function based on Support Vector Machines (SVM). Their extensions to other loss functions such as the one based on Kullback-Leibler (KL) divergence, which is more suitable for mining biological networks, lead to expensive optimization problems. By exploiting the special structure of the diffusion kernel, we show that this KL divergence based kernel learning problem can be formulated as a simple optimization problem, which can then be solved efficiently. It is further extended to the multi-task case where we predict multiple functions of a protein simultaneously. We evaluate the efficiency and effectiveness of the proposed algorithms using two benchmark data sets. CONCLUSION: Results show that the performance of linearly combined diffusion kernel is better than every single candidate diffusion kernel. When the number of tasks is large, the algorithms based on multiple tasks are favored due to their competitive recognition performance and small computational costs. BioMed Central 2008-03-25 /pmc/articles/PMC2409449/ /pubmed/18366736 http://dx.doi.org/10.1186/1471-2105-9-162 Text en Copyright © 2008 Sun et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Sun, Liang Ji, Shuiwang Ye, Jieping Adaptive diffusion kernel learning from biological networks for protein function prediction |
title | Adaptive diffusion kernel learning from biological networks for protein function prediction |
title_full | Adaptive diffusion kernel learning from biological networks for protein function prediction |
title_fullStr | Adaptive diffusion kernel learning from biological networks for protein function prediction |
title_full_unstemmed | Adaptive diffusion kernel learning from biological networks for protein function prediction |
title_short | Adaptive diffusion kernel learning from biological networks for protein function prediction |
title_sort | adaptive diffusion kernel learning from biological networks for protein function prediction |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2409449/ https://www.ncbi.nlm.nih.gov/pubmed/18366736 http://dx.doi.org/10.1186/1471-2105-9-162 |
work_keys_str_mv | AT sunliang adaptivediffusionkernellearningfrombiologicalnetworksforproteinfunctionprediction AT jishuiwang adaptivediffusionkernellearningfrombiologicalnetworksforproteinfunctionprediction AT yejieping adaptivediffusionkernellearningfrombiologicalnetworksforproteinfunctionprediction |