Cargando…
Multiple graph regularized protein domain ranking
BACKGROUND: Protein domain ranking is a fundamental task in structural biology. Most protein domain ranking methods rely on the pairwise comparison of protein domains while neglecting the global manifold structure of the protein domain database. Recently, graph regularized ranking that exploits the...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3583823/ https://www.ncbi.nlm.nih.gov/pubmed/23157331 http://dx.doi.org/10.1186/1471-2105-13-307 |
_version_ | 1782475487361105920 |
---|---|
author | Wang, Jim Jing-Yan Bensmail, Halima Gao, Xin |
author_facet | Wang, Jim Jing-Yan Bensmail, Halima Gao, Xin |
author_sort | Wang, Jim Jing-Yan |
collection | PubMed |
description | BACKGROUND: Protein domain ranking is a fundamental task in structural biology. Most protein domain ranking methods rely on the pairwise comparison of protein domains while neglecting the global manifold structure of the protein domain database. Recently, graph regularized ranking that exploits the global structure of the graph defined by the pairwise similarities has been proposed. However, the existing graph regularized ranking methods are very sensitive to the choice of the graph model and parameters, and this remains a difficult problem for most of the protein domain ranking methods. RESULTS: To tackle this problem, we have developed the Multiple Graph regularized Ranking algorithm, MultiG-Rank. Instead of using a single graph to regularize the ranking scores, MultiG-Rank approximates the intrinsic manifold of protein domain distribution by combining multiple initial graphs for the regularization. Graph weights are learned with ranking scores jointly and automatically, by alternately minimizing an objective function in an iterative algorithm. Experimental results on a subset of the ASTRAL SCOP protein domain database demonstrate that MultiG-Rank achieves a better ranking performance than single graph regularized ranking methods and pairwise similarity based ranking methods. CONCLUSION: The problem of graph model and parameter selection in graph regularized protein domain ranking can be solved effectively by combining multiple graphs. This aspect of generalization introduces a new frontier in applying multiple graphs to solving protein domain ranking applications. |
format | Online Article Text |
id | pubmed-3583823 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-35838232013-03-08 Multiple graph regularized protein domain ranking Wang, Jim Jing-Yan Bensmail, Halima Gao, Xin BMC Bioinformatics Methodology Article BACKGROUND: Protein domain ranking is a fundamental task in structural biology. Most protein domain ranking methods rely on the pairwise comparison of protein domains while neglecting the global manifold structure of the protein domain database. Recently, graph regularized ranking that exploits the global structure of the graph defined by the pairwise similarities has been proposed. However, the existing graph regularized ranking methods are very sensitive to the choice of the graph model and parameters, and this remains a difficult problem for most of the protein domain ranking methods. RESULTS: To tackle this problem, we have developed the Multiple Graph regularized Ranking algorithm, MultiG-Rank. Instead of using a single graph to regularize the ranking scores, MultiG-Rank approximates the intrinsic manifold of protein domain distribution by combining multiple initial graphs for the regularization. Graph weights are learned with ranking scores jointly and automatically, by alternately minimizing an objective function in an iterative algorithm. Experimental results on a subset of the ASTRAL SCOP protein domain database demonstrate that MultiG-Rank achieves a better ranking performance than single graph regularized ranking methods and pairwise similarity based ranking methods. CONCLUSION: The problem of graph model and parameter selection in graph regularized protein domain ranking can be solved effectively by combining multiple graphs. This aspect of generalization introduces a new frontier in applying multiple graphs to solving protein domain ranking applications. BioMed Central 2012-11-19 /pmc/articles/PMC3583823/ /pubmed/23157331 http://dx.doi.org/10.1186/1471-2105-13-307 Text en Copyright ©2012 Wang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Wang, Jim Jing-Yan Bensmail, Halima Gao, Xin Multiple graph regularized protein domain ranking |
title | Multiple graph regularized protein domain ranking |
title_full | Multiple graph regularized protein domain ranking |
title_fullStr | Multiple graph regularized protein domain ranking |
title_full_unstemmed | Multiple graph regularized protein domain ranking |
title_short | Multiple graph regularized protein domain ranking |
title_sort | multiple graph regularized protein domain ranking |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3583823/ https://www.ncbi.nlm.nih.gov/pubmed/23157331 http://dx.doi.org/10.1186/1471-2105-13-307 |
work_keys_str_mv | AT wangjimjingyan multiplegraphregularizedproteindomainranking AT bensmailhalima multiplegraphregularizedproteindomainranking AT gaoxin multiplegraphregularizedproteindomainranking |