Cargando…

Multiple graph regularized protein domain ranking

BACKGROUND: Protein domain ranking is a fundamental task in structural biology. Most protein domain ranking methods rely on the pairwise comparison of protein domains while neglecting the global manifold structure of the protein domain database. Recently, graph regularized ranking that exploits the...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Jim Jing-Yan, Bensmail, Halima, Gao, Xin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3583823/
https://www.ncbi.nlm.nih.gov/pubmed/23157331
http://dx.doi.org/10.1186/1471-2105-13-307
_version_ 1782475487361105920
author Wang, Jim Jing-Yan
Bensmail, Halima
Gao, Xin
author_facet Wang, Jim Jing-Yan
Bensmail, Halima
Gao, Xin
author_sort Wang, Jim Jing-Yan
collection PubMed
description BACKGROUND: Protein domain ranking is a fundamental task in structural biology. Most protein domain ranking methods rely on the pairwise comparison of protein domains while neglecting the global manifold structure of the protein domain database. Recently, graph regularized ranking that exploits the global structure of the graph defined by the pairwise similarities has been proposed. However, the existing graph regularized ranking methods are very sensitive to the choice of the graph model and parameters, and this remains a difficult problem for most of the protein domain ranking methods. RESULTS: To tackle this problem, we have developed the Multiple Graph regularized Ranking algorithm, MultiG-Rank. Instead of using a single graph to regularize the ranking scores, MultiG-Rank approximates the intrinsic manifold of protein domain distribution by combining multiple initial graphs for the regularization. Graph weights are learned with ranking scores jointly and automatically, by alternately minimizing an objective function in an iterative algorithm. Experimental results on a subset of the ASTRAL SCOP protein domain database demonstrate that MultiG-Rank achieves a better ranking performance than single graph regularized ranking methods and pairwise similarity based ranking methods. CONCLUSION: The problem of graph model and parameter selection in graph regularized protein domain ranking can be solved effectively by combining multiple graphs. This aspect of generalization introduces a new frontier in applying multiple graphs to solving protein domain ranking applications.
format Online
Article
Text
id pubmed-3583823
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35838232013-03-08 Multiple graph regularized protein domain ranking Wang, Jim Jing-Yan Bensmail, Halima Gao, Xin BMC Bioinformatics Methodology Article BACKGROUND: Protein domain ranking is a fundamental task in structural biology. Most protein domain ranking methods rely on the pairwise comparison of protein domains while neglecting the global manifold structure of the protein domain database. Recently, graph regularized ranking that exploits the global structure of the graph defined by the pairwise similarities has been proposed. However, the existing graph regularized ranking methods are very sensitive to the choice of the graph model and parameters, and this remains a difficult problem for most of the protein domain ranking methods. RESULTS: To tackle this problem, we have developed the Multiple Graph regularized Ranking algorithm, MultiG-Rank. Instead of using a single graph to regularize the ranking scores, MultiG-Rank approximates the intrinsic manifold of protein domain distribution by combining multiple initial graphs for the regularization. Graph weights are learned with ranking scores jointly and automatically, by alternately minimizing an objective function in an iterative algorithm. Experimental results on a subset of the ASTRAL SCOP protein domain database demonstrate that MultiG-Rank achieves a better ranking performance than single graph regularized ranking methods and pairwise similarity based ranking methods. CONCLUSION: The problem of graph model and parameter selection in graph regularized protein domain ranking can be solved effectively by combining multiple graphs. This aspect of generalization introduces a new frontier in applying multiple graphs to solving protein domain ranking applications. BioMed Central 2012-11-19 /pmc/articles/PMC3583823/ /pubmed/23157331 http://dx.doi.org/10.1186/1471-2105-13-307 Text en Copyright ©2012 Wang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Wang, Jim Jing-Yan
Bensmail, Halima
Gao, Xin
Multiple graph regularized protein domain ranking
title Multiple graph regularized protein domain ranking
title_full Multiple graph regularized protein domain ranking
title_fullStr Multiple graph regularized protein domain ranking
title_full_unstemmed Multiple graph regularized protein domain ranking
title_short Multiple graph regularized protein domain ranking
title_sort multiple graph regularized protein domain ranking
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3583823/
https://www.ncbi.nlm.nih.gov/pubmed/23157331
http://dx.doi.org/10.1186/1471-2105-13-307
work_keys_str_mv AT wangjimjingyan multiplegraphregularizedproteindomainranking
AT bensmailhalima multiplegraphregularizedproteindomainranking
AT gaoxin multiplegraphregularizedproteindomainranking