Cargando…
Enhancing protein-vitamin binding residues prediction by multiple heterogeneous subspace SVMs ensemble
BACKGROUND: Vitamins are typical ligands that play critical roles in various metabolic processes. The accurate identification of the vitamin-binding residues solely based on a protein sequence is of significant importance for the functional annotation of proteins, especially in the post-genomic era,...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4261549/ https://www.ncbi.nlm.nih.gov/pubmed/25189131 http://dx.doi.org/10.1186/1471-2105-15-297 |
_version_ | 1782348290506883072 |
---|---|
author | Yu, Dong-Jun Hu, Jun Yan, Hui Yang, Xi-Bei Yang, Jing-Yu Shen, Hong-Bin |
author_facet | Yu, Dong-Jun Hu, Jun Yan, Hui Yang, Xi-Bei Yang, Jing-Yu Shen, Hong-Bin |
author_sort | Yu, Dong-Jun |
collection | PubMed |
description | BACKGROUND: Vitamins are typical ligands that play critical roles in various metabolic processes. The accurate identification of the vitamin-binding residues solely based on a protein sequence is of significant importance for the functional annotation of proteins, especially in the post-genomic era, when large volumes of protein sequences are accumulating quickly without being functionally annotated. RESULTS: In this paper, a new predictor called TargetVita is designed and implemented for predicting protein-vitamin binding residues using protein sequences. In TargetVita, features derived from the position-specific scoring matrix (PSSM), predicted protein secondary structure, and vitamin binding propensity are combined to form the original feature space; then, several feature subspaces are selected by performing different feature selection methods. Finally, based on the selected feature subspaces, heterogeneous SVMs are trained and then ensembled for performing prediction. CONCLUSIONS: The experimental results obtained with four separate vitamin-binding benchmark datasets demonstrate that the proposed TargetVita is superior to the state-of-the-art vitamin-specific predictor, and an average improvement of 10% in terms of the Matthews correlation coefficient (MCC) was achieved over independent validation tests. The TargetVita web server and the datasets used are freely available for academic use at http://csbio.njust.edu.cn/bioinf/TargetVita or http://www.csbio.sjtu.edu.cn/bioinf/TargetVita. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2105-15-297) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4261549 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-42615492014-12-10 Enhancing protein-vitamin binding residues prediction by multiple heterogeneous subspace SVMs ensemble Yu, Dong-Jun Hu, Jun Yan, Hui Yang, Xi-Bei Yang, Jing-Yu Shen, Hong-Bin BMC Bioinformatics Research Article BACKGROUND: Vitamins are typical ligands that play critical roles in various metabolic processes. The accurate identification of the vitamin-binding residues solely based on a protein sequence is of significant importance for the functional annotation of proteins, especially in the post-genomic era, when large volumes of protein sequences are accumulating quickly without being functionally annotated. RESULTS: In this paper, a new predictor called TargetVita is designed and implemented for predicting protein-vitamin binding residues using protein sequences. In TargetVita, features derived from the position-specific scoring matrix (PSSM), predicted protein secondary structure, and vitamin binding propensity are combined to form the original feature space; then, several feature subspaces are selected by performing different feature selection methods. Finally, based on the selected feature subspaces, heterogeneous SVMs are trained and then ensembled for performing prediction. CONCLUSIONS: The experimental results obtained with four separate vitamin-binding benchmark datasets demonstrate that the proposed TargetVita is superior to the state-of-the-art vitamin-specific predictor, and an average improvement of 10% in terms of the Matthews correlation coefficient (MCC) was achieved over independent validation tests. The TargetVita web server and the datasets used are freely available for academic use at http://csbio.njust.edu.cn/bioinf/TargetVita or http://www.csbio.sjtu.edu.cn/bioinf/TargetVita. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2105-15-297) contains supplementary material, which is available to authorized users. BioMed Central 2014-09-05 /pmc/articles/PMC4261549/ /pubmed/25189131 http://dx.doi.org/10.1186/1471-2105-15-297 Text en © Yu et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Yu, Dong-Jun Hu, Jun Yan, Hui Yang, Xi-Bei Yang, Jing-Yu Shen, Hong-Bin Enhancing protein-vitamin binding residues prediction by multiple heterogeneous subspace SVMs ensemble |
title | Enhancing protein-vitamin binding residues prediction by multiple heterogeneous subspace SVMs ensemble |
title_full | Enhancing protein-vitamin binding residues prediction by multiple heterogeneous subspace SVMs ensemble |
title_fullStr | Enhancing protein-vitamin binding residues prediction by multiple heterogeneous subspace SVMs ensemble |
title_full_unstemmed | Enhancing protein-vitamin binding residues prediction by multiple heterogeneous subspace SVMs ensemble |
title_short | Enhancing protein-vitamin binding residues prediction by multiple heterogeneous subspace SVMs ensemble |
title_sort | enhancing protein-vitamin binding residues prediction by multiple heterogeneous subspace svms ensemble |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4261549/ https://www.ncbi.nlm.nih.gov/pubmed/25189131 http://dx.doi.org/10.1186/1471-2105-15-297 |
work_keys_str_mv | AT yudongjun enhancingproteinvitaminbindingresiduespredictionbymultipleheterogeneoussubspacesvmsensemble AT hujun enhancingproteinvitaminbindingresiduespredictionbymultipleheterogeneoussubspacesvmsensemble AT yanhui enhancingproteinvitaminbindingresiduespredictionbymultipleheterogeneoussubspacesvmsensemble AT yangxibei enhancingproteinvitaminbindingresiduespredictionbymultipleheterogeneoussubspacesvmsensemble AT yangjingyu enhancingproteinvitaminbindingresiduespredictionbymultipleheterogeneoussubspacesvmsensemble AT shenhongbin enhancingproteinvitaminbindingresiduespredictionbymultipleheterogeneoussubspacesvmsensemble |