Cargando…

Predicting protein-ATP binding sites from primary sequence through fusing bi-profile sampling of multi-view features

BACKGROUND: Adenosine-5′-triphosphate (ATP) is one of multifunctional nucleotides and plays an important role in cell biology as a coenzyme interacting with proteins. Revealing the binding sites between protein and ATP is significantly important to understand the functionality of the proteins and th...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Ya-Nan, Yu, Dong-Jun, Li, Shu-Sen, Fan, Yong-Xian, Huang, Yan, Shen, Hong-Bin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3424114/
https://www.ncbi.nlm.nih.gov/pubmed/22651691
http://dx.doi.org/10.1186/1471-2105-13-118
_version_ 1782241177328680960
author Zhang, Ya-Nan
Yu, Dong-Jun
Li, Shu-Sen
Fan, Yong-Xian
Huang, Yan
Shen, Hong-Bin
author_facet Zhang, Ya-Nan
Yu, Dong-Jun
Li, Shu-Sen
Fan, Yong-Xian
Huang, Yan
Shen, Hong-Bin
author_sort Zhang, Ya-Nan
collection PubMed
description BACKGROUND: Adenosine-5′-triphosphate (ATP) is one of multifunctional nucleotides and plays an important role in cell biology as a coenzyme interacting with proteins. Revealing the binding sites between protein and ATP is significantly important to understand the functionality of the proteins and the mechanisms of protein-ATP complex. RESULTS: In this paper, we propose a novel framework for predicting the proteins’ functional residues, through which they can bind with ATP molecules. The new prediction protocol is achieved by combination of sequence evolutional information and bi-profile sampling of multi-view sequential features and the sequence derived structural features. The hypothesis for this strategy is single-view feature can only represent partial target’s knowledge and multiple sources of descriptors can be complementary. CONCLUSIONS: Prediction performances evaluated by both 5-fold and leave-one-out jackknife cross-validation tests on two benchmark datasets consisting of 168 and 227 non-homologous ATP binding proteins respectively demonstrate the efficacy of the proposed protocol. Our experimental results also reveal that the residue structural characteristics of real protein-ATP binding sites are significant different from those normal ones, for example the binding residues do not show high solvent accessibility propensities, and the bindings prefer to occur at the conjoint points between different secondary structure segments. Furthermore, results also show that performance is affected by the imbalanced training datasets by testing multiple ratios between positive and negative samples in the experiments. Increasing the dataset scale is also demonstrated useful for improving the prediction performances.
format Online
Article
Text
id pubmed-3424114
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-34241142012-08-22 Predicting protein-ATP binding sites from primary sequence through fusing bi-profile sampling of multi-view features Zhang, Ya-Nan Yu, Dong-Jun Li, Shu-Sen Fan, Yong-Xian Huang, Yan Shen, Hong-Bin BMC Bioinformatics Research Article BACKGROUND: Adenosine-5′-triphosphate (ATP) is one of multifunctional nucleotides and plays an important role in cell biology as a coenzyme interacting with proteins. Revealing the binding sites between protein and ATP is significantly important to understand the functionality of the proteins and the mechanisms of protein-ATP complex. RESULTS: In this paper, we propose a novel framework for predicting the proteins’ functional residues, through which they can bind with ATP molecules. The new prediction protocol is achieved by combination of sequence evolutional information and bi-profile sampling of multi-view sequential features and the sequence derived structural features. The hypothesis for this strategy is single-view feature can only represent partial target’s knowledge and multiple sources of descriptors can be complementary. CONCLUSIONS: Prediction performances evaluated by both 5-fold and leave-one-out jackknife cross-validation tests on two benchmark datasets consisting of 168 and 227 non-homologous ATP binding proteins respectively demonstrate the efficacy of the proposed protocol. Our experimental results also reveal that the residue structural characteristics of real protein-ATP binding sites are significant different from those normal ones, for example the binding residues do not show high solvent accessibility propensities, and the bindings prefer to occur at the conjoint points between different secondary structure segments. Furthermore, results also show that performance is affected by the imbalanced training datasets by testing multiple ratios between positive and negative samples in the experiments. Increasing the dataset scale is also demonstrated useful for improving the prediction performances. BioMed Central 2012-05-31 /pmc/articles/PMC3424114/ /pubmed/22651691 http://dx.doi.org/10.1186/1471-2105-13-118 Text en Copyright ©2012 Zhang et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhang, Ya-Nan
Yu, Dong-Jun
Li, Shu-Sen
Fan, Yong-Xian
Huang, Yan
Shen, Hong-Bin
Predicting protein-ATP binding sites from primary sequence through fusing bi-profile sampling of multi-view features
title Predicting protein-ATP binding sites from primary sequence through fusing bi-profile sampling of multi-view features
title_full Predicting protein-ATP binding sites from primary sequence through fusing bi-profile sampling of multi-view features
title_fullStr Predicting protein-ATP binding sites from primary sequence through fusing bi-profile sampling of multi-view features
title_full_unstemmed Predicting protein-ATP binding sites from primary sequence through fusing bi-profile sampling of multi-view features
title_short Predicting protein-ATP binding sites from primary sequence through fusing bi-profile sampling of multi-view features
title_sort predicting protein-atp binding sites from primary sequence through fusing bi-profile sampling of multi-view features
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3424114/
https://www.ncbi.nlm.nih.gov/pubmed/22651691
http://dx.doi.org/10.1186/1471-2105-13-118
work_keys_str_mv AT zhangyanan predictingproteinatpbindingsitesfromprimarysequencethroughfusingbiprofilesamplingofmultiviewfeatures
AT yudongjun predictingproteinatpbindingsitesfromprimarysequencethroughfusingbiprofilesamplingofmultiviewfeatures
AT lishusen predictingproteinatpbindingsitesfromprimarysequencethroughfusingbiprofilesamplingofmultiviewfeatures
AT fanyongxian predictingproteinatpbindingsitesfromprimarysequencethroughfusingbiprofilesamplingofmultiviewfeatures
AT huangyan predictingproteinatpbindingsitesfromprimarysequencethroughfusingbiprofilesamplingofmultiviewfeatures
AT shenhongbin predictingproteinatpbindingsitesfromprimarysequencethroughfusingbiprofilesamplingofmultiviewfeatures