Cargando…

Computational Identification of Protein Pupylation Sites by Using Profile-Based Composition of k-Spaced Amino Acid Pairs

Prokaryotic proteins are regulated by pupylation, a type of post-translational modification that contributes to cellular function in bacterial organisms. In pupylation process, the prokaryotic ubiquitin-like protein (Pup) tagging is functionally analogous to ubiquitination in order to tag target pro...

Descripción completa

Detalles Bibliográficos
Autores principales: Hasan, Md. Mehedi, Zhou, Yuan, Lu, Xiaotian, Li, Jinyan, Song, Jiangning, Zhang, Ziding
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4469302/
https://www.ncbi.nlm.nih.gov/pubmed/26080082
http://dx.doi.org/10.1371/journal.pone.0129635
_version_ 1782376607290228736
author Hasan, Md. Mehedi
Zhou, Yuan
Lu, Xiaotian
Li, Jinyan
Song, Jiangning
Zhang, Ziding
author_facet Hasan, Md. Mehedi
Zhou, Yuan
Lu, Xiaotian
Li, Jinyan
Song, Jiangning
Zhang, Ziding
author_sort Hasan, Md. Mehedi
collection PubMed
description Prokaryotic proteins are regulated by pupylation, a type of post-translational modification that contributes to cellular function in bacterial organisms. In pupylation process, the prokaryotic ubiquitin-like protein (Pup) tagging is functionally analogous to ubiquitination in order to tag target proteins for proteasomal degradation. To date, several experimental methods have been developed to identify pupylated proteins and their pupylation sites, but these experimental methods are generally laborious and costly. Therefore, computational methods that can accurately predict potential pupylation sites based on protein sequence information are highly desirable. In this paper, a novel predictor termed as pbPUP has been developed for accurate prediction of pupylation sites. In particular, a sophisticated sequence encoding scheme [i.e. the profile-based composition of k-spaced amino acid pairs (pbCKSAAP)] is used to represent the sequence patterns and evolutionary information of the sequence fragments surrounding pupylation sites. Then, a Support Vector Machine (SVM) classifier is trained using the pbCKSAAP encoding scheme. The final pbPUP predictor achieves an AUC value of 0.849 in10-fold cross-validation tests and outperforms other existing predictors on a comprehensive independent test dataset. The proposed method is anticipated to be a helpful computational resource for the prediction of pupylation sites. The web server and curated datasets in this study are freely available at http://protein.cau.edu.cn/pbPUP/.
format Online
Article
Text
id pubmed-4469302
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-44693022015-06-22 Computational Identification of Protein Pupylation Sites by Using Profile-Based Composition of k-Spaced Amino Acid Pairs Hasan, Md. Mehedi Zhou, Yuan Lu, Xiaotian Li, Jinyan Song, Jiangning Zhang, Ziding PLoS One Research Article Prokaryotic proteins are regulated by pupylation, a type of post-translational modification that contributes to cellular function in bacterial organisms. In pupylation process, the prokaryotic ubiquitin-like protein (Pup) tagging is functionally analogous to ubiquitination in order to tag target proteins for proteasomal degradation. To date, several experimental methods have been developed to identify pupylated proteins and their pupylation sites, but these experimental methods are generally laborious and costly. Therefore, computational methods that can accurately predict potential pupylation sites based on protein sequence information are highly desirable. In this paper, a novel predictor termed as pbPUP has been developed for accurate prediction of pupylation sites. In particular, a sophisticated sequence encoding scheme [i.e. the profile-based composition of k-spaced amino acid pairs (pbCKSAAP)] is used to represent the sequence patterns and evolutionary information of the sequence fragments surrounding pupylation sites. Then, a Support Vector Machine (SVM) classifier is trained using the pbCKSAAP encoding scheme. The final pbPUP predictor achieves an AUC value of 0.849 in10-fold cross-validation tests and outperforms other existing predictors on a comprehensive independent test dataset. The proposed method is anticipated to be a helpful computational resource for the prediction of pupylation sites. The web server and curated datasets in this study are freely available at http://protein.cau.edu.cn/pbPUP/. Public Library of Science 2015-06-16 /pmc/articles/PMC4469302/ /pubmed/26080082 http://dx.doi.org/10.1371/journal.pone.0129635 Text en © 2015 Hasan et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Hasan, Md. Mehedi
Zhou, Yuan
Lu, Xiaotian
Li, Jinyan
Song, Jiangning
Zhang, Ziding
Computational Identification of Protein Pupylation Sites by Using Profile-Based Composition of k-Spaced Amino Acid Pairs
title Computational Identification of Protein Pupylation Sites by Using Profile-Based Composition of k-Spaced Amino Acid Pairs
title_full Computational Identification of Protein Pupylation Sites by Using Profile-Based Composition of k-Spaced Amino Acid Pairs
title_fullStr Computational Identification of Protein Pupylation Sites by Using Profile-Based Composition of k-Spaced Amino Acid Pairs
title_full_unstemmed Computational Identification of Protein Pupylation Sites by Using Profile-Based Composition of k-Spaced Amino Acid Pairs
title_short Computational Identification of Protein Pupylation Sites by Using Profile-Based Composition of k-Spaced Amino Acid Pairs
title_sort computational identification of protein pupylation sites by using profile-based composition of k-spaced amino acid pairs
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4469302/
https://www.ncbi.nlm.nih.gov/pubmed/26080082
http://dx.doi.org/10.1371/journal.pone.0129635
work_keys_str_mv AT hasanmdmehedi computationalidentificationofproteinpupylationsitesbyusingprofilebasedcompositionofkspacedaminoacidpairs
AT zhouyuan computationalidentificationofproteinpupylationsitesbyusingprofilebasedcompositionofkspacedaminoacidpairs
AT luxiaotian computationalidentificationofproteinpupylationsitesbyusingprofilebasedcompositionofkspacedaminoacidpairs
AT lijinyan computationalidentificationofproteinpupylationsitesbyusingprofilebasedcompositionofkspacedaminoacidpairs
AT songjiangning computationalidentificationofproteinpupylationsitesbyusingprofilebasedcompositionofkspacedaminoacidpairs
AT zhangziding computationalidentificationofproteinpupylationsitesbyusingprofilebasedcompositionofkspacedaminoacidpairs