Cargando…

iPhos-PseEn: Identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier

Protein phosphorylation is a posttranslational modification (PTM or PTLM), where a phosphoryl group is added to the residue(s) of a protein molecule. The most commonly phosphorylated amino acids occur at serine (S), threonine (T), and tyrosine (Y). Protein phosphorylation plays a significant role in...

Descripción completa

Detalles Bibliográficos
Autores principales: Qiu, Wang-Ren, Xiao, Xuan, Xu, Zhao-Chun, Chou, Kuo-Chen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Impact Journals LLC 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5239474/
https://www.ncbi.nlm.nih.gov/pubmed/27323404
http://dx.doi.org/10.18632/oncotarget.9987
_version_ 1782495901566107648
author Qiu, Wang-Ren
Xiao, Xuan
Xu, Zhao-Chun
Chou, Kuo-Chen
author_facet Qiu, Wang-Ren
Xiao, Xuan
Xu, Zhao-Chun
Chou, Kuo-Chen
author_sort Qiu, Wang-Ren
collection PubMed
description Protein phosphorylation is a posttranslational modification (PTM or PTLM), where a phosphoryl group is added to the residue(s) of a protein molecule. The most commonly phosphorylated amino acids occur at serine (S), threonine (T), and tyrosine (Y). Protein phosphorylation plays a significant role in a wide range of cellular processes; meanwhile its dysregulation is also involved with many diseases. Therefore, from the angles of both basic research and drug development, we are facing a challenging problem: for an uncharacterized protein sequence containing many residues of S, T, or Y, which ones can be phosphorylated, and which ones cannot? To address this problem, we have developed a predictor called iPhos-PseEn by fusing four different pseudo component approaches (amino acids’ disorder scores, nearest neighbor scores, occurrence frequencies, and position weights) into an ensemble classifier via a voting system. Rigorous cross-validations indicated that the proposed predictor remarkably outperformed its existing counterparts. For the convenience of most experimental scientists, a user-friendly web-server for iPhos-PseEn has been established at http://www.jci-bioinfo.cn/iPhos-PseEn, by which users can easily obtain their desired results without the need to go through the complicated mathematical equations involved.
format Online
Article
Text
id pubmed-5239474
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Impact Journals LLC
record_format MEDLINE/PubMed
spelling pubmed-52394742017-01-24 iPhos-PseEn: Identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier Qiu, Wang-Ren Xiao, Xuan Xu, Zhao-Chun Chou, Kuo-Chen Oncotarget Research Paper Protein phosphorylation is a posttranslational modification (PTM or PTLM), where a phosphoryl group is added to the residue(s) of a protein molecule. The most commonly phosphorylated amino acids occur at serine (S), threonine (T), and tyrosine (Y). Protein phosphorylation plays a significant role in a wide range of cellular processes; meanwhile its dysregulation is also involved with many diseases. Therefore, from the angles of both basic research and drug development, we are facing a challenging problem: for an uncharacterized protein sequence containing many residues of S, T, or Y, which ones can be phosphorylated, and which ones cannot? To address this problem, we have developed a predictor called iPhos-PseEn by fusing four different pseudo component approaches (amino acids’ disorder scores, nearest neighbor scores, occurrence frequencies, and position weights) into an ensemble classifier via a voting system. Rigorous cross-validations indicated that the proposed predictor remarkably outperformed its existing counterparts. For the convenience of most experimental scientists, a user-friendly web-server for iPhos-PseEn has been established at http://www.jci-bioinfo.cn/iPhos-PseEn, by which users can easily obtain their desired results without the need to go through the complicated mathematical equations involved. Impact Journals LLC 2016-06-13 /pmc/articles/PMC5239474/ /pubmed/27323404 http://dx.doi.org/10.18632/oncotarget.9987 Text en Copyright: © 2016 Qiu et al. http://creativecommons.org/licenses/by/2.5/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Paper
Qiu, Wang-Ren
Xiao, Xuan
Xu, Zhao-Chun
Chou, Kuo-Chen
iPhos-PseEn: Identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier
title iPhos-PseEn: Identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier
title_full iPhos-PseEn: Identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier
title_fullStr iPhos-PseEn: Identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier
title_full_unstemmed iPhos-PseEn: Identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier
title_short iPhos-PseEn: Identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier
title_sort iphos-pseen: identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier
topic Research Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5239474/
https://www.ncbi.nlm.nih.gov/pubmed/27323404
http://dx.doi.org/10.18632/oncotarget.9987
work_keys_str_mv AT qiuwangren iphospseenidentifyingphosphorylationsitesinproteinsbyfusingdifferentpseudocomponentsintoanensembleclassifier
AT xiaoxuan iphospseenidentifyingphosphorylationsitesinproteinsbyfusingdifferentpseudocomponentsintoanensembleclassifier
AT xuzhaochun iphospseenidentifyingphosphorylationsitesinproteinsbyfusingdifferentpseudocomponentsintoanensembleclassifier
AT choukuochen iphospseenidentifyingphosphorylationsitesinproteinsbyfusingdifferentpseudocomponentsintoanensembleclassifier