Cargando…
iPhos-PseEn: Identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier
Protein phosphorylation is a posttranslational modification (PTM or PTLM), where a phosphoryl group is added to the residue(s) of a protein molecule. The most commonly phosphorylated amino acids occur at serine (S), threonine (T), and tyrosine (Y). Protein phosphorylation plays a significant role in...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Impact Journals LLC
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5239474/ https://www.ncbi.nlm.nih.gov/pubmed/27323404 http://dx.doi.org/10.18632/oncotarget.9987 |
_version_ | 1782495901566107648 |
---|---|
author | Qiu, Wang-Ren Xiao, Xuan Xu, Zhao-Chun Chou, Kuo-Chen |
author_facet | Qiu, Wang-Ren Xiao, Xuan Xu, Zhao-Chun Chou, Kuo-Chen |
author_sort | Qiu, Wang-Ren |
collection | PubMed |
description | Protein phosphorylation is a posttranslational modification (PTM or PTLM), where a phosphoryl group is added to the residue(s) of a protein molecule. The most commonly phosphorylated amino acids occur at serine (S), threonine (T), and tyrosine (Y). Protein phosphorylation plays a significant role in a wide range of cellular processes; meanwhile its dysregulation is also involved with many diseases. Therefore, from the angles of both basic research and drug development, we are facing a challenging problem: for an uncharacterized protein sequence containing many residues of S, T, or Y, which ones can be phosphorylated, and which ones cannot? To address this problem, we have developed a predictor called iPhos-PseEn by fusing four different pseudo component approaches (amino acids’ disorder scores, nearest neighbor scores, occurrence frequencies, and position weights) into an ensemble classifier via a voting system. Rigorous cross-validations indicated that the proposed predictor remarkably outperformed its existing counterparts. For the convenience of most experimental scientists, a user-friendly web-server for iPhos-PseEn has been established at http://www.jci-bioinfo.cn/iPhos-PseEn, by which users can easily obtain their desired results without the need to go through the complicated mathematical equations involved. |
format | Online Article Text |
id | pubmed-5239474 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Impact Journals LLC |
record_format | MEDLINE/PubMed |
spelling | pubmed-52394742017-01-24 iPhos-PseEn: Identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier Qiu, Wang-Ren Xiao, Xuan Xu, Zhao-Chun Chou, Kuo-Chen Oncotarget Research Paper Protein phosphorylation is a posttranslational modification (PTM or PTLM), where a phosphoryl group is added to the residue(s) of a protein molecule. The most commonly phosphorylated amino acids occur at serine (S), threonine (T), and tyrosine (Y). Protein phosphorylation plays a significant role in a wide range of cellular processes; meanwhile its dysregulation is also involved with many diseases. Therefore, from the angles of both basic research and drug development, we are facing a challenging problem: for an uncharacterized protein sequence containing many residues of S, T, or Y, which ones can be phosphorylated, and which ones cannot? To address this problem, we have developed a predictor called iPhos-PseEn by fusing four different pseudo component approaches (amino acids’ disorder scores, nearest neighbor scores, occurrence frequencies, and position weights) into an ensemble classifier via a voting system. Rigorous cross-validations indicated that the proposed predictor remarkably outperformed its existing counterparts. For the convenience of most experimental scientists, a user-friendly web-server for iPhos-PseEn has been established at http://www.jci-bioinfo.cn/iPhos-PseEn, by which users can easily obtain their desired results without the need to go through the complicated mathematical equations involved. Impact Journals LLC 2016-06-13 /pmc/articles/PMC5239474/ /pubmed/27323404 http://dx.doi.org/10.18632/oncotarget.9987 Text en Copyright: © 2016 Qiu et al. http://creativecommons.org/licenses/by/2.5/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Paper Qiu, Wang-Ren Xiao, Xuan Xu, Zhao-Chun Chou, Kuo-Chen iPhos-PseEn: Identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier |
title | iPhos-PseEn: Identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier |
title_full | iPhos-PseEn: Identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier |
title_fullStr | iPhos-PseEn: Identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier |
title_full_unstemmed | iPhos-PseEn: Identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier |
title_short | iPhos-PseEn: Identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier |
title_sort | iphos-pseen: identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier |
topic | Research Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5239474/ https://www.ncbi.nlm.nih.gov/pubmed/27323404 http://dx.doi.org/10.18632/oncotarget.9987 |
work_keys_str_mv | AT qiuwangren iphospseenidentifyingphosphorylationsitesinproteinsbyfusingdifferentpseudocomponentsintoanensembleclassifier AT xiaoxuan iphospseenidentifyingphosphorylationsitesinproteinsbyfusingdifferentpseudocomponentsintoanensembleclassifier AT xuzhaochun iphospseenidentifyingphosphorylationsitesinproteinsbyfusingdifferentpseudocomponentsintoanensembleclassifier AT choukuochen iphospseenidentifyingphosphorylationsitesinproteinsbyfusingdifferentpseudocomponentsintoanensembleclassifier |