Cargando…

Rice_Phospho 1.0: a new rice-specific SVM predictor for protein phosphorylation sites

Experimentally-determined or computationally-predicted protein phosphorylation sites for distinctive species are becoming increasingly common. In this paper, we compare the predictive performance of a novel classification algorithm with different encoding schemes to develop a rice-specific protein p...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Shoukai, Song, Qi, Tao, Huan, Wang, Wei, Wan, Weifeng, Huang, Jian, Xu, Chaoqun, Chebii, Vivien, Kitony, Justine, Que, Shufu, Harrison, Andrew, He, Huaqin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4493637/
https://www.ncbi.nlm.nih.gov/pubmed/26149854
http://dx.doi.org/10.1038/srep11940
_version_ 1782379955117621248
author Lin, Shoukai
Song, Qi
Tao, Huan
Wang, Wei
Wan, Weifeng
Huang, Jian
Xu, Chaoqun
Chebii, Vivien
Kitony, Justine
Que, Shufu
Harrison, Andrew
He, Huaqin
author_facet Lin, Shoukai
Song, Qi
Tao, Huan
Wang, Wei
Wan, Weifeng
Huang, Jian
Xu, Chaoqun
Chebii, Vivien
Kitony, Justine
Que, Shufu
Harrison, Andrew
He, Huaqin
author_sort Lin, Shoukai
collection PubMed
description Experimentally-determined or computationally-predicted protein phosphorylation sites for distinctive species are becoming increasingly common. In this paper, we compare the predictive performance of a novel classification algorithm with different encoding schemes to develop a rice-specific protein phosphorylation site predictor. Our results imply that the combination of Amino acid occurrence Frequency with Composition of K-Spaced Amino Acid Pairs (AF-CKSAAP) provides the best description of relevant sequence features that surround a phosphorylation site. A support vector machine (SVM) using AF-CKSAAP achieves the best performance in classifying rice protein phophorylation sites when compared to the other algorithms. We have used SVM with AF-CKSAAP to construct a rice-specific protein phosphorylation sites predictor, Rice_Phospho 1.0 (http://bioinformatics.fafu.edu.cn/rice_phospho1.0). We measure the Accuracy (ACC) and Matthews Correlation Coefficient (MCC) of Rice_Phospho 1.0 to be 82.0% and 0.64, significantly higher than those measures for other predictors such as Scansite, Musite, PlantPhos and PhosphoRice. Rice_Phospho 1.0 also successfully predicted the experimentally identified phosphorylation sites in LOC_Os03g51600.1, a protein sequence which did not appear in the training dataset. In summary, Rice_phospho 1.0 outputs reliable predictions of protein phosphorylation sites in rice, and will serve as a useful tool to the community.
format Online
Article
Text
id pubmed-4493637
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-44936372015-07-09 Rice_Phospho 1.0: a new rice-specific SVM predictor for protein phosphorylation sites Lin, Shoukai Song, Qi Tao, Huan Wang, Wei Wan, Weifeng Huang, Jian Xu, Chaoqun Chebii, Vivien Kitony, Justine Que, Shufu Harrison, Andrew He, Huaqin Sci Rep Article Experimentally-determined or computationally-predicted protein phosphorylation sites for distinctive species are becoming increasingly common. In this paper, we compare the predictive performance of a novel classification algorithm with different encoding schemes to develop a rice-specific protein phosphorylation site predictor. Our results imply that the combination of Amino acid occurrence Frequency with Composition of K-Spaced Amino Acid Pairs (AF-CKSAAP) provides the best description of relevant sequence features that surround a phosphorylation site. A support vector machine (SVM) using AF-CKSAAP achieves the best performance in classifying rice protein phophorylation sites when compared to the other algorithms. We have used SVM with AF-CKSAAP to construct a rice-specific protein phosphorylation sites predictor, Rice_Phospho 1.0 (http://bioinformatics.fafu.edu.cn/rice_phospho1.0). We measure the Accuracy (ACC) and Matthews Correlation Coefficient (MCC) of Rice_Phospho 1.0 to be 82.0% and 0.64, significantly higher than those measures for other predictors such as Scansite, Musite, PlantPhos and PhosphoRice. Rice_Phospho 1.0 also successfully predicted the experimentally identified phosphorylation sites in LOC_Os03g51600.1, a protein sequence which did not appear in the training dataset. In summary, Rice_phospho 1.0 outputs reliable predictions of protein phosphorylation sites in rice, and will serve as a useful tool to the community. Nature Publishing Group 2015-07-07 /pmc/articles/PMC4493637/ /pubmed/26149854 http://dx.doi.org/10.1038/srep11940 Text en Copyright © 2015, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Lin, Shoukai
Song, Qi
Tao, Huan
Wang, Wei
Wan, Weifeng
Huang, Jian
Xu, Chaoqun
Chebii, Vivien
Kitony, Justine
Que, Shufu
Harrison, Andrew
He, Huaqin
Rice_Phospho 1.0: a new rice-specific SVM predictor for protein phosphorylation sites
title Rice_Phospho 1.0: a new rice-specific SVM predictor for protein phosphorylation sites
title_full Rice_Phospho 1.0: a new rice-specific SVM predictor for protein phosphorylation sites
title_fullStr Rice_Phospho 1.0: a new rice-specific SVM predictor for protein phosphorylation sites
title_full_unstemmed Rice_Phospho 1.0: a new rice-specific SVM predictor for protein phosphorylation sites
title_short Rice_Phospho 1.0: a new rice-specific SVM predictor for protein phosphorylation sites
title_sort rice_phospho 1.0: a new rice-specific svm predictor for protein phosphorylation sites
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4493637/
https://www.ncbi.nlm.nih.gov/pubmed/26149854
http://dx.doi.org/10.1038/srep11940
work_keys_str_mv AT linshoukai ricephospho10anewricespecificsvmpredictorforproteinphosphorylationsites
AT songqi ricephospho10anewricespecificsvmpredictorforproteinphosphorylationsites
AT taohuan ricephospho10anewricespecificsvmpredictorforproteinphosphorylationsites
AT wangwei ricephospho10anewricespecificsvmpredictorforproteinphosphorylationsites
AT wanweifeng ricephospho10anewricespecificsvmpredictorforproteinphosphorylationsites
AT huangjian ricephospho10anewricespecificsvmpredictorforproteinphosphorylationsites
AT xuchaoqun ricephospho10anewricespecificsvmpredictorforproteinphosphorylationsites
AT chebiivivien ricephospho10anewricespecificsvmpredictorforproteinphosphorylationsites
AT kitonyjustine ricephospho10anewricespecificsvmpredictorforproteinphosphorylationsites
AT queshufu ricephospho10anewricespecificsvmpredictorforproteinphosphorylationsites
AT harrisonandrew ricephospho10anewricespecificsvmpredictorforproteinphosphorylationsites
AT hehuaqin ricephospho10anewricespecificsvmpredictorforproteinphosphorylationsites