Cargando…
PredPhos: an ensemble framework for structure-based prediction of phosphorylation sites
BACKGROUND: Post-translational modifications (PTMs) occur on almost all proteins and often strongly affect the functions of modified proteins. Phosphorylation is a crucial PTM mechanism with important regulatory functions in biological systems. Identifying the potential phosphorylation sites of a ta...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4943517/ https://www.ncbi.nlm.nih.gov/pubmed/27437197 http://dx.doi.org/10.1186/s40709-016-0042-y |
_version_ | 1782442609557372928 |
---|---|
author | Gao, Yong Hao, Weilin Gu, Jing Liu, Diwei Fan, Chao Chen, Zhigang Deng, Lei |
author_facet | Gao, Yong Hao, Weilin Gu, Jing Liu, Diwei Fan, Chao Chen, Zhigang Deng, Lei |
author_sort | Gao, Yong |
collection | PubMed |
description | BACKGROUND: Post-translational modifications (PTMs) occur on almost all proteins and often strongly affect the functions of modified proteins. Phosphorylation is a crucial PTM mechanism with important regulatory functions in biological systems. Identifying the potential phosphorylation sites of a target protein may increase our understanding of the molecular processes in which it takes part. RESULTS: In this paper, we propose PredPhos, a computational method that can accurately predict both kinase-specific and non-kinase-specific phosphorylation sites by using optimally selected properties. The optimal combination of features was selected from a set of 153 novel structural neighborhood properties by a two-step feature selection method consisting of a random forest algorithm and a sequential backward elimination method. To overcome the imbalanced problem, we adopt an ensemble method, which combines bootstrap resampling technique, support vector machine-based fusion classifiers and majority voting strategy. We evaluate the proposed method using both tenfold cross validation and independent test. Results show that our method achieves a significant improvement on the prediction performance for both kinase-specific and non-kinase-specific phosphorylation sites. CONCLUSIONS: The experimental results demonstrate that the proposed method is quite effective in predicting phosphorylation sites. Promising results are derived from the new structural neighborhood properties, the novel way of feature selection, as well as the ensemble method. |
format | Online Article Text |
id | pubmed-4943517 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-49435172016-07-19 PredPhos: an ensemble framework for structure-based prediction of phosphorylation sites Gao, Yong Hao, Weilin Gu, Jing Liu, Diwei Fan, Chao Chen, Zhigang Deng, Lei J Biol Res (Thessalon) Research BACKGROUND: Post-translational modifications (PTMs) occur on almost all proteins and often strongly affect the functions of modified proteins. Phosphorylation is a crucial PTM mechanism with important regulatory functions in biological systems. Identifying the potential phosphorylation sites of a target protein may increase our understanding of the molecular processes in which it takes part. RESULTS: In this paper, we propose PredPhos, a computational method that can accurately predict both kinase-specific and non-kinase-specific phosphorylation sites by using optimally selected properties. The optimal combination of features was selected from a set of 153 novel structural neighborhood properties by a two-step feature selection method consisting of a random forest algorithm and a sequential backward elimination method. To overcome the imbalanced problem, we adopt an ensemble method, which combines bootstrap resampling technique, support vector machine-based fusion classifiers and majority voting strategy. We evaluate the proposed method using both tenfold cross validation and independent test. Results show that our method achieves a significant improvement on the prediction performance for both kinase-specific and non-kinase-specific phosphorylation sites. CONCLUSIONS: The experimental results demonstrate that the proposed method is quite effective in predicting phosphorylation sites. Promising results are derived from the new structural neighborhood properties, the novel way of feature selection, as well as the ensemble method. BioMed Central 2016-07-04 /pmc/articles/PMC4943517/ /pubmed/27437197 http://dx.doi.org/10.1186/s40709-016-0042-y Text en © Gao et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Gao, Yong Hao, Weilin Gu, Jing Liu, Diwei Fan, Chao Chen, Zhigang Deng, Lei PredPhos: an ensemble framework for structure-based prediction of phosphorylation sites |
title | PredPhos: an ensemble framework for structure-based prediction of phosphorylation sites |
title_full | PredPhos: an ensemble framework for structure-based prediction of phosphorylation sites |
title_fullStr | PredPhos: an ensemble framework for structure-based prediction of phosphorylation sites |
title_full_unstemmed | PredPhos: an ensemble framework for structure-based prediction of phosphorylation sites |
title_short | PredPhos: an ensemble framework for structure-based prediction of phosphorylation sites |
title_sort | predphos: an ensemble framework for structure-based prediction of phosphorylation sites |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4943517/ https://www.ncbi.nlm.nih.gov/pubmed/27437197 http://dx.doi.org/10.1186/s40709-016-0042-y |
work_keys_str_mv | AT gaoyong predphosanensembleframeworkforstructurebasedpredictionofphosphorylationsites AT haoweilin predphosanensembleframeworkforstructurebasedpredictionofphosphorylationsites AT gujing predphosanensembleframeworkforstructurebasedpredictionofphosphorylationsites AT liudiwei predphosanensembleframeworkforstructurebasedpredictionofphosphorylationsites AT fanchao predphosanensembleframeworkforstructurebasedpredictionofphosphorylationsites AT chenzhigang predphosanensembleframeworkforstructurebasedpredictionofphosphorylationsites AT denglei predphosanensembleframeworkforstructurebasedpredictionofphosphorylationsites |