Cargando…

PredPhos: an ensemble framework for structure-based prediction of phosphorylation sites

BACKGROUND: Post-translational modifications (PTMs) occur on almost all proteins and often strongly affect the functions of modified proteins. Phosphorylation is a crucial PTM mechanism with important regulatory functions in biological systems. Identifying the potential phosphorylation sites of a ta...

Descripción completa

Detalles Bibliográficos
Autores principales: Gao, Yong, Hao, Weilin, Gu, Jing, Liu, Diwei, Fan, Chao, Chen, Zhigang, Deng, Lei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4943517/
https://www.ncbi.nlm.nih.gov/pubmed/27437197
http://dx.doi.org/10.1186/s40709-016-0042-y
_version_ 1782442609557372928
author Gao, Yong
Hao, Weilin
Gu, Jing
Liu, Diwei
Fan, Chao
Chen, Zhigang
Deng, Lei
author_facet Gao, Yong
Hao, Weilin
Gu, Jing
Liu, Diwei
Fan, Chao
Chen, Zhigang
Deng, Lei
author_sort Gao, Yong
collection PubMed
description BACKGROUND: Post-translational modifications (PTMs) occur on almost all proteins and often strongly affect the functions of modified proteins. Phosphorylation is a crucial PTM mechanism with important regulatory functions in biological systems. Identifying the potential phosphorylation sites of a target protein may increase our understanding of the molecular processes in which it takes part. RESULTS: In this paper, we propose PredPhos, a computational method that can accurately predict both kinase-specific and non-kinase-specific phosphorylation sites by using optimally selected properties. The optimal combination of features was selected from a set of 153 novel structural neighborhood properties by a two-step feature selection method consisting of a random forest algorithm and a sequential backward elimination method. To overcome the imbalanced problem, we adopt an ensemble method, which combines bootstrap resampling technique, support vector machine-based fusion classifiers and majority voting strategy. We evaluate the proposed method using both tenfold cross validation and independent test. Results show that our method achieves a significant improvement on the prediction performance for both kinase-specific and non-kinase-specific phosphorylation sites. CONCLUSIONS: The experimental results demonstrate that the proposed method is quite effective in predicting phosphorylation sites. Promising results are derived from the new structural neighborhood properties, the novel way of feature selection, as well as the ensemble method.
format Online
Article
Text
id pubmed-4943517
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-49435172016-07-19 PredPhos: an ensemble framework for structure-based prediction of phosphorylation sites Gao, Yong Hao, Weilin Gu, Jing Liu, Diwei Fan, Chao Chen, Zhigang Deng, Lei J Biol Res (Thessalon) Research BACKGROUND: Post-translational modifications (PTMs) occur on almost all proteins and often strongly affect the functions of modified proteins. Phosphorylation is a crucial PTM mechanism with important regulatory functions in biological systems. Identifying the potential phosphorylation sites of a target protein may increase our understanding of the molecular processes in which it takes part. RESULTS: In this paper, we propose PredPhos, a computational method that can accurately predict both kinase-specific and non-kinase-specific phosphorylation sites by using optimally selected properties. The optimal combination of features was selected from a set of 153 novel structural neighborhood properties by a two-step feature selection method consisting of a random forest algorithm and a sequential backward elimination method. To overcome the imbalanced problem, we adopt an ensemble method, which combines bootstrap resampling technique, support vector machine-based fusion classifiers and majority voting strategy. We evaluate the proposed method using both tenfold cross validation and independent test. Results show that our method achieves a significant improvement on the prediction performance for both kinase-specific and non-kinase-specific phosphorylation sites. CONCLUSIONS: The experimental results demonstrate that the proposed method is quite effective in predicting phosphorylation sites. Promising results are derived from the new structural neighborhood properties, the novel way of feature selection, as well as the ensemble method. BioMed Central 2016-07-04 /pmc/articles/PMC4943517/ /pubmed/27437197 http://dx.doi.org/10.1186/s40709-016-0042-y Text en © Gao et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Gao, Yong
Hao, Weilin
Gu, Jing
Liu, Diwei
Fan, Chao
Chen, Zhigang
Deng, Lei
PredPhos: an ensemble framework for structure-based prediction of phosphorylation sites
title PredPhos: an ensemble framework for structure-based prediction of phosphorylation sites
title_full PredPhos: an ensemble framework for structure-based prediction of phosphorylation sites
title_fullStr PredPhos: an ensemble framework for structure-based prediction of phosphorylation sites
title_full_unstemmed PredPhos: an ensemble framework for structure-based prediction of phosphorylation sites
title_short PredPhos: an ensemble framework for structure-based prediction of phosphorylation sites
title_sort predphos: an ensemble framework for structure-based prediction of phosphorylation sites
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4943517/
https://www.ncbi.nlm.nih.gov/pubmed/27437197
http://dx.doi.org/10.1186/s40709-016-0042-y
work_keys_str_mv AT gaoyong predphosanensembleframeworkforstructurebasedpredictionofphosphorylationsites
AT haoweilin predphosanensembleframeworkforstructurebasedpredictionofphosphorylationsites
AT gujing predphosanensembleframeworkforstructurebasedpredictionofphosphorylationsites
AT liudiwei predphosanensembleframeworkforstructurebasedpredictionofphosphorylationsites
AT fanchao predphosanensembleframeworkforstructurebasedpredictionofphosphorylationsites
AT chenzhigang predphosanensembleframeworkforstructurebasedpredictionofphosphorylationsites
AT denglei predphosanensembleframeworkforstructurebasedpredictionofphosphorylationsites