Cargando…

Prediction of kinase-specific phosphorylation sites using conditional random fields

Motivation: Phosphorylation is a crucial post-translational protein modification mechanism with important regulatory functions in biological systems. It is catalyzed by a group of enzymes called kinases, each of which recognizes certain target sites in its substrate proteins. Several authors have bu...

Descripción completa

Detalles Bibliográficos
Autores principales: Dang, Thanh Hai, Van Leemput, Koenraad, Verschoren, Alain, Laukens, Kris
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2639296/
https://www.ncbi.nlm.nih.gov/pubmed/18940828
http://dx.doi.org/10.1093/bioinformatics/btn546
_version_ 1782164450149662720
author Dang, Thanh Hai
Van Leemput, Koenraad
Verschoren, Alain
Laukens, Kris
author_facet Dang, Thanh Hai
Van Leemput, Koenraad
Verschoren, Alain
Laukens, Kris
author_sort Dang, Thanh Hai
collection PubMed
description Motivation: Phosphorylation is a crucial post-translational protein modification mechanism with important regulatory functions in biological systems. It is catalyzed by a group of enzymes called kinases, each of which recognizes certain target sites in its substrate proteins. Several authors have built computational models trained from sets of experimentally validated phosphorylation sites to predict these target sites for each given kinase. All of these models suffer from certain limitations, such as the fact that they do not take into account the dependencies between amino acid motifs within protein sequences in a global fashion. Results: We propose a novel approach to predict phosphorylation sites from the protein sequence. The method uses a positive dataset to train a conditional random field (CRF) model. The negative training dataset is used to specify the decision threshold corresponding to a desired false positive rate. Application of the method on experimentally verified benchmark phosphorylation data (Phospho.ELM) shows that it performs well compared to existing methods for most kinases. This is to our knowledge that the first report of the use of CRFs to predict post-translational modification sites in protein sequences. Availability: The source code of the implementation, called CRPhos, is available from http://www.ptools.ua.ac.be/CRPhos/ Contact: kris.laukens@ua.ac.be Suplementary Information: Supplementary data are available at http://www.ptools.ua.ac.be/CRPhos/
format Text
id pubmed-2639296
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-26392962009-02-25 Prediction of kinase-specific phosphorylation sites using conditional random fields Dang, Thanh Hai Van Leemput, Koenraad Verschoren, Alain Laukens, Kris Bioinformatics Original Papers Motivation: Phosphorylation is a crucial post-translational protein modification mechanism with important regulatory functions in biological systems. It is catalyzed by a group of enzymes called kinases, each of which recognizes certain target sites in its substrate proteins. Several authors have built computational models trained from sets of experimentally validated phosphorylation sites to predict these target sites for each given kinase. All of these models suffer from certain limitations, such as the fact that they do not take into account the dependencies between amino acid motifs within protein sequences in a global fashion. Results: We propose a novel approach to predict phosphorylation sites from the protein sequence. The method uses a positive dataset to train a conditional random field (CRF) model. The negative training dataset is used to specify the decision threshold corresponding to a desired false positive rate. Application of the method on experimentally verified benchmark phosphorylation data (Phospho.ELM) shows that it performs well compared to existing methods for most kinases. This is to our knowledge that the first report of the use of CRFs to predict post-translational modification sites in protein sequences. Availability: The source code of the implementation, called CRPhos, is available from http://www.ptools.ua.ac.be/CRPhos/ Contact: kris.laukens@ua.ac.be Suplementary Information: Supplementary data are available at http://www.ptools.ua.ac.be/CRPhos/ Oxford University Press 2008-12-15 2008-10-20 /pmc/articles/PMC2639296/ /pubmed/18940828 http://dx.doi.org/10.1093/bioinformatics/btn546 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Dang, Thanh Hai
Van Leemput, Koenraad
Verschoren, Alain
Laukens, Kris
Prediction of kinase-specific phosphorylation sites using conditional random fields
title Prediction of kinase-specific phosphorylation sites using conditional random fields
title_full Prediction of kinase-specific phosphorylation sites using conditional random fields
title_fullStr Prediction of kinase-specific phosphorylation sites using conditional random fields
title_full_unstemmed Prediction of kinase-specific phosphorylation sites using conditional random fields
title_short Prediction of kinase-specific phosphorylation sites using conditional random fields
title_sort prediction of kinase-specific phosphorylation sites using conditional random fields
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2639296/
https://www.ncbi.nlm.nih.gov/pubmed/18940828
http://dx.doi.org/10.1093/bioinformatics/btn546
work_keys_str_mv AT dangthanhhai predictionofkinasespecificphosphorylationsitesusingconditionalrandomfields
AT vanleemputkoenraad predictionofkinasespecificphosphorylationsitesusingconditionalrandomfields
AT verschorenalain predictionofkinasespecificphosphorylationsitesusingconditionalrandomfields
AT laukenskris predictionofkinasespecificphosphorylationsitesusingconditionalrandomfields