Cargando…
Classification of Protein Kinases on the Basis of Both Kinase and Non-Kinase Regions
BACKGROUND: Protein phosphorylation is a generic way to regulate signal transduction pathways in all kingdoms of life. In many organisms, it is achieved by the large family of Ser/Thr/Tyr protein kinases which are traditionally classified into groups and subfamilies on the basis of the amino acid se...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2939887/ https://www.ncbi.nlm.nih.gov/pubmed/20856812 http://dx.doi.org/10.1371/journal.pone.0012460 |
_version_ | 1782186781785980928 |
---|---|
author | Martin, Juliette Anamika, Krishanpal Srinivasan, Narayanaswamy |
author_facet | Martin, Juliette Anamika, Krishanpal Srinivasan, Narayanaswamy |
author_sort | Martin, Juliette |
collection | PubMed |
description | BACKGROUND: Protein phosphorylation is a generic way to regulate signal transduction pathways in all kingdoms of life. In many organisms, it is achieved by the large family of Ser/Thr/Tyr protein kinases which are traditionally classified into groups and subfamilies on the basis of the amino acid sequence of their catalytic domains. Many protein kinases are multi-domain in nature but the diversity of the accessory domains and their organization are usually not taken into account while classifying kinases into groups or subfamilies. METHODOLOGY: Here, we present an approach which considers amino acid sequences of complete gene products, in order to suggest refinements in sets of pre-classified sequences. The strategy is based on alignment-free similarity scores and iterative Area Under the Curve (AUC) computation. Similarity scores are computed by detecting common patterns between two sequences and scoring them using a substitution matrix, with a consistent normalization scheme. This allows us to handle full-length sequences, and implicitly takes into account domain diversity and domain shuffling. We quantitatively validate our approach on a subset of 212 human protein kinases. We then employ it on the complete repertoire of human protein kinases and suggest few qualitative refinements in the subfamily assignment stored in the KinG database, which is based on catalytic domains only. Based on our new measure, we delineate 37 cases of potential hybrid kinases: sequences for which classical classification based entirely on catalytic domains is inconsistent with the full-length similarity scores computed here, which implicitly consider multi-domain nature and regions outside the catalytic kinase domain. We also provide some examples of hybrid kinases of the protozoan parasite Entamoeba histolytica. CONCLUSIONS: The implicit consideration of multi-domain architectures is a valuable inclusion to complement other classification schemes. The proposed algorithm may also be employed to classify other families of enzymes with multi-domain architecture. |
format | Text |
id | pubmed-2939887 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-29398872010-09-20 Classification of Protein Kinases on the Basis of Both Kinase and Non-Kinase Regions Martin, Juliette Anamika, Krishanpal Srinivasan, Narayanaswamy PLoS One Research Article BACKGROUND: Protein phosphorylation is a generic way to regulate signal transduction pathways in all kingdoms of life. In many organisms, it is achieved by the large family of Ser/Thr/Tyr protein kinases which are traditionally classified into groups and subfamilies on the basis of the amino acid sequence of their catalytic domains. Many protein kinases are multi-domain in nature but the diversity of the accessory domains and their organization are usually not taken into account while classifying kinases into groups or subfamilies. METHODOLOGY: Here, we present an approach which considers amino acid sequences of complete gene products, in order to suggest refinements in sets of pre-classified sequences. The strategy is based on alignment-free similarity scores and iterative Area Under the Curve (AUC) computation. Similarity scores are computed by detecting common patterns between two sequences and scoring them using a substitution matrix, with a consistent normalization scheme. This allows us to handle full-length sequences, and implicitly takes into account domain diversity and domain shuffling. We quantitatively validate our approach on a subset of 212 human protein kinases. We then employ it on the complete repertoire of human protein kinases and suggest few qualitative refinements in the subfamily assignment stored in the KinG database, which is based on catalytic domains only. Based on our new measure, we delineate 37 cases of potential hybrid kinases: sequences for which classical classification based entirely on catalytic domains is inconsistent with the full-length similarity scores computed here, which implicitly consider multi-domain nature and regions outside the catalytic kinase domain. We also provide some examples of hybrid kinases of the protozoan parasite Entamoeba histolytica. CONCLUSIONS: The implicit consideration of multi-domain architectures is a valuable inclusion to complement other classification schemes. The proposed algorithm may also be employed to classify other families of enzymes with multi-domain architecture. Public Library of Science 2010-09-15 /pmc/articles/PMC2939887/ /pubmed/20856812 http://dx.doi.org/10.1371/journal.pone.0012460 Text en Martin et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Martin, Juliette Anamika, Krishanpal Srinivasan, Narayanaswamy Classification of Protein Kinases on the Basis of Both Kinase and Non-Kinase Regions |
title | Classification of Protein Kinases on the Basis of Both Kinase and Non-Kinase Regions |
title_full | Classification of Protein Kinases on the Basis of Both Kinase and Non-Kinase Regions |
title_fullStr | Classification of Protein Kinases on the Basis of Both Kinase and Non-Kinase Regions |
title_full_unstemmed | Classification of Protein Kinases on the Basis of Both Kinase and Non-Kinase Regions |
title_short | Classification of Protein Kinases on the Basis of Both Kinase and Non-Kinase Regions |
title_sort | classification of protein kinases on the basis of both kinase and non-kinase regions |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2939887/ https://www.ncbi.nlm.nih.gov/pubmed/20856812 http://dx.doi.org/10.1371/journal.pone.0012460 |
work_keys_str_mv | AT martinjuliette classificationofproteinkinasesonthebasisofbothkinaseandnonkinaseregions AT anamikakrishanpal classificationofproteinkinasesonthebasisofbothkinaseandnonkinaseregions AT srinivasannarayanaswamy classificationofproteinkinasesonthebasisofbothkinaseandnonkinaseregions |