Cargando…

A Structurally-Validated Multiple Sequence Alignment of 497 Human Protein Kinase Domains

Studies on the structures and functions of individual kinases have been used to understand the biological properties of other kinases that do not yet have experimental structures. The key factor in accurate inference by homology is an accurate sequence alignment. We present a parsimonious, structure...

Descripción completa

Detalles Bibliográficos
Autores principales: Modi, Vivek, Dunbrack, Roland L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6930252/
https://www.ncbi.nlm.nih.gov/pubmed/31875044
http://dx.doi.org/10.1038/s41598-019-56499-4
_version_ 1783482856758050816
author Modi, Vivek
Dunbrack, Roland L.
author_facet Modi, Vivek
Dunbrack, Roland L.
author_sort Modi, Vivek
collection PubMed
description Studies on the structures and functions of individual kinases have been used to understand the biological properties of other kinases that do not yet have experimental structures. The key factor in accurate inference by homology is an accurate sequence alignment. We present a parsimonious, structure-based multiple sequence alignment (MSA) of 497 human protein kinase domains excluding atypical kinases. The alignment is arranged in 17 blocks of conserved regions and unaligned blocks in between that contain insertions of varying lengths present in only a subset of kinases. The aligned blocks contain well-conserved elements of secondary structure and well-known functional motifs, such as the DFG and HRD motifs. From pairwise, all-against-all alignment of 272 human kinase structures, we estimate the accuracy of our MSA to be 97%. The remaining inaccuracy comes from a few structures with shifted elements of secondary structure, and from the boundaries of aligned and unaligned regions, where compromises need to be made to encompass the majority of kinases. A new phylogeny of the protein kinase domains in the human genome based on our alignment indicates that ten kinases previously labeled as “OTHER” can be confidently placed into the CAMK group. These kinases comprise the Aurora kinases, Polo kinases, and calcium/calmodulin-dependent kinase kinases.
format Online
Article
Text
id pubmed-6930252
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-69302522019-12-27 A Structurally-Validated Multiple Sequence Alignment of 497 Human Protein Kinase Domains Modi, Vivek Dunbrack, Roland L. Sci Rep Article Studies on the structures and functions of individual kinases have been used to understand the biological properties of other kinases that do not yet have experimental structures. The key factor in accurate inference by homology is an accurate sequence alignment. We present a parsimonious, structure-based multiple sequence alignment (MSA) of 497 human protein kinase domains excluding atypical kinases. The alignment is arranged in 17 blocks of conserved regions and unaligned blocks in between that contain insertions of varying lengths present in only a subset of kinases. The aligned blocks contain well-conserved elements of secondary structure and well-known functional motifs, such as the DFG and HRD motifs. From pairwise, all-against-all alignment of 272 human kinase structures, we estimate the accuracy of our MSA to be 97%. The remaining inaccuracy comes from a few structures with shifted elements of secondary structure, and from the boundaries of aligned and unaligned regions, where compromises need to be made to encompass the majority of kinases. A new phylogeny of the protein kinase domains in the human genome based on our alignment indicates that ten kinases previously labeled as “OTHER” can be confidently placed into the CAMK group. These kinases comprise the Aurora kinases, Polo kinases, and calcium/calmodulin-dependent kinase kinases. Nature Publishing Group UK 2019-12-24 /pmc/articles/PMC6930252/ /pubmed/31875044 http://dx.doi.org/10.1038/s41598-019-56499-4 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Modi, Vivek
Dunbrack, Roland L.
A Structurally-Validated Multiple Sequence Alignment of 497 Human Protein Kinase Domains
title A Structurally-Validated Multiple Sequence Alignment of 497 Human Protein Kinase Domains
title_full A Structurally-Validated Multiple Sequence Alignment of 497 Human Protein Kinase Domains
title_fullStr A Structurally-Validated Multiple Sequence Alignment of 497 Human Protein Kinase Domains
title_full_unstemmed A Structurally-Validated Multiple Sequence Alignment of 497 Human Protein Kinase Domains
title_short A Structurally-Validated Multiple Sequence Alignment of 497 Human Protein Kinase Domains
title_sort structurally-validated multiple sequence alignment of 497 human protein kinase domains
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6930252/
https://www.ncbi.nlm.nih.gov/pubmed/31875044
http://dx.doi.org/10.1038/s41598-019-56499-4
work_keys_str_mv AT modivivek astructurallyvalidatedmultiplesequencealignmentof497humanproteinkinasedomains
AT dunbrackrolandl astructurallyvalidatedmultiplesequencealignmentof497humanproteinkinasedomains
AT modivivek structurallyvalidatedmultiplesequencealignmentof497humanproteinkinasedomains
AT dunbrackrolandl structurallyvalidatedmultiplesequencealignmentof497humanproteinkinasedomains