Cargando…
A Structurally-Validated Multiple Sequence Alignment of 497 Human Protein Kinase Domains
Studies on the structures and functions of individual kinases have been used to understand the biological properties of other kinases that do not yet have experimental structures. The key factor in accurate inference by homology is an accurate sequence alignment. We present a parsimonious, structure...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6930252/ https://www.ncbi.nlm.nih.gov/pubmed/31875044 http://dx.doi.org/10.1038/s41598-019-56499-4 |
_version_ | 1783482856758050816 |
---|---|
author | Modi, Vivek Dunbrack, Roland L. |
author_facet | Modi, Vivek Dunbrack, Roland L. |
author_sort | Modi, Vivek |
collection | PubMed |
description | Studies on the structures and functions of individual kinases have been used to understand the biological properties of other kinases that do not yet have experimental structures. The key factor in accurate inference by homology is an accurate sequence alignment. We present a parsimonious, structure-based multiple sequence alignment (MSA) of 497 human protein kinase domains excluding atypical kinases. The alignment is arranged in 17 blocks of conserved regions and unaligned blocks in between that contain insertions of varying lengths present in only a subset of kinases. The aligned blocks contain well-conserved elements of secondary structure and well-known functional motifs, such as the DFG and HRD motifs. From pairwise, all-against-all alignment of 272 human kinase structures, we estimate the accuracy of our MSA to be 97%. The remaining inaccuracy comes from a few structures with shifted elements of secondary structure, and from the boundaries of aligned and unaligned regions, where compromises need to be made to encompass the majority of kinases. A new phylogeny of the protein kinase domains in the human genome based on our alignment indicates that ten kinases previously labeled as “OTHER” can be confidently placed into the CAMK group. These kinases comprise the Aurora kinases, Polo kinases, and calcium/calmodulin-dependent kinase kinases. |
format | Online Article Text |
id | pubmed-6930252 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-69302522019-12-27 A Structurally-Validated Multiple Sequence Alignment of 497 Human Protein Kinase Domains Modi, Vivek Dunbrack, Roland L. Sci Rep Article Studies on the structures and functions of individual kinases have been used to understand the biological properties of other kinases that do not yet have experimental structures. The key factor in accurate inference by homology is an accurate sequence alignment. We present a parsimonious, structure-based multiple sequence alignment (MSA) of 497 human protein kinase domains excluding atypical kinases. The alignment is arranged in 17 blocks of conserved regions and unaligned blocks in between that contain insertions of varying lengths present in only a subset of kinases. The aligned blocks contain well-conserved elements of secondary structure and well-known functional motifs, such as the DFG and HRD motifs. From pairwise, all-against-all alignment of 272 human kinase structures, we estimate the accuracy of our MSA to be 97%. The remaining inaccuracy comes from a few structures with shifted elements of secondary structure, and from the boundaries of aligned and unaligned regions, where compromises need to be made to encompass the majority of kinases. A new phylogeny of the protein kinase domains in the human genome based on our alignment indicates that ten kinases previously labeled as “OTHER” can be confidently placed into the CAMK group. These kinases comprise the Aurora kinases, Polo kinases, and calcium/calmodulin-dependent kinase kinases. Nature Publishing Group UK 2019-12-24 /pmc/articles/PMC6930252/ /pubmed/31875044 http://dx.doi.org/10.1038/s41598-019-56499-4 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Modi, Vivek Dunbrack, Roland L. A Structurally-Validated Multiple Sequence Alignment of 497 Human Protein Kinase Domains |
title | A Structurally-Validated Multiple Sequence Alignment of 497 Human Protein Kinase Domains |
title_full | A Structurally-Validated Multiple Sequence Alignment of 497 Human Protein Kinase Domains |
title_fullStr | A Structurally-Validated Multiple Sequence Alignment of 497 Human Protein Kinase Domains |
title_full_unstemmed | A Structurally-Validated Multiple Sequence Alignment of 497 Human Protein Kinase Domains |
title_short | A Structurally-Validated Multiple Sequence Alignment of 497 Human Protein Kinase Domains |
title_sort | structurally-validated multiple sequence alignment of 497 human protein kinase domains |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6930252/ https://www.ncbi.nlm.nih.gov/pubmed/31875044 http://dx.doi.org/10.1038/s41598-019-56499-4 |
work_keys_str_mv | AT modivivek astructurallyvalidatedmultiplesequencealignmentof497humanproteinkinasedomains AT dunbrackrolandl astructurallyvalidatedmultiplesequencealignmentof497humanproteinkinasedomains AT modivivek structurallyvalidatedmultiplesequencealignmentof497humanproteinkinasedomains AT dunbrackrolandl structurallyvalidatedmultiplesequencealignmentof497humanproteinkinasedomains |