Cargando…
Prioritization of pathogenic mutations in the protein kinase superfamily
BACKGROUND: Most of the many mutations described in human protein kinases are tolerated without significant disruption of the corresponding structures or molecular functions, while some of them have been associated to a variety of human diseases, including cancer. In the last decade, a plethora of c...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3303724/ https://www.ncbi.nlm.nih.gov/pubmed/22759651 http://dx.doi.org/10.1186/1471-2164-13-S4-S3 |
_version_ | 1782226778651099136 |
---|---|
author | Izarzugaza, Jose MG del Pozo, Angela Vazquez, Miguel Valencia, Alfonso |
author_facet | Izarzugaza, Jose MG del Pozo, Angela Vazquez, Miguel Valencia, Alfonso |
author_sort | Izarzugaza, Jose MG |
collection | PubMed |
description | BACKGROUND: Most of the many mutations described in human protein kinases are tolerated without significant disruption of the corresponding structures or molecular functions, while some of them have been associated to a variety of human diseases, including cancer. In the last decade, a plethora of computational methods to predict the effect of missense single-nucleotide variants (SNVs) have been developed. Still, current high-throughput sequencing efforts and the concomitant need for massive interpretation of protein sequence variants will demand for more efficient and/or accurate computational methods in the forthcoming years. RESULTS: We present KinMut, a support vector machine (SVM) approach, to identify pathogenic mutations in the protein kinase superfamily. KinMut relays on a combination of sequence-derived features that describe mutations at different levels: (1) Gene level: membership to a specific group in Kinbase and the annotation with GO terms; (2) Domain level: annotated PFAM domains; and (3) Residue level: physicochemical features of amino acids, specificity determining positions, and functional annotations from SwissProt and FireDB. The system has been trained with the set of 3492 human kinase mutations in UniProt for which experimental validation of their pathogenic or neutral character exists. In addition, we discuss the relative importance of these independent properties and their combination for the development of a kinase-specific predictor. Finally, we compare KinMut with other state-of-the-art prediction methods. CONCLUSIONS: Family-specific features appear among the most discriminative information sources, which allow us to produce accurate results in a reliable and very simple way with minimal supervision. Our study aims to broaden the knowledge on the mechanisms by which mutations in the human kinome contribute to disease with a particular focus in cancer. The classifier as well as further documentation is available at http://kinmut.bioinfo.cnio.es/. |
format | Online Article Text |
id | pubmed-3303724 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-33037242012-03-16 Prioritization of pathogenic mutations in the protein kinase superfamily Izarzugaza, Jose MG del Pozo, Angela Vazquez, Miguel Valencia, Alfonso BMC Genomics Proceedings BACKGROUND: Most of the many mutations described in human protein kinases are tolerated without significant disruption of the corresponding structures or molecular functions, while some of them have been associated to a variety of human diseases, including cancer. In the last decade, a plethora of computational methods to predict the effect of missense single-nucleotide variants (SNVs) have been developed. Still, current high-throughput sequencing efforts and the concomitant need for massive interpretation of protein sequence variants will demand for more efficient and/or accurate computational methods in the forthcoming years. RESULTS: We present KinMut, a support vector machine (SVM) approach, to identify pathogenic mutations in the protein kinase superfamily. KinMut relays on a combination of sequence-derived features that describe mutations at different levels: (1) Gene level: membership to a specific group in Kinbase and the annotation with GO terms; (2) Domain level: annotated PFAM domains; and (3) Residue level: physicochemical features of amino acids, specificity determining positions, and functional annotations from SwissProt and FireDB. The system has been trained with the set of 3492 human kinase mutations in UniProt for which experimental validation of their pathogenic or neutral character exists. In addition, we discuss the relative importance of these independent properties and their combination for the development of a kinase-specific predictor. Finally, we compare KinMut with other state-of-the-art prediction methods. CONCLUSIONS: Family-specific features appear among the most discriminative information sources, which allow us to produce accurate results in a reliable and very simple way with minimal supervision. Our study aims to broaden the knowledge on the mechanisms by which mutations in the human kinome contribute to disease with a particular focus in cancer. The classifier as well as further documentation is available at http://kinmut.bioinfo.cnio.es/. BioMed Central 2012-06-18 /pmc/articles/PMC3303724/ /pubmed/22759651 http://dx.doi.org/10.1186/1471-2164-13-S4-S3 Text en Copyright ©2012 Izarzugaza et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Izarzugaza, Jose MG del Pozo, Angela Vazquez, Miguel Valencia, Alfonso Prioritization of pathogenic mutations in the protein kinase superfamily |
title | Prioritization of pathogenic mutations in the protein kinase superfamily |
title_full | Prioritization of pathogenic mutations in the protein kinase superfamily |
title_fullStr | Prioritization of pathogenic mutations in the protein kinase superfamily |
title_full_unstemmed | Prioritization of pathogenic mutations in the protein kinase superfamily |
title_short | Prioritization of pathogenic mutations in the protein kinase superfamily |
title_sort | prioritization of pathogenic mutations in the protein kinase superfamily |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3303724/ https://www.ncbi.nlm.nih.gov/pubmed/22759651 http://dx.doi.org/10.1186/1471-2164-13-S4-S3 |
work_keys_str_mv | AT izarzugazajosemg prioritizationofpathogenicmutationsintheproteinkinasesuperfamily AT delpozoangela prioritizationofpathogenicmutationsintheproteinkinasesuperfamily AT vazquezmiguel prioritizationofpathogenicmutationsintheproteinkinasesuperfamily AT valenciaalfonso prioritizationofpathogenicmutationsintheproteinkinasesuperfamily |