Cargando…

Prediction and Prioritization of Rare Oncogenic Mutations in the Cancer Kinome Using Novel Features and Multiple Classifiers

Cancer is a genetic disease that develops through a series of somatic mutations, a subset of which drive cancer progression. Although cancer genome sequencing studies are beginning to reveal the mutational patterns of genes in various cancers, identifying the small subset of “causative” mutations fr...

Descripción completa

Detalles Bibliográficos
Autores principales: U, ManChon, Talevich, Eric, Katiyar, Samiksha, Rasheed, Khaled, Kannan, Natarajan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3990476/
https://www.ncbi.nlm.nih.gov/pubmed/24743239
http://dx.doi.org/10.1371/journal.pcbi.1003545
_version_ 1782312284944596992
author U, ManChon
Talevich, Eric
Katiyar, Samiksha
Rasheed, Khaled
Kannan, Natarajan
author_facet U, ManChon
Talevich, Eric
Katiyar, Samiksha
Rasheed, Khaled
Kannan, Natarajan
author_sort U, ManChon
collection PubMed
description Cancer is a genetic disease that develops through a series of somatic mutations, a subset of which drive cancer progression. Although cancer genome sequencing studies are beginning to reveal the mutational patterns of genes in various cancers, identifying the small subset of “causative” mutations from the large subset of “non-causative” mutations, which accumulate as a consequence of the disease, is a challenge. In this article, we present an effective machine learning approach for identifying cancer-associated mutations in human protein kinases, a class of signaling proteins known to be frequently mutated in human cancers. We evaluate the performance of 11 well known supervised learners and show that a multiple-classifier approach, which combines the performances of individual learners, significantly improves the classification of known cancer-associated mutations. We introduce several novel features related specifically to structural and functional characteristics of protein kinases and find that the level of conservation of the mutated residue at specific evolutionary depths is an important predictor of oncogenic effect. We consolidate the novel features and the multiple-classifier approach to prioritize and experimentally test a set of rare unconfirmed mutations in the epidermal growth factor receptor tyrosine kinase (EGFR). Our studies identify T725M and L861R as rare cancer-associated mutations inasmuch as these mutations increase EGFR activity in the absence of the activating EGF ligand in cell-based assays.
format Online
Article
Text
id pubmed-3990476
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-39904762014-04-21 Prediction and Prioritization of Rare Oncogenic Mutations in the Cancer Kinome Using Novel Features and Multiple Classifiers U, ManChon Talevich, Eric Katiyar, Samiksha Rasheed, Khaled Kannan, Natarajan PLoS Comput Biol Research Article Cancer is a genetic disease that develops through a series of somatic mutations, a subset of which drive cancer progression. Although cancer genome sequencing studies are beginning to reveal the mutational patterns of genes in various cancers, identifying the small subset of “causative” mutations from the large subset of “non-causative” mutations, which accumulate as a consequence of the disease, is a challenge. In this article, we present an effective machine learning approach for identifying cancer-associated mutations in human protein kinases, a class of signaling proteins known to be frequently mutated in human cancers. We evaluate the performance of 11 well known supervised learners and show that a multiple-classifier approach, which combines the performances of individual learners, significantly improves the classification of known cancer-associated mutations. We introduce several novel features related specifically to structural and functional characteristics of protein kinases and find that the level of conservation of the mutated residue at specific evolutionary depths is an important predictor of oncogenic effect. We consolidate the novel features and the multiple-classifier approach to prioritize and experimentally test a set of rare unconfirmed mutations in the epidermal growth factor receptor tyrosine kinase (EGFR). Our studies identify T725M and L861R as rare cancer-associated mutations inasmuch as these mutations increase EGFR activity in the absence of the activating EGF ligand in cell-based assays. Public Library of Science 2014-04-17 /pmc/articles/PMC3990476/ /pubmed/24743239 http://dx.doi.org/10.1371/journal.pcbi.1003545 Text en © 2014 U et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
U, ManChon
Talevich, Eric
Katiyar, Samiksha
Rasheed, Khaled
Kannan, Natarajan
Prediction and Prioritization of Rare Oncogenic Mutations in the Cancer Kinome Using Novel Features and Multiple Classifiers
title Prediction and Prioritization of Rare Oncogenic Mutations in the Cancer Kinome Using Novel Features and Multiple Classifiers
title_full Prediction and Prioritization of Rare Oncogenic Mutations in the Cancer Kinome Using Novel Features and Multiple Classifiers
title_fullStr Prediction and Prioritization of Rare Oncogenic Mutations in the Cancer Kinome Using Novel Features and Multiple Classifiers
title_full_unstemmed Prediction and Prioritization of Rare Oncogenic Mutations in the Cancer Kinome Using Novel Features and Multiple Classifiers
title_short Prediction and Prioritization of Rare Oncogenic Mutations in the Cancer Kinome Using Novel Features and Multiple Classifiers
title_sort prediction and prioritization of rare oncogenic mutations in the cancer kinome using novel features and multiple classifiers
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3990476/
https://www.ncbi.nlm.nih.gov/pubmed/24743239
http://dx.doi.org/10.1371/journal.pcbi.1003545
work_keys_str_mv AT umanchon predictionandprioritizationofrareoncogenicmutationsinthecancerkinomeusingnovelfeaturesandmultipleclassifiers
AT talevicheric predictionandprioritizationofrareoncogenicmutationsinthecancerkinomeusingnovelfeaturesandmultipleclassifiers
AT katiyarsamiksha predictionandprioritizationofrareoncogenicmutationsinthecancerkinomeusingnovelfeaturesandmultipleclassifiers
AT rasheedkhaled predictionandprioritizationofrareoncogenicmutationsinthecancerkinomeusingnovelfeaturesandmultipleclassifiers
AT kannannatarajan predictionandprioritizationofrareoncogenicmutationsinthecancerkinomeusingnovelfeaturesandmultipleclassifiers