Cargando…

Multiscale mutation clustering algorithm identifies pan-cancer mutational clusters associated with pathway-level changes in gene expression

Cancer researchers have long recognized that somatic mutations are not uniformly distributed within genes. However, most approaches for identifying cancer mutations focus on either the entire-gene or single amino-acid level. We have bridged these two methodologies with a multiscale mutation clusteri...

Descripción completa

Detalles Bibliográficos
Autores principales: Poole, William, Leinonen, Kalle, Shmulevich, Ilya, Knijnenburg, Theo A., Bernard, Brady
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5321471/
https://www.ncbi.nlm.nih.gov/pubmed/28170390
http://dx.doi.org/10.1371/journal.pcbi.1005347
_version_ 1782509706270474240
author Poole, William
Leinonen, Kalle
Shmulevich, Ilya
Knijnenburg, Theo A.
Bernard, Brady
author_facet Poole, William
Leinonen, Kalle
Shmulevich, Ilya
Knijnenburg, Theo A.
Bernard, Brady
author_sort Poole, William
collection PubMed
description Cancer researchers have long recognized that somatic mutations are not uniformly distributed within genes. However, most approaches for identifying cancer mutations focus on either the entire-gene or single amino-acid level. We have bridged these two methodologies with a multiscale mutation clustering algorithm that identifies variable length mutation clusters in cancer genes. We ran our algorithm on 539 genes using the combined mutation data in 23 cancer types from The Cancer Genome Atlas (TCGA) and identified 1295 mutation clusters. The resulting mutation clusters cover a wide range of scales and often overlap with many kinds of protein features including structured domains, phosphorylation sites, and known single nucleotide variants. We statistically associated these multiscale clusters with gene expression and drug response data to illuminate the functional and clinical consequences of mutations in our clusters. Interestingly, we find multiple clusters within individual genes that have differential functional associations: these include PTEN, FUBP1, and CDH1. This methodology has potential implications in identifying protein regions for drug targets, understanding the biological underpinnings of cancer, and personalizing cancer treatments. Toward this end, we have made the mutation clusters and the clustering algorithm available to the public. Clusters and pathway associations can be interactively browsed at m2c.systemsbiology.net. The multiscale mutation clustering algorithm is available at https://github.com/IlyaLab/M2C.
format Online
Article
Text
id pubmed-5321471
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-53214712017-03-10 Multiscale mutation clustering algorithm identifies pan-cancer mutational clusters associated with pathway-level changes in gene expression Poole, William Leinonen, Kalle Shmulevich, Ilya Knijnenburg, Theo A. Bernard, Brady PLoS Comput Biol Research Article Cancer researchers have long recognized that somatic mutations are not uniformly distributed within genes. However, most approaches for identifying cancer mutations focus on either the entire-gene or single amino-acid level. We have bridged these two methodologies with a multiscale mutation clustering algorithm that identifies variable length mutation clusters in cancer genes. We ran our algorithm on 539 genes using the combined mutation data in 23 cancer types from The Cancer Genome Atlas (TCGA) and identified 1295 mutation clusters. The resulting mutation clusters cover a wide range of scales and often overlap with many kinds of protein features including structured domains, phosphorylation sites, and known single nucleotide variants. We statistically associated these multiscale clusters with gene expression and drug response data to illuminate the functional and clinical consequences of mutations in our clusters. Interestingly, we find multiple clusters within individual genes that have differential functional associations: these include PTEN, FUBP1, and CDH1. This methodology has potential implications in identifying protein regions for drug targets, understanding the biological underpinnings of cancer, and personalizing cancer treatments. Toward this end, we have made the mutation clusters and the clustering algorithm available to the public. Clusters and pathway associations can be interactively browsed at m2c.systemsbiology.net. The multiscale mutation clustering algorithm is available at https://github.com/IlyaLab/M2C. Public Library of Science 2017-02-07 /pmc/articles/PMC5321471/ /pubmed/28170390 http://dx.doi.org/10.1371/journal.pcbi.1005347 Text en © 2017 Poole et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Poole, William
Leinonen, Kalle
Shmulevich, Ilya
Knijnenburg, Theo A.
Bernard, Brady
Multiscale mutation clustering algorithm identifies pan-cancer mutational clusters associated with pathway-level changes in gene expression
title Multiscale mutation clustering algorithm identifies pan-cancer mutational clusters associated with pathway-level changes in gene expression
title_full Multiscale mutation clustering algorithm identifies pan-cancer mutational clusters associated with pathway-level changes in gene expression
title_fullStr Multiscale mutation clustering algorithm identifies pan-cancer mutational clusters associated with pathway-level changes in gene expression
title_full_unstemmed Multiscale mutation clustering algorithm identifies pan-cancer mutational clusters associated with pathway-level changes in gene expression
title_short Multiscale mutation clustering algorithm identifies pan-cancer mutational clusters associated with pathway-level changes in gene expression
title_sort multiscale mutation clustering algorithm identifies pan-cancer mutational clusters associated with pathway-level changes in gene expression
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5321471/
https://www.ncbi.nlm.nih.gov/pubmed/28170390
http://dx.doi.org/10.1371/journal.pcbi.1005347
work_keys_str_mv AT poolewilliam multiscalemutationclusteringalgorithmidentifiespancancermutationalclustersassociatedwithpathwaylevelchangesingeneexpression
AT leinonenkalle multiscalemutationclusteringalgorithmidentifiespancancermutationalclustersassociatedwithpathwaylevelchangesingeneexpression
AT shmulevichilya multiscalemutationclusteringalgorithmidentifiespancancermutationalclustersassociatedwithpathwaylevelchangesingeneexpression
AT knijnenburgtheoa multiscalemutationclusteringalgorithmidentifiespancancermutationalclustersassociatedwithpathwaylevelchangesingeneexpression
AT bernardbrady multiscalemutationclusteringalgorithmidentifiespancancermutationalclustersassociatedwithpathwaylevelchangesingeneexpression