Cargando…
Integrated structural variation and point mutation signatures in cancer genomes using correlated topic models
Mutation signatures in cancer genomes reflect endogenous and exogenous mutational processes, offering insights into tumour etiology, features for prognostic and biologic stratification and vulnerabilities to be exploited therapeutically. We present a novel machine learning formalism for improved sig...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6402697/ https://www.ncbi.nlm.nih.gov/pubmed/30794536 http://dx.doi.org/10.1371/journal.pcbi.1006799 |
_version_ | 1783400451066036224 |
---|---|
author | Funnell, Tyler Zhang, Allen W. Grewal, Diljot McKinney, Steven Bashashati, Ali Wang, Yi Kan Shah, Sohrab P. |
author_facet | Funnell, Tyler Zhang, Allen W. Grewal, Diljot McKinney, Steven Bashashati, Ali Wang, Yi Kan Shah, Sohrab P. |
author_sort | Funnell, Tyler |
collection | PubMed |
description | Mutation signatures in cancer genomes reflect endogenous and exogenous mutational processes, offering insights into tumour etiology, features for prognostic and biologic stratification and vulnerabilities to be exploited therapeutically. We present a novel machine learning formalism for improved signature inference, based on multi-modal correlated topic models (MMCTM) which can at once infer signatures from both single nucleotide and structural variation counts derived from cancer genome sequencing data. We exemplify the utility of our approach on two hormone driven, DNA repair deficient cancers: breast and ovary (n = 755 samples total). We show how introducing correlated structure both within and between modes of mutation can increase accuracy of signature discovery, particularly in the context of sparse data. Our study emphasizes the importance of integrating multiple mutation modes for signature discovery and patient stratification, and provides a statistical modeling framework to incorporate additional features of interest for future studies. |
format | Online Article Text |
id | pubmed-6402697 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-64026972019-03-17 Integrated structural variation and point mutation signatures in cancer genomes using correlated topic models Funnell, Tyler Zhang, Allen W. Grewal, Diljot McKinney, Steven Bashashati, Ali Wang, Yi Kan Shah, Sohrab P. PLoS Comput Biol Research Article Mutation signatures in cancer genomes reflect endogenous and exogenous mutational processes, offering insights into tumour etiology, features for prognostic and biologic stratification and vulnerabilities to be exploited therapeutically. We present a novel machine learning formalism for improved signature inference, based on multi-modal correlated topic models (MMCTM) which can at once infer signatures from both single nucleotide and structural variation counts derived from cancer genome sequencing data. We exemplify the utility of our approach on two hormone driven, DNA repair deficient cancers: breast and ovary (n = 755 samples total). We show how introducing correlated structure both within and between modes of mutation can increase accuracy of signature discovery, particularly in the context of sparse data. Our study emphasizes the importance of integrating multiple mutation modes for signature discovery and patient stratification, and provides a statistical modeling framework to incorporate additional features of interest for future studies. Public Library of Science 2019-02-22 /pmc/articles/PMC6402697/ /pubmed/30794536 http://dx.doi.org/10.1371/journal.pcbi.1006799 Text en © 2019 Funnell et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Funnell, Tyler Zhang, Allen W. Grewal, Diljot McKinney, Steven Bashashati, Ali Wang, Yi Kan Shah, Sohrab P. Integrated structural variation and point mutation signatures in cancer genomes using correlated topic models |
title | Integrated structural variation and point mutation signatures in cancer genomes using correlated topic models |
title_full | Integrated structural variation and point mutation signatures in cancer genomes using correlated topic models |
title_fullStr | Integrated structural variation and point mutation signatures in cancer genomes using correlated topic models |
title_full_unstemmed | Integrated structural variation and point mutation signatures in cancer genomes using correlated topic models |
title_short | Integrated structural variation and point mutation signatures in cancer genomes using correlated topic models |
title_sort | integrated structural variation and point mutation signatures in cancer genomes using correlated topic models |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6402697/ https://www.ncbi.nlm.nih.gov/pubmed/30794536 http://dx.doi.org/10.1371/journal.pcbi.1006799 |
work_keys_str_mv | AT funnelltyler integratedstructuralvariationandpointmutationsignaturesincancergenomesusingcorrelatedtopicmodels AT zhangallenw integratedstructuralvariationandpointmutationsignaturesincancergenomesusingcorrelatedtopicmodels AT grewaldiljot integratedstructuralvariationandpointmutationsignaturesincancergenomesusingcorrelatedtopicmodels AT mckinneysteven integratedstructuralvariationandpointmutationsignaturesincancergenomesusingcorrelatedtopicmodels AT bashashatiali integratedstructuralvariationandpointmutationsignaturesincancergenomesusingcorrelatedtopicmodels AT wangyikan integratedstructuralvariationandpointmutationsignaturesincancergenomesusingcorrelatedtopicmodels AT shahsohrabp integratedstructuralvariationandpointmutationsignaturesincancergenomesusingcorrelatedtopicmodels |