Cargando…

Integrated structural variation and point mutation signatures in cancer genomes using correlated topic models

Mutation signatures in cancer genomes reflect endogenous and exogenous mutational processes, offering insights into tumour etiology, features for prognostic and biologic stratification and vulnerabilities to be exploited therapeutically. We present a novel machine learning formalism for improved sig...

Descripción completa

Detalles Bibliográficos
Autores principales: Funnell, Tyler, Zhang, Allen W., Grewal, Diljot, McKinney, Steven, Bashashati, Ali, Wang, Yi Kan, Shah, Sohrab P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6402697/
https://www.ncbi.nlm.nih.gov/pubmed/30794536
http://dx.doi.org/10.1371/journal.pcbi.1006799
_version_ 1783400451066036224
author Funnell, Tyler
Zhang, Allen W.
Grewal, Diljot
McKinney, Steven
Bashashati, Ali
Wang, Yi Kan
Shah, Sohrab P.
author_facet Funnell, Tyler
Zhang, Allen W.
Grewal, Diljot
McKinney, Steven
Bashashati, Ali
Wang, Yi Kan
Shah, Sohrab P.
author_sort Funnell, Tyler
collection PubMed
description Mutation signatures in cancer genomes reflect endogenous and exogenous mutational processes, offering insights into tumour etiology, features for prognostic and biologic stratification and vulnerabilities to be exploited therapeutically. We present a novel machine learning formalism for improved signature inference, based on multi-modal correlated topic models (MMCTM) which can at once infer signatures from both single nucleotide and structural variation counts derived from cancer genome sequencing data. We exemplify the utility of our approach on two hormone driven, DNA repair deficient cancers: breast and ovary (n = 755 samples total). We show how introducing correlated structure both within and between modes of mutation can increase accuracy of signature discovery, particularly in the context of sparse data. Our study emphasizes the importance of integrating multiple mutation modes for signature discovery and patient stratification, and provides a statistical modeling framework to incorporate additional features of interest for future studies.
format Online
Article
Text
id pubmed-6402697
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-64026972019-03-17 Integrated structural variation and point mutation signatures in cancer genomes using correlated topic models Funnell, Tyler Zhang, Allen W. Grewal, Diljot McKinney, Steven Bashashati, Ali Wang, Yi Kan Shah, Sohrab P. PLoS Comput Biol Research Article Mutation signatures in cancer genomes reflect endogenous and exogenous mutational processes, offering insights into tumour etiology, features for prognostic and biologic stratification and vulnerabilities to be exploited therapeutically. We present a novel machine learning formalism for improved signature inference, based on multi-modal correlated topic models (MMCTM) which can at once infer signatures from both single nucleotide and structural variation counts derived from cancer genome sequencing data. We exemplify the utility of our approach on two hormone driven, DNA repair deficient cancers: breast and ovary (n = 755 samples total). We show how introducing correlated structure both within and between modes of mutation can increase accuracy of signature discovery, particularly in the context of sparse data. Our study emphasizes the importance of integrating multiple mutation modes for signature discovery and patient stratification, and provides a statistical modeling framework to incorporate additional features of interest for future studies. Public Library of Science 2019-02-22 /pmc/articles/PMC6402697/ /pubmed/30794536 http://dx.doi.org/10.1371/journal.pcbi.1006799 Text en © 2019 Funnell et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Funnell, Tyler
Zhang, Allen W.
Grewal, Diljot
McKinney, Steven
Bashashati, Ali
Wang, Yi Kan
Shah, Sohrab P.
Integrated structural variation and point mutation signatures in cancer genomes using correlated topic models
title Integrated structural variation and point mutation signatures in cancer genomes using correlated topic models
title_full Integrated structural variation and point mutation signatures in cancer genomes using correlated topic models
title_fullStr Integrated structural variation and point mutation signatures in cancer genomes using correlated topic models
title_full_unstemmed Integrated structural variation and point mutation signatures in cancer genomes using correlated topic models
title_short Integrated structural variation and point mutation signatures in cancer genomes using correlated topic models
title_sort integrated structural variation and point mutation signatures in cancer genomes using correlated topic models
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6402697/
https://www.ncbi.nlm.nih.gov/pubmed/30794536
http://dx.doi.org/10.1371/journal.pcbi.1006799
work_keys_str_mv AT funnelltyler integratedstructuralvariationandpointmutationsignaturesincancergenomesusingcorrelatedtopicmodels
AT zhangallenw integratedstructuralvariationandpointmutationsignaturesincancergenomesusingcorrelatedtopicmodels
AT grewaldiljot integratedstructuralvariationandpointmutationsignaturesincancergenomesusingcorrelatedtopicmodels
AT mckinneysteven integratedstructuralvariationandpointmutationsignaturesincancergenomesusingcorrelatedtopicmodels
AT bashashatiali integratedstructuralvariationandpointmutationsignaturesincancergenomesusingcorrelatedtopicmodels
AT wangyikan integratedstructuralvariationandpointmutationsignaturesincancergenomesusingcorrelatedtopicmodels
AT shahsohrabp integratedstructuralvariationandpointmutationsignaturesincancergenomesusingcorrelatedtopicmodels