Cargando…

Bayesian variable selection with graphical structure learning: Applications in integrative genomics

Significant advances in biotechnology have allowed for simultaneous measurement of molecular data across multiple genomic, epigenomic and transcriptomic levels from a single tumor/patient sample. This has motivated systematic data-driven approaches to integrate multi-dimensional structured datasets,...

Descripción completa

Detalles Bibliográficos
Autores principales: Kundu, Suprateek, Cheng, Yichen, Shin, Minsuk, Manyam, Ganiraju, Mallick, Bani K., Baladandayuthapani, Veerabhadran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6066211/
https://www.ncbi.nlm.nih.gov/pubmed/30059495
http://dx.doi.org/10.1371/journal.pone.0195070
_version_ 1783342932443529216
author Kundu, Suprateek
Cheng, Yichen
Shin, Minsuk
Manyam, Ganiraju
Mallick, Bani K.
Baladandayuthapani, Veerabhadran
author_facet Kundu, Suprateek
Cheng, Yichen
Shin, Minsuk
Manyam, Ganiraju
Mallick, Bani K.
Baladandayuthapani, Veerabhadran
author_sort Kundu, Suprateek
collection PubMed
description Significant advances in biotechnology have allowed for simultaneous measurement of molecular data across multiple genomic, epigenomic and transcriptomic levels from a single tumor/patient sample. This has motivated systematic data-driven approaches to integrate multi-dimensional structured datasets, since cancer development and progression is driven by numerous co-ordinated molecular alterations and the interactions between them. We propose a novel multi-scale Bayesian approach that combines integrative graphical structure learning from multiple sources of data with a variable selection framework—to determine the key genomic drivers of cancer progression. The integrative structure learning is first accomplished through novel joint graphical models for heterogeneous (mixed scale) data, allowing for flexible and interpretable incorporation of prior existing knowledge. This subsequently informs a variable selection step to identify groups of co-ordinated molecular features within and across platforms associated with clinical outcomes of cancer progression, while according appropriate adjustments for multicollinearity and multiplicities. We evaluate our methods through rigorous simulations to establish superiority over existing methods that do not take the network and/or prior information into account. Our methods are motivated by and applied to a glioblastoma multiforme (GBM) dataset from The Cancer Genome Atlas to predict patient survival times integrating gene expression, copy number and methylation data. We find a high concordance between our selected prognostic gene network modules with known associations with GBM. In addition, our model discovers several novel cross-platform network interactions (both cis and trans acting) between gene expression, copy number variation associated gene dosing and epigenetic regulation through promoter methylation, some with known implications in the etiology of GBM. Our framework provides a useful tool for biomedical researchers, since clinical prediction using multi-platform genomic information is an important step towards personalized treatment of many cancers.
format Online
Article
Text
id pubmed-6066211
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-60662112018-08-10 Bayesian variable selection with graphical structure learning: Applications in integrative genomics Kundu, Suprateek Cheng, Yichen Shin, Minsuk Manyam, Ganiraju Mallick, Bani K. Baladandayuthapani, Veerabhadran PLoS One Research Article Significant advances in biotechnology have allowed for simultaneous measurement of molecular data across multiple genomic, epigenomic and transcriptomic levels from a single tumor/patient sample. This has motivated systematic data-driven approaches to integrate multi-dimensional structured datasets, since cancer development and progression is driven by numerous co-ordinated molecular alterations and the interactions between them. We propose a novel multi-scale Bayesian approach that combines integrative graphical structure learning from multiple sources of data with a variable selection framework—to determine the key genomic drivers of cancer progression. The integrative structure learning is first accomplished through novel joint graphical models for heterogeneous (mixed scale) data, allowing for flexible and interpretable incorporation of prior existing knowledge. This subsequently informs a variable selection step to identify groups of co-ordinated molecular features within and across platforms associated with clinical outcomes of cancer progression, while according appropriate adjustments for multicollinearity and multiplicities. We evaluate our methods through rigorous simulations to establish superiority over existing methods that do not take the network and/or prior information into account. Our methods are motivated by and applied to a glioblastoma multiforme (GBM) dataset from The Cancer Genome Atlas to predict patient survival times integrating gene expression, copy number and methylation data. We find a high concordance between our selected prognostic gene network modules with known associations with GBM. In addition, our model discovers several novel cross-platform network interactions (both cis and trans acting) between gene expression, copy number variation associated gene dosing and epigenetic regulation through promoter methylation, some with known implications in the etiology of GBM. Our framework provides a useful tool for biomedical researchers, since clinical prediction using multi-platform genomic information is an important step towards personalized treatment of many cancers. Public Library of Science 2018-07-30 /pmc/articles/PMC6066211/ /pubmed/30059495 http://dx.doi.org/10.1371/journal.pone.0195070 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Kundu, Suprateek
Cheng, Yichen
Shin, Minsuk
Manyam, Ganiraju
Mallick, Bani K.
Baladandayuthapani, Veerabhadran
Bayesian variable selection with graphical structure learning: Applications in integrative genomics
title Bayesian variable selection with graphical structure learning: Applications in integrative genomics
title_full Bayesian variable selection with graphical structure learning: Applications in integrative genomics
title_fullStr Bayesian variable selection with graphical structure learning: Applications in integrative genomics
title_full_unstemmed Bayesian variable selection with graphical structure learning: Applications in integrative genomics
title_short Bayesian variable selection with graphical structure learning: Applications in integrative genomics
title_sort bayesian variable selection with graphical structure learning: applications in integrative genomics
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6066211/
https://www.ncbi.nlm.nih.gov/pubmed/30059495
http://dx.doi.org/10.1371/journal.pone.0195070
work_keys_str_mv AT kundusuprateek bayesianvariableselectionwithgraphicalstructurelearningapplicationsinintegrativegenomics
AT chengyichen bayesianvariableselectionwithgraphicalstructurelearningapplicationsinintegrativegenomics
AT shinminsuk bayesianvariableselectionwithgraphicalstructurelearningapplicationsinintegrativegenomics
AT manyamganiraju bayesianvariableselectionwithgraphicalstructurelearningapplicationsinintegrativegenomics
AT mallickbanik bayesianvariableselectionwithgraphicalstructurelearningapplicationsinintegrativegenomics
AT baladandayuthapaniveerabhadran bayesianvariableselectionwithgraphicalstructurelearningapplicationsinintegrativegenomics