Cargando…

A hierarchical spike-and-slab model for pan-cancer survival using pan-omic data

BACKGROUND: Pan-omics, pan-cancer analysis has advanced our understanding of the molecular heterogeneity of cancer. However, such analyses have been limited in their ability to use information from multiple sources of data (e.g., omics platforms) and multiple sample sets (e.g., cancer types) to pred...

Descripción completa

Detalles Bibliográficos
Autores principales: Samorodnitsky, Sarah, Hoadley, Katherine A., Lock, Eric F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9204947/
https://www.ncbi.nlm.nih.gov/pubmed/35710340
http://dx.doi.org/10.1186/s12859-022-04770-3
_version_ 1784729026888204288
author Samorodnitsky, Sarah
Hoadley, Katherine A.
Lock, Eric F.
author_facet Samorodnitsky, Sarah
Hoadley, Katherine A.
Lock, Eric F.
author_sort Samorodnitsky, Sarah
collection PubMed
description BACKGROUND: Pan-omics, pan-cancer analysis has advanced our understanding of the molecular heterogeneity of cancer. However, such analyses have been limited in their ability to use information from multiple sources of data (e.g., omics platforms) and multiple sample sets (e.g., cancer types) to predict clinical outcomes. We address the issue of prediction across multiple high-dimensional sources of data and sample sets by using molecular patterns identified by BIDIFAC+, a method for integrative dimension reduction of bidimensionally-linked matrices, in a Bayesian hierarchical model. Our model performs variable selection through spike-and-slab priors that borrow information across clustered data. We use this model to predict overall patient survival from the Cancer Genome Atlas with data from 29 cancer types and 4 omics sources and use simulations to characterize the performance of the hierarchical spike-and-slab prior. RESULTS: We found that molecular patterns shared across all or most cancers were largely not predictive of survival. However, our model selected patterns unique to subsets of cancers that differentiate clinical tumor subtypes with markedly different survival outcomes. Some of these subtypes were previously established, such as subtypes of uterine corpus endometrial carcinoma, while others may be novel, such as subtypes within a set of kidney carcinomas. Through simulations, we found that the hierarchical spike-and-slab prior performs best in terms of variable selection accuracy and predictive power when borrowing information is advantageous, but also offers competitive performance when it is not. CONCLUSIONS: We address the issue of prediction across multiple sources of data by using results from BIDIFAC+ in a Bayesian hierarchical model for overall patient survival. By incorporating spike-and-slab priors that borrow information across cancers, we identified molecular patterns that distinguish clinical tumor subtypes within a single cancer and within a group of cancers. We also corroborate the flexibility and performance of using spike-and-slab priors as a Bayesian variable selection approach.
format Online
Article
Text
id pubmed-9204947
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-92049472022-06-18 A hierarchical spike-and-slab model for pan-cancer survival using pan-omic data Samorodnitsky, Sarah Hoadley, Katherine A. Lock, Eric F. BMC Bioinformatics Research BACKGROUND: Pan-omics, pan-cancer analysis has advanced our understanding of the molecular heterogeneity of cancer. However, such analyses have been limited in their ability to use information from multiple sources of data (e.g., omics platforms) and multiple sample sets (e.g., cancer types) to predict clinical outcomes. We address the issue of prediction across multiple high-dimensional sources of data and sample sets by using molecular patterns identified by BIDIFAC+, a method for integrative dimension reduction of bidimensionally-linked matrices, in a Bayesian hierarchical model. Our model performs variable selection through spike-and-slab priors that borrow information across clustered data. We use this model to predict overall patient survival from the Cancer Genome Atlas with data from 29 cancer types and 4 omics sources and use simulations to characterize the performance of the hierarchical spike-and-slab prior. RESULTS: We found that molecular patterns shared across all or most cancers were largely not predictive of survival. However, our model selected patterns unique to subsets of cancers that differentiate clinical tumor subtypes with markedly different survival outcomes. Some of these subtypes were previously established, such as subtypes of uterine corpus endometrial carcinoma, while others may be novel, such as subtypes within a set of kidney carcinomas. Through simulations, we found that the hierarchical spike-and-slab prior performs best in terms of variable selection accuracy and predictive power when borrowing information is advantageous, but also offers competitive performance when it is not. CONCLUSIONS: We address the issue of prediction across multiple sources of data by using results from BIDIFAC+ in a Bayesian hierarchical model for overall patient survival. By incorporating spike-and-slab priors that borrow information across cancers, we identified molecular patterns that distinguish clinical tumor subtypes within a single cancer and within a group of cancers. We also corroborate the flexibility and performance of using spike-and-slab priors as a Bayesian variable selection approach. BioMed Central 2022-06-17 /pmc/articles/PMC9204947/ /pubmed/35710340 http://dx.doi.org/10.1186/s12859-022-04770-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Samorodnitsky, Sarah
Hoadley, Katherine A.
Lock, Eric F.
A hierarchical spike-and-slab model for pan-cancer survival using pan-omic data
title A hierarchical spike-and-slab model for pan-cancer survival using pan-omic data
title_full A hierarchical spike-and-slab model for pan-cancer survival using pan-omic data
title_fullStr A hierarchical spike-and-slab model for pan-cancer survival using pan-omic data
title_full_unstemmed A hierarchical spike-and-slab model for pan-cancer survival using pan-omic data
title_short A hierarchical spike-and-slab model for pan-cancer survival using pan-omic data
title_sort hierarchical spike-and-slab model for pan-cancer survival using pan-omic data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9204947/
https://www.ncbi.nlm.nih.gov/pubmed/35710340
http://dx.doi.org/10.1186/s12859-022-04770-3
work_keys_str_mv AT samorodnitskysarah ahierarchicalspikeandslabmodelforpancancersurvivalusingpanomicdata
AT hoadleykatherinea ahierarchicalspikeandslabmodelforpancancersurvivalusingpanomicdata
AT lockericf ahierarchicalspikeandslabmodelforpancancersurvivalusingpanomicdata
AT samorodnitskysarah hierarchicalspikeandslabmodelforpancancersurvivalusingpanomicdata
AT hoadleykatherinea hierarchicalspikeandslabmodelforpancancersurvivalusingpanomicdata
AT lockericf hierarchicalspikeandslabmodelforpancancersurvivalusingpanomicdata