Cargando…
A hierarchical spike-and-slab model for pan-cancer survival using pan-omic data
BACKGROUND: Pan-omics, pan-cancer analysis has advanced our understanding of the molecular heterogeneity of cancer. However, such analyses have been limited in their ability to use information from multiple sources of data (e.g., omics platforms) and multiple sample sets (e.g., cancer types) to pred...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9204947/ https://www.ncbi.nlm.nih.gov/pubmed/35710340 http://dx.doi.org/10.1186/s12859-022-04770-3 |
_version_ | 1784729026888204288 |
---|---|
author | Samorodnitsky, Sarah Hoadley, Katherine A. Lock, Eric F. |
author_facet | Samorodnitsky, Sarah Hoadley, Katherine A. Lock, Eric F. |
author_sort | Samorodnitsky, Sarah |
collection | PubMed |
description | BACKGROUND: Pan-omics, pan-cancer analysis has advanced our understanding of the molecular heterogeneity of cancer. However, such analyses have been limited in their ability to use information from multiple sources of data (e.g., omics platforms) and multiple sample sets (e.g., cancer types) to predict clinical outcomes. We address the issue of prediction across multiple high-dimensional sources of data and sample sets by using molecular patterns identified by BIDIFAC+, a method for integrative dimension reduction of bidimensionally-linked matrices, in a Bayesian hierarchical model. Our model performs variable selection through spike-and-slab priors that borrow information across clustered data. We use this model to predict overall patient survival from the Cancer Genome Atlas with data from 29 cancer types and 4 omics sources and use simulations to characterize the performance of the hierarchical spike-and-slab prior. RESULTS: We found that molecular patterns shared across all or most cancers were largely not predictive of survival. However, our model selected patterns unique to subsets of cancers that differentiate clinical tumor subtypes with markedly different survival outcomes. Some of these subtypes were previously established, such as subtypes of uterine corpus endometrial carcinoma, while others may be novel, such as subtypes within a set of kidney carcinomas. Through simulations, we found that the hierarchical spike-and-slab prior performs best in terms of variable selection accuracy and predictive power when borrowing information is advantageous, but also offers competitive performance when it is not. CONCLUSIONS: We address the issue of prediction across multiple sources of data by using results from BIDIFAC+ in a Bayesian hierarchical model for overall patient survival. By incorporating spike-and-slab priors that borrow information across cancers, we identified molecular patterns that distinguish clinical tumor subtypes within a single cancer and within a group of cancers. We also corroborate the flexibility and performance of using spike-and-slab priors as a Bayesian variable selection approach. |
format | Online Article Text |
id | pubmed-9204947 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-92049472022-06-18 A hierarchical spike-and-slab model for pan-cancer survival using pan-omic data Samorodnitsky, Sarah Hoadley, Katherine A. Lock, Eric F. BMC Bioinformatics Research BACKGROUND: Pan-omics, pan-cancer analysis has advanced our understanding of the molecular heterogeneity of cancer. However, such analyses have been limited in their ability to use information from multiple sources of data (e.g., omics platforms) and multiple sample sets (e.g., cancer types) to predict clinical outcomes. We address the issue of prediction across multiple high-dimensional sources of data and sample sets by using molecular patterns identified by BIDIFAC+, a method for integrative dimension reduction of bidimensionally-linked matrices, in a Bayesian hierarchical model. Our model performs variable selection through spike-and-slab priors that borrow information across clustered data. We use this model to predict overall patient survival from the Cancer Genome Atlas with data from 29 cancer types and 4 omics sources and use simulations to characterize the performance of the hierarchical spike-and-slab prior. RESULTS: We found that molecular patterns shared across all or most cancers were largely not predictive of survival. However, our model selected patterns unique to subsets of cancers that differentiate clinical tumor subtypes with markedly different survival outcomes. Some of these subtypes were previously established, such as subtypes of uterine corpus endometrial carcinoma, while others may be novel, such as subtypes within a set of kidney carcinomas. Through simulations, we found that the hierarchical spike-and-slab prior performs best in terms of variable selection accuracy and predictive power when borrowing information is advantageous, but also offers competitive performance when it is not. CONCLUSIONS: We address the issue of prediction across multiple sources of data by using results from BIDIFAC+ in a Bayesian hierarchical model for overall patient survival. By incorporating spike-and-slab priors that borrow information across cancers, we identified molecular patterns that distinguish clinical tumor subtypes within a single cancer and within a group of cancers. We also corroborate the flexibility and performance of using spike-and-slab priors as a Bayesian variable selection approach. BioMed Central 2022-06-17 /pmc/articles/PMC9204947/ /pubmed/35710340 http://dx.doi.org/10.1186/s12859-022-04770-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Samorodnitsky, Sarah Hoadley, Katherine A. Lock, Eric F. A hierarchical spike-and-slab model for pan-cancer survival using pan-omic data |
title | A hierarchical spike-and-slab model for pan-cancer survival using pan-omic data |
title_full | A hierarchical spike-and-slab model for pan-cancer survival using pan-omic data |
title_fullStr | A hierarchical spike-and-slab model for pan-cancer survival using pan-omic data |
title_full_unstemmed | A hierarchical spike-and-slab model for pan-cancer survival using pan-omic data |
title_short | A hierarchical spike-and-slab model for pan-cancer survival using pan-omic data |
title_sort | hierarchical spike-and-slab model for pan-cancer survival using pan-omic data |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9204947/ https://www.ncbi.nlm.nih.gov/pubmed/35710340 http://dx.doi.org/10.1186/s12859-022-04770-3 |
work_keys_str_mv | AT samorodnitskysarah ahierarchicalspikeandslabmodelforpancancersurvivalusingpanomicdata AT hoadleykatherinea ahierarchicalspikeandslabmodelforpancancersurvivalusingpanomicdata AT lockericf ahierarchicalspikeandslabmodelforpancancersurvivalusingpanomicdata AT samorodnitskysarah hierarchicalspikeandslabmodelforpancancersurvivalusingpanomicdata AT hoadleykatherinea hierarchicalspikeandslabmodelforpancancersurvivalusingpanomicdata AT lockericf hierarchicalspikeandslabmodelforpancancersurvivalusingpanomicdata |