Cargando…

Variational Bayes for high-dimensional proportional hazards models with applications within gene expression

MOTIVATION: Few Bayesian methods for analyzing high-dimensional sparse survival data provide scalable variable selection, effect estimation and uncertainty quantification. Such methods often either sacrifice uncertainty quantification by computing maximum a posteriori estimates, or quantify the unce...

Descripción completa

Detalles Bibliográficos
Autores principales: Komodromos, Michael, Aboagye, Eric O, Evangelou, Marina, Filippi, Sarah, Ray, Kolyan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9364383/
https://www.ncbi.nlm.nih.gov/pubmed/35751586
http://dx.doi.org/10.1093/bioinformatics/btac416
_version_ 1784765135616737280
author Komodromos, Michael
Aboagye, Eric O
Evangelou, Marina
Filippi, Sarah
Ray, Kolyan
author_facet Komodromos, Michael
Aboagye, Eric O
Evangelou, Marina
Filippi, Sarah
Ray, Kolyan
author_sort Komodromos, Michael
collection PubMed
description MOTIVATION: Few Bayesian methods for analyzing high-dimensional sparse survival data provide scalable variable selection, effect estimation and uncertainty quantification. Such methods often either sacrifice uncertainty quantification by computing maximum a posteriori estimates, or quantify the uncertainty at high (unscalable) computational expense. RESULTS: We bridge this gap and develop an interpretable and scalable Bayesian proportional hazards model for prediction and variable selection, referred to as sparse variational Bayes. Our method, based on a mean-field variational approximation, overcomes the high computational cost of Markov chain Monte Carlo, whilst retaining useful features, providing a posterior distribution for the parameters and offering a natural mechanism for variable selection via posterior inclusion probabilities. The performance of our proposed method is assessed via extensive simulations and compared against other state-of-the-art Bayesian variable selection methods, demonstrating comparable or better performance. Finally, we demonstrate how the proposed method can be used for variable selection on two transcriptomic datasets with censored survival outcomes, and how the uncertainty quantification offered by our method can be used to provide an interpretable assessment of patient risk. AVAILABILITY AND IMPLEMENTATION: our method has been implemented as a freely available R package survival.svb (https://github.com/mkomod/survival.svb). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9364383
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-93643832022-08-11 Variational Bayes for high-dimensional proportional hazards models with applications within gene expression Komodromos, Michael Aboagye, Eric O Evangelou, Marina Filippi, Sarah Ray, Kolyan Bioinformatics Original Papers MOTIVATION: Few Bayesian methods for analyzing high-dimensional sparse survival data provide scalable variable selection, effect estimation and uncertainty quantification. Such methods often either sacrifice uncertainty quantification by computing maximum a posteriori estimates, or quantify the uncertainty at high (unscalable) computational expense. RESULTS: We bridge this gap and develop an interpretable and scalable Bayesian proportional hazards model for prediction and variable selection, referred to as sparse variational Bayes. Our method, based on a mean-field variational approximation, overcomes the high computational cost of Markov chain Monte Carlo, whilst retaining useful features, providing a posterior distribution for the parameters and offering a natural mechanism for variable selection via posterior inclusion probabilities. The performance of our proposed method is assessed via extensive simulations and compared against other state-of-the-art Bayesian variable selection methods, demonstrating comparable or better performance. Finally, we demonstrate how the proposed method can be used for variable selection on two transcriptomic datasets with censored survival outcomes, and how the uncertainty quantification offered by our method can be used to provide an interpretable assessment of patient risk. AVAILABILITY AND IMPLEMENTATION: our method has been implemented as a freely available R package survival.svb (https://github.com/mkomod/survival.svb). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-06-25 /pmc/articles/PMC9364383/ /pubmed/35751586 http://dx.doi.org/10.1093/bioinformatics/btac416 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Komodromos, Michael
Aboagye, Eric O
Evangelou, Marina
Filippi, Sarah
Ray, Kolyan
Variational Bayes for high-dimensional proportional hazards models with applications within gene expression
title Variational Bayes for high-dimensional proportional hazards models with applications within gene expression
title_full Variational Bayes for high-dimensional proportional hazards models with applications within gene expression
title_fullStr Variational Bayes for high-dimensional proportional hazards models with applications within gene expression
title_full_unstemmed Variational Bayes for high-dimensional proportional hazards models with applications within gene expression
title_short Variational Bayes for high-dimensional proportional hazards models with applications within gene expression
title_sort variational bayes for high-dimensional proportional hazards models with applications within gene expression
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9364383/
https://www.ncbi.nlm.nih.gov/pubmed/35751586
http://dx.doi.org/10.1093/bioinformatics/btac416
work_keys_str_mv AT komodromosmichael variationalbayesforhighdimensionalproportionalhazardsmodelswithapplicationswithingeneexpression
AT aboagyeerico variationalbayesforhighdimensionalproportionalhazardsmodelswithapplicationswithingeneexpression
AT evangeloumarina variationalbayesforhighdimensionalproportionalhazardsmodelswithapplicationswithingeneexpression
AT filippisarah variationalbayesforhighdimensionalproportionalhazardsmodelswithapplicationswithingeneexpression
AT raykolyan variationalbayesforhighdimensionalproportionalhazardsmodelswithapplicationswithingeneexpression