Cargando…
Bayesian variable selection for parametric survival model with applications to cancer omics data
BACKGROUND: Modeling thousands of markers simultaneously has been of great interest in testing association between genetic biomarkers and disease or disease-related quantitative traits. Recently, an expectation-maximization (EM) approach to Bayesian variable selection (EMVS) facilitating the Bayesia...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6218990/ https://www.ncbi.nlm.nih.gov/pubmed/30400837 http://dx.doi.org/10.1186/s40246-018-0179-x |
_version_ | 1783368559707029504 |
---|---|
author | Duan, Weiwei Zhang, Ruyang Zhao, Yang Shen, Sipeng Wei, Yongyue Chen, Feng Christiani, David C. |
author_facet | Duan, Weiwei Zhang, Ruyang Zhao, Yang Shen, Sipeng Wei, Yongyue Chen, Feng Christiani, David C. |
author_sort | Duan, Weiwei |
collection | PubMed |
description | BACKGROUND: Modeling thousands of markers simultaneously has been of great interest in testing association between genetic biomarkers and disease or disease-related quantitative traits. Recently, an expectation-maximization (EM) approach to Bayesian variable selection (EMVS) facilitating the Bayesian computation was developed for continuous or binary outcome using a fast EM algorithm. However, it is not suitable to the analyses of time-to-event outcome in many public databases such as The Cancer Genome Atlas (TCGA). RESULTS: We extended the EMVS to high-dimensional parametric survival regression framework (SurvEMVS). A variant of cyclic coordinate descent (CCD) algorithm was used for efficient iteration in M-step, and the extended Bayesian information criteria (EBIC) was employed to make choice on hyperparameter tuning. We evaluated the performance of SurvEMVS using numeric simulations and illustrated the effectiveness on two real datasets. The results of numerical simulations and two real data analyses show the well performance of SurvEMVS in aspects of accuracy and computation. Some potential markers associated with survival of lung or stomach cancer were identified. CONCLUSIONS: These results suggest that our model is effective and can cope with high-dimensional omics data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s40246-018-0179-x) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6218990 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-62189902018-11-08 Bayesian variable selection for parametric survival model with applications to cancer omics data Duan, Weiwei Zhang, Ruyang Zhao, Yang Shen, Sipeng Wei, Yongyue Chen, Feng Christiani, David C. Hum Genomics Primary Research BACKGROUND: Modeling thousands of markers simultaneously has been of great interest in testing association between genetic biomarkers and disease or disease-related quantitative traits. Recently, an expectation-maximization (EM) approach to Bayesian variable selection (EMVS) facilitating the Bayesian computation was developed for continuous or binary outcome using a fast EM algorithm. However, it is not suitable to the analyses of time-to-event outcome in many public databases such as The Cancer Genome Atlas (TCGA). RESULTS: We extended the EMVS to high-dimensional parametric survival regression framework (SurvEMVS). A variant of cyclic coordinate descent (CCD) algorithm was used for efficient iteration in M-step, and the extended Bayesian information criteria (EBIC) was employed to make choice on hyperparameter tuning. We evaluated the performance of SurvEMVS using numeric simulations and illustrated the effectiveness on two real datasets. The results of numerical simulations and two real data analyses show the well performance of SurvEMVS in aspects of accuracy and computation. Some potential markers associated with survival of lung or stomach cancer were identified. CONCLUSIONS: These results suggest that our model is effective and can cope with high-dimensional omics data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s40246-018-0179-x) contains supplementary material, which is available to authorized users. BioMed Central 2018-11-06 /pmc/articles/PMC6218990/ /pubmed/30400837 http://dx.doi.org/10.1186/s40246-018-0179-x Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Primary Research Duan, Weiwei Zhang, Ruyang Zhao, Yang Shen, Sipeng Wei, Yongyue Chen, Feng Christiani, David C. Bayesian variable selection for parametric survival model with applications to cancer omics data |
title | Bayesian variable selection for parametric survival model with applications to cancer omics data |
title_full | Bayesian variable selection for parametric survival model with applications to cancer omics data |
title_fullStr | Bayesian variable selection for parametric survival model with applications to cancer omics data |
title_full_unstemmed | Bayesian variable selection for parametric survival model with applications to cancer omics data |
title_short | Bayesian variable selection for parametric survival model with applications to cancer omics data |
title_sort | bayesian variable selection for parametric survival model with applications to cancer omics data |
topic | Primary Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6218990/ https://www.ncbi.nlm.nih.gov/pubmed/30400837 http://dx.doi.org/10.1186/s40246-018-0179-x |
work_keys_str_mv | AT duanweiwei bayesianvariableselectionforparametricsurvivalmodelwithapplicationstocanceromicsdata AT zhangruyang bayesianvariableselectionforparametricsurvivalmodelwithapplicationstocanceromicsdata AT zhaoyang bayesianvariableselectionforparametricsurvivalmodelwithapplicationstocanceromicsdata AT shensipeng bayesianvariableselectionforparametricsurvivalmodelwithapplicationstocanceromicsdata AT weiyongyue bayesianvariableselectionforparametricsurvivalmodelwithapplicationstocanceromicsdata AT chenfeng bayesianvariableselectionforparametricsurvivalmodelwithapplicationstocanceromicsdata AT christianidavidc bayesianvariableselectionforparametricsurvivalmodelwithapplicationstocanceromicsdata |