Cargando…

Bayesian variable selection for parametric survival model with applications to cancer omics data

BACKGROUND: Modeling thousands of markers simultaneously has been of great interest in testing association between genetic biomarkers and disease or disease-related quantitative traits. Recently, an expectation-maximization (EM) approach to Bayesian variable selection (EMVS) facilitating the Bayesia...

Descripción completa

Detalles Bibliográficos
Autores principales: Duan, Weiwei, Zhang, Ruyang, Zhao, Yang, Shen, Sipeng, Wei, Yongyue, Chen, Feng, Christiani, David C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6218990/
https://www.ncbi.nlm.nih.gov/pubmed/30400837
http://dx.doi.org/10.1186/s40246-018-0179-x
_version_ 1783368559707029504
author Duan, Weiwei
Zhang, Ruyang
Zhao, Yang
Shen, Sipeng
Wei, Yongyue
Chen, Feng
Christiani, David C.
author_facet Duan, Weiwei
Zhang, Ruyang
Zhao, Yang
Shen, Sipeng
Wei, Yongyue
Chen, Feng
Christiani, David C.
author_sort Duan, Weiwei
collection PubMed
description BACKGROUND: Modeling thousands of markers simultaneously has been of great interest in testing association between genetic biomarkers and disease or disease-related quantitative traits. Recently, an expectation-maximization (EM) approach to Bayesian variable selection (EMVS) facilitating the Bayesian computation was developed for continuous or binary outcome using a fast EM algorithm. However, it is not suitable to the analyses of time-to-event outcome in many public databases such as The Cancer Genome Atlas (TCGA). RESULTS: We extended the EMVS to high-dimensional parametric survival regression framework (SurvEMVS). A variant of cyclic coordinate descent (CCD) algorithm was used for efficient iteration in M-step, and the extended Bayesian information criteria (EBIC) was employed to make choice on hyperparameter tuning. We evaluated the performance of SurvEMVS using numeric simulations and illustrated the effectiveness on two real datasets. The results of numerical simulations and two real data analyses show the well performance of SurvEMVS in aspects of accuracy and computation. Some potential markers associated with survival of lung or stomach cancer were identified. CONCLUSIONS: These results suggest that our model is effective and can cope with high-dimensional omics data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s40246-018-0179-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6218990
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-62189902018-11-08 Bayesian variable selection for parametric survival model with applications to cancer omics data Duan, Weiwei Zhang, Ruyang Zhao, Yang Shen, Sipeng Wei, Yongyue Chen, Feng Christiani, David C. Hum Genomics Primary Research BACKGROUND: Modeling thousands of markers simultaneously has been of great interest in testing association between genetic biomarkers and disease or disease-related quantitative traits. Recently, an expectation-maximization (EM) approach to Bayesian variable selection (EMVS) facilitating the Bayesian computation was developed for continuous or binary outcome using a fast EM algorithm. However, it is not suitable to the analyses of time-to-event outcome in many public databases such as The Cancer Genome Atlas (TCGA). RESULTS: We extended the EMVS to high-dimensional parametric survival regression framework (SurvEMVS). A variant of cyclic coordinate descent (CCD) algorithm was used for efficient iteration in M-step, and the extended Bayesian information criteria (EBIC) was employed to make choice on hyperparameter tuning. We evaluated the performance of SurvEMVS using numeric simulations and illustrated the effectiveness on two real datasets. The results of numerical simulations and two real data analyses show the well performance of SurvEMVS in aspects of accuracy and computation. Some potential markers associated with survival of lung or stomach cancer were identified. CONCLUSIONS: These results suggest that our model is effective and can cope with high-dimensional omics data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s40246-018-0179-x) contains supplementary material, which is available to authorized users. BioMed Central 2018-11-06 /pmc/articles/PMC6218990/ /pubmed/30400837 http://dx.doi.org/10.1186/s40246-018-0179-x Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Primary Research
Duan, Weiwei
Zhang, Ruyang
Zhao, Yang
Shen, Sipeng
Wei, Yongyue
Chen, Feng
Christiani, David C.
Bayesian variable selection for parametric survival model with applications to cancer omics data
title Bayesian variable selection for parametric survival model with applications to cancer omics data
title_full Bayesian variable selection for parametric survival model with applications to cancer omics data
title_fullStr Bayesian variable selection for parametric survival model with applications to cancer omics data
title_full_unstemmed Bayesian variable selection for parametric survival model with applications to cancer omics data
title_short Bayesian variable selection for parametric survival model with applications to cancer omics data
title_sort bayesian variable selection for parametric survival model with applications to cancer omics data
topic Primary Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6218990/
https://www.ncbi.nlm.nih.gov/pubmed/30400837
http://dx.doi.org/10.1186/s40246-018-0179-x
work_keys_str_mv AT duanweiwei bayesianvariableselectionforparametricsurvivalmodelwithapplicationstocanceromicsdata
AT zhangruyang bayesianvariableselectionforparametricsurvivalmodelwithapplicationstocanceromicsdata
AT zhaoyang bayesianvariableselectionforparametricsurvivalmodelwithapplicationstocanceromicsdata
AT shensipeng bayesianvariableselectionforparametricsurvivalmodelwithapplicationstocanceromicsdata
AT weiyongyue bayesianvariableselectionforparametricsurvivalmodelwithapplicationstocanceromicsdata
AT chenfeng bayesianvariableselectionforparametricsurvivalmodelwithapplicationstocanceromicsdata
AT christianidavidc bayesianvariableselectionforparametricsurvivalmodelwithapplicationstocanceromicsdata