Cargando…

A Gradient Boosting Algorithm for Survival Analysis via Direct Optimization of Concordance Index

Survival analysis focuses on modeling and predicting the time to an event of interest. Many statistical models have been proposed for survival analysis. They often impose strong assumptions on hazard functions, which describe how the risk of an event changes over time depending on covariates associa...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Yifei, Jia, Zhenyu, Mercola, Dan, Xie, Xiaohui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3853154/
https://www.ncbi.nlm.nih.gov/pubmed/24348746
http://dx.doi.org/10.1155/2013/873595
_version_ 1782478789804032000
author Chen, Yifei
Jia, Zhenyu
Mercola, Dan
Xie, Xiaohui
author_facet Chen, Yifei
Jia, Zhenyu
Mercola, Dan
Xie, Xiaohui
author_sort Chen, Yifei
collection PubMed
description Survival analysis focuses on modeling and predicting the time to an event of interest. Many statistical models have been proposed for survival analysis. They often impose strong assumptions on hazard functions, which describe how the risk of an event changes over time depending on covariates associated with each individual. In particular, the prevalent proportional hazards model assumes that covariates are multiplicatively related to the hazard. Here we propose a nonparametric model for survival analysis that does not explicitly assume particular forms of hazard functions. Our nonparametric model utilizes an ensemble of regression trees to determine how the hazard function varies according to the associated covariates. The ensemble model is trained using a gradient boosting method to optimize a smoothed approximation of the concordance index, which is one of the most widely used metrics in survival model performance evaluation. We implemented our model in a software package called GBMCI (gradient boosting machine for concordance index) and benchmarked the performance of our model against other popular survival models with a large-scale breast cancer prognosis dataset. Our experiment shows that GBMCI consistently outperforms other methods based on a number of covariate settings. GBMCI is implemented in R and is freely available online.
format Online
Article
Text
id pubmed-3853154
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-38531542013-12-12 A Gradient Boosting Algorithm for Survival Analysis via Direct Optimization of Concordance Index Chen, Yifei Jia, Zhenyu Mercola, Dan Xie, Xiaohui Comput Math Methods Med Research Article Survival analysis focuses on modeling and predicting the time to an event of interest. Many statistical models have been proposed for survival analysis. They often impose strong assumptions on hazard functions, which describe how the risk of an event changes over time depending on covariates associated with each individual. In particular, the prevalent proportional hazards model assumes that covariates are multiplicatively related to the hazard. Here we propose a nonparametric model for survival analysis that does not explicitly assume particular forms of hazard functions. Our nonparametric model utilizes an ensemble of regression trees to determine how the hazard function varies according to the associated covariates. The ensemble model is trained using a gradient boosting method to optimize a smoothed approximation of the concordance index, which is one of the most widely used metrics in survival model performance evaluation. We implemented our model in a software package called GBMCI (gradient boosting machine for concordance index) and benchmarked the performance of our model against other popular survival models with a large-scale breast cancer prognosis dataset. Our experiment shows that GBMCI consistently outperforms other methods based on a number of covariate settings. GBMCI is implemented in R and is freely available online. Hindawi Publishing Corporation 2013 2013-11-20 /pmc/articles/PMC3853154/ /pubmed/24348746 http://dx.doi.org/10.1155/2013/873595 Text en Copyright © 2013 Yifei Chen et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Chen, Yifei
Jia, Zhenyu
Mercola, Dan
Xie, Xiaohui
A Gradient Boosting Algorithm for Survival Analysis via Direct Optimization of Concordance Index
title A Gradient Boosting Algorithm for Survival Analysis via Direct Optimization of Concordance Index
title_full A Gradient Boosting Algorithm for Survival Analysis via Direct Optimization of Concordance Index
title_fullStr A Gradient Boosting Algorithm for Survival Analysis via Direct Optimization of Concordance Index
title_full_unstemmed A Gradient Boosting Algorithm for Survival Analysis via Direct Optimization of Concordance Index
title_short A Gradient Boosting Algorithm for Survival Analysis via Direct Optimization of Concordance Index
title_sort gradient boosting algorithm for survival analysis via direct optimization of concordance index
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3853154/
https://www.ncbi.nlm.nih.gov/pubmed/24348746
http://dx.doi.org/10.1155/2013/873595
work_keys_str_mv AT chenyifei agradientboostingalgorithmforsurvivalanalysisviadirectoptimizationofconcordanceindex
AT jiazhenyu agradientboostingalgorithmforsurvivalanalysisviadirectoptimizationofconcordanceindex
AT mercoladan agradientboostingalgorithmforsurvivalanalysisviadirectoptimizationofconcordanceindex
AT xiexiaohui agradientboostingalgorithmforsurvivalanalysisviadirectoptimizationofconcordanceindex
AT chenyifei gradientboostingalgorithmforsurvivalanalysisviadirectoptimizationofconcordanceindex
AT jiazhenyu gradientboostingalgorithmforsurvivalanalysisviadirectoptimizationofconcordanceindex
AT mercoladan gradientboostingalgorithmforsurvivalanalysisviadirectoptimizationofconcordanceindex
AT xiexiaohui gradientboostingalgorithmforsurvivalanalysisviadirectoptimizationofconcordanceindex