Cargando…

Allowing for mandatory covariates in boosting estimation of sparse high-dimensional survival models

BACKGROUND: When predictive survival models are built from high-dimensional data, there are often additional covariates, such as clinical scores, that by all means have to be included into the final model. While there are several techniques for the fitting of sparse high-dimensional survival models...

Descripción completa

Detalles Bibliográficos
Autores principales: Binder, Harald, Schumacher, Martin
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2245904/
https://www.ncbi.nlm.nih.gov/pubmed/18186927
http://dx.doi.org/10.1186/1471-2105-9-14
_version_ 1782150681498484736
author Binder, Harald
Schumacher, Martin
author_facet Binder, Harald
Schumacher, Martin
author_sort Binder, Harald
collection PubMed
description BACKGROUND: When predictive survival models are built from high-dimensional data, there are often additional covariates, such as clinical scores, that by all means have to be included into the final model. While there are several techniques for the fitting of sparse high-dimensional survival models by penalized parameter estimation, none allows for explicit consideration of such mandatory covariates. RESULTS: We introduce a new boosting algorithm for censored time-to-event data that shares the favorable properties of existing approaches, i.e., it results in sparse models with good prediction performance, but uses an offset-based update mechanism. The latter allows for tailored penalization of the covariates under consideration. Specifically, unpenalized mandatory covariates can be introduced. Microarray survival data from patients with diffuse large B-cell lymphoma, in combination with the recent, bootstrap-based prediction error curve technique, is used to illustrate the advantages of the new procedure. CONCLUSION: It is demonstrated that it can be highly beneficial in terms of prediction performance to use an estimation procedure that incorporates mandatory covariates into high-dimensional survival models. The new approach also allows to answer the question whether improved predictions are obtained by including microarray features in addition to classical clinical criteria.
format Text
id pubmed-2245904
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-22459042008-02-20 Allowing for mandatory covariates in boosting estimation of sparse high-dimensional survival models Binder, Harald Schumacher, Martin BMC Bioinformatics Methodology Article BACKGROUND: When predictive survival models are built from high-dimensional data, there are often additional covariates, such as clinical scores, that by all means have to be included into the final model. While there are several techniques for the fitting of sparse high-dimensional survival models by penalized parameter estimation, none allows for explicit consideration of such mandatory covariates. RESULTS: We introduce a new boosting algorithm for censored time-to-event data that shares the favorable properties of existing approaches, i.e., it results in sparse models with good prediction performance, but uses an offset-based update mechanism. The latter allows for tailored penalization of the covariates under consideration. Specifically, unpenalized mandatory covariates can be introduced. Microarray survival data from patients with diffuse large B-cell lymphoma, in combination with the recent, bootstrap-based prediction error curve technique, is used to illustrate the advantages of the new procedure. CONCLUSION: It is demonstrated that it can be highly beneficial in terms of prediction performance to use an estimation procedure that incorporates mandatory covariates into high-dimensional survival models. The new approach also allows to answer the question whether improved predictions are obtained by including microarray features in addition to classical clinical criteria. BioMed Central 2008-01-10 /pmc/articles/PMC2245904/ /pubmed/18186927 http://dx.doi.org/10.1186/1471-2105-9-14 Text en Copyright ©2008 Binder and Schumacher; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Binder, Harald
Schumacher, Martin
Allowing for mandatory covariates in boosting estimation of sparse high-dimensional survival models
title Allowing for mandatory covariates in boosting estimation of sparse high-dimensional survival models
title_full Allowing for mandatory covariates in boosting estimation of sparse high-dimensional survival models
title_fullStr Allowing for mandatory covariates in boosting estimation of sparse high-dimensional survival models
title_full_unstemmed Allowing for mandatory covariates in boosting estimation of sparse high-dimensional survival models
title_short Allowing for mandatory covariates in boosting estimation of sparse high-dimensional survival models
title_sort allowing for mandatory covariates in boosting estimation of sparse high-dimensional survival models
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2245904/
https://www.ncbi.nlm.nih.gov/pubmed/18186927
http://dx.doi.org/10.1186/1471-2105-9-14
work_keys_str_mv AT binderharald allowingformandatorycovariatesinboostingestimationofsparsehighdimensionalsurvivalmodels
AT schumachermartin allowingformandatorycovariatesinboostingestimationofsparsehighdimensionalsurvivalmodels