Cargando…

A weighting approach for judging the effect of patient strata on high-dimensional risk prediction signatures

BACKGROUND: High-dimensional molecular measurements, e.g. gene expression data, can be linked to clinical time-to-event endpoints by Cox regression models and regularized estimation approaches, such as componentwise boosting, and can incorporate a large number of covariates as well as provide variab...

Descripción completa

Detalles Bibliográficos
Autores principales: Weyer, Veronika, Binder, Harald
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4572441/
https://www.ncbi.nlm.nih.gov/pubmed/26374641
http://dx.doi.org/10.1186/s12859-015-0716-8
_version_ 1782390399717867520
author Weyer, Veronika
Binder, Harald
author_facet Weyer, Veronika
Binder, Harald
author_sort Weyer, Veronika
collection PubMed
description BACKGROUND: High-dimensional molecular measurements, e.g. gene expression data, can be linked to clinical time-to-event endpoints by Cox regression models and regularized estimation approaches, such as componentwise boosting, and can incorporate a large number of covariates as well as provide variable selection. If there is heterogeneity due to known patient subgroups, a stratified Cox model allows for separate baseline hazards in each subgroup. Variable selection will still depend on the relative stratum sizes in the data, which might be a convenience sample and not representative for future applications. Such effects need to be systematically investigated and could even help to more reliably identify components of risk prediction signatures. RESULTS: Correspondingly, we propose a weighted regression approach based on componentwise likelihood-based boosting which is implemented in the R package CoxBoost (https://github.com/binderh/CoxBoost). This approach focuses on building a risk prediction signature for a specific stratum by down-weighting the observations from the other strata using a range of weights. Stability of selection for specific covariates as a function of the weights is investigated by resampling inclusion frequencies, and two types of corresponding visualizations are suggested. This is illustrated for two applications with methylation and gene expression measurements from cancer patients. CONCLUSION: The proposed approach is meant to point out components of risk prediction signatures that are specific to the stratum of interest and components that are also important to other strata. Performance is mostly improved by incorporating down-weighted information from the other strata. This suggests more general usefulness for risk prediction signature development in data with heterogeneity due to known subgroups.
format Online
Article
Text
id pubmed-4572441
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-45724412015-09-18 A weighting approach for judging the effect of patient strata on high-dimensional risk prediction signatures Weyer, Veronika Binder, Harald BMC Bioinformatics Research Article BACKGROUND: High-dimensional molecular measurements, e.g. gene expression data, can be linked to clinical time-to-event endpoints by Cox regression models and regularized estimation approaches, such as componentwise boosting, and can incorporate a large number of covariates as well as provide variable selection. If there is heterogeneity due to known patient subgroups, a stratified Cox model allows for separate baseline hazards in each subgroup. Variable selection will still depend on the relative stratum sizes in the data, which might be a convenience sample and not representative for future applications. Such effects need to be systematically investigated and could even help to more reliably identify components of risk prediction signatures. RESULTS: Correspondingly, we propose a weighted regression approach based on componentwise likelihood-based boosting which is implemented in the R package CoxBoost (https://github.com/binderh/CoxBoost). This approach focuses on building a risk prediction signature for a specific stratum by down-weighting the observations from the other strata using a range of weights. Stability of selection for specific covariates as a function of the weights is investigated by resampling inclusion frequencies, and two types of corresponding visualizations are suggested. This is illustrated for two applications with methylation and gene expression measurements from cancer patients. CONCLUSION: The proposed approach is meant to point out components of risk prediction signatures that are specific to the stratum of interest and components that are also important to other strata. Performance is mostly improved by incorporating down-weighted information from the other strata. This suggests more general usefulness for risk prediction signature development in data with heterogeneity due to known subgroups. BioMed Central 2015-09-15 /pmc/articles/PMC4572441/ /pubmed/26374641 http://dx.doi.org/10.1186/s12859-015-0716-8 Text en © Weyer and Binder. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Weyer, Veronika
Binder, Harald
A weighting approach for judging the effect of patient strata on high-dimensional risk prediction signatures
title A weighting approach for judging the effect of patient strata on high-dimensional risk prediction signatures
title_full A weighting approach for judging the effect of patient strata on high-dimensional risk prediction signatures
title_fullStr A weighting approach for judging the effect of patient strata on high-dimensional risk prediction signatures
title_full_unstemmed A weighting approach for judging the effect of patient strata on high-dimensional risk prediction signatures
title_short A weighting approach for judging the effect of patient strata on high-dimensional risk prediction signatures
title_sort weighting approach for judging the effect of patient strata on high-dimensional risk prediction signatures
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4572441/
https://www.ncbi.nlm.nih.gov/pubmed/26374641
http://dx.doi.org/10.1186/s12859-015-0716-8
work_keys_str_mv AT weyerveronika aweightingapproachforjudgingtheeffectofpatientstrataonhighdimensionalriskpredictionsignatures
AT binderharald aweightingapproachforjudgingtheeffectofpatientstrataonhighdimensionalriskpredictionsignatures
AT weyerveronika weightingapproachforjudgingtheeffectofpatientstrataonhighdimensionalriskpredictionsignatures
AT binderharald weightingapproachforjudgingtheeffectofpatientstrataonhighdimensionalriskpredictionsignatures