Cargando…
A weighting approach for judging the effect of patient strata on high-dimensional risk prediction signatures
BACKGROUND: High-dimensional molecular measurements, e.g. gene expression data, can be linked to clinical time-to-event endpoints by Cox regression models and regularized estimation approaches, such as componentwise boosting, and can incorporate a large number of covariates as well as provide variab...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4572441/ https://www.ncbi.nlm.nih.gov/pubmed/26374641 http://dx.doi.org/10.1186/s12859-015-0716-8 |
_version_ | 1782390399717867520 |
---|---|
author | Weyer, Veronika Binder, Harald |
author_facet | Weyer, Veronika Binder, Harald |
author_sort | Weyer, Veronika |
collection | PubMed |
description | BACKGROUND: High-dimensional molecular measurements, e.g. gene expression data, can be linked to clinical time-to-event endpoints by Cox regression models and regularized estimation approaches, such as componentwise boosting, and can incorporate a large number of covariates as well as provide variable selection. If there is heterogeneity due to known patient subgroups, a stratified Cox model allows for separate baseline hazards in each subgroup. Variable selection will still depend on the relative stratum sizes in the data, which might be a convenience sample and not representative for future applications. Such effects need to be systematically investigated and could even help to more reliably identify components of risk prediction signatures. RESULTS: Correspondingly, we propose a weighted regression approach based on componentwise likelihood-based boosting which is implemented in the R package CoxBoost (https://github.com/binderh/CoxBoost). This approach focuses on building a risk prediction signature for a specific stratum by down-weighting the observations from the other strata using a range of weights. Stability of selection for specific covariates as a function of the weights is investigated by resampling inclusion frequencies, and two types of corresponding visualizations are suggested. This is illustrated for two applications with methylation and gene expression measurements from cancer patients. CONCLUSION: The proposed approach is meant to point out components of risk prediction signatures that are specific to the stratum of interest and components that are also important to other strata. Performance is mostly improved by incorporating down-weighted information from the other strata. This suggests more general usefulness for risk prediction signature development in data with heterogeneity due to known subgroups. |
format | Online Article Text |
id | pubmed-4572441 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-45724412015-09-18 A weighting approach for judging the effect of patient strata on high-dimensional risk prediction signatures Weyer, Veronika Binder, Harald BMC Bioinformatics Research Article BACKGROUND: High-dimensional molecular measurements, e.g. gene expression data, can be linked to clinical time-to-event endpoints by Cox regression models and regularized estimation approaches, such as componentwise boosting, and can incorporate a large number of covariates as well as provide variable selection. If there is heterogeneity due to known patient subgroups, a stratified Cox model allows for separate baseline hazards in each subgroup. Variable selection will still depend on the relative stratum sizes in the data, which might be a convenience sample and not representative for future applications. Such effects need to be systematically investigated and could even help to more reliably identify components of risk prediction signatures. RESULTS: Correspondingly, we propose a weighted regression approach based on componentwise likelihood-based boosting which is implemented in the R package CoxBoost (https://github.com/binderh/CoxBoost). This approach focuses on building a risk prediction signature for a specific stratum by down-weighting the observations from the other strata using a range of weights. Stability of selection for specific covariates as a function of the weights is investigated by resampling inclusion frequencies, and two types of corresponding visualizations are suggested. This is illustrated for two applications with methylation and gene expression measurements from cancer patients. CONCLUSION: The proposed approach is meant to point out components of risk prediction signatures that are specific to the stratum of interest and components that are also important to other strata. Performance is mostly improved by incorporating down-weighted information from the other strata. This suggests more general usefulness for risk prediction signature development in data with heterogeneity due to known subgroups. BioMed Central 2015-09-15 /pmc/articles/PMC4572441/ /pubmed/26374641 http://dx.doi.org/10.1186/s12859-015-0716-8 Text en © Weyer and Binder. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Weyer, Veronika Binder, Harald A weighting approach for judging the effect of patient strata on high-dimensional risk prediction signatures |
title | A weighting approach for judging the effect of patient strata on high-dimensional risk prediction signatures |
title_full | A weighting approach for judging the effect of patient strata on high-dimensional risk prediction signatures |
title_fullStr | A weighting approach for judging the effect of patient strata on high-dimensional risk prediction signatures |
title_full_unstemmed | A weighting approach for judging the effect of patient strata on high-dimensional risk prediction signatures |
title_short | A weighting approach for judging the effect of patient strata on high-dimensional risk prediction signatures |
title_sort | weighting approach for judging the effect of patient strata on high-dimensional risk prediction signatures |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4572441/ https://www.ncbi.nlm.nih.gov/pubmed/26374641 http://dx.doi.org/10.1186/s12859-015-0716-8 |
work_keys_str_mv | AT weyerveronika aweightingapproachforjudgingtheeffectofpatientstrataonhighdimensionalriskpredictionsignatures AT binderharald aweightingapproachforjudgingtheeffectofpatientstrataonhighdimensionalriskpredictionsignatures AT weyerveronika weightingapproachforjudgingtheeffectofpatientstrataonhighdimensionalriskpredictionsignatures AT binderharald weightingapproachforjudgingtheeffectofpatientstrataonhighdimensionalriskpredictionsignatures |