Correlation‐adjusted regression survival scores for high‐dimensional variable selection

Bibliographic Details
Main Authors: Welchowski, Thomas, Zuber, Verena, Schmid, Matthias
Format: Online Article Text
Language: English
Published: John Wiley and Sons Inc. 2019
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6519238/
https://www.ncbi.nlm.nih.gov/pubmed/30793795
http://dx.doi.org/10.1002/sim.8116
author Welchowski, Thomas
Zuber, Verena
Schmid, Matthias
collection PubMed
description Background: The development of classification methods for personalized medicine is highly dependent on the identification of predictive genetic markers. In survival analysis, it is often necessary to discriminate between influential and noninfluential markers. It is common to perform univariate screening using Cox scores, which quantify the associations between survival and each of the markers to provide a ranking. Since Cox scores do not account for dependencies between the markers, their use is suboptimal in the presence of highly correlated markers. Methods: As an alternative to the Cox score, we propose the correlation‐adjusted regression survival (CARS) score for right‐censored survival outcomes. By removing the correlations between the markers, the CARS score quantifies the associations between the outcome and the set of “decorrelated” marker values. Estimation of the scores is based on inverse probability weighting, which is applied to log‐transformed event times. For high‐dimensional data, estimation is based on shrinkage techniques. Results: The consistency of the CARS score is proven under mild regularity conditions. In simulations with high correlations, survival models based on CARS score rankings achieved higher areas under the precision‐recall curve than competing methods. Two example applications on prostate and breast cancer confirmed these results. CARS scores are implemented in the R package carSurv. Conclusions: In research applications involving high‐dimensional genetic data, the use of CARS scores for marker selection is a favorable alternative to Cox scores even when correlations between covariates are low. Having a straightforward interpretation and low computational requirements, CARS scores are an easy‐to‐use screening tool in personalized medicine research.
format Online
Article
Text
id pubmed-6519238
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-6519238 2019-05-21 Correlation‐adjusted regression survival scores for high‐dimensional variable selection Welchowski, Thomas Zuber, Verena Schmid, Matthias Stat Med Research Articles John Wiley and Sons Inc. 2019-02-22 2019-06-15 /pmc/articles/PMC6519238/ /pubmed/30793795 http://dx.doi.org/10.1002/sim.8116 Text en © 2019 The Authors. Statistics in Medicine Published by John Wiley & Sons Ltd. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
title Correlation‐adjusted regression survival scores for high‐dimensional variable selection
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6519238/
https://www.ncbi.nlm.nih.gov/pubmed/30793795
http://dx.doi.org/10.1002/sim.8116
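The Methods summary in the description field (inverse probability weighting on log‐transformed event times, followed by decorrelation of the markers) can be illustrated with a minimal Python sketch. This is not the carSurv implementation and is not taken from the article: the function names, the fixed shrinkage intensity `lam`, the naive handling of ties, and the choice to estimate the marker correlation matrix from all observations without weights are assumptions made purely for illustration.

```python
import numpy as np

def censoring_survival_left(time, event):
    """Kaplan-Meier estimate of the censoring survival function G(t-),
    evaluated just before each subject's observed time.
    Assumption: ties between events and censorings are handled naively."""
    time = np.asarray(time, dtype=float)
    event = np.asarray(event, dtype=int)
    g = np.ones(len(time))
    cens_times = np.unique(time[event == 0])
    for i, t in enumerate(time):
        for u in cens_times[cens_times < t]:
            at_risk = np.sum(time >= u)
            n_cens = np.sum((time == u) & (event == 0))
            g[i] *= 1.0 - n_cens / at_risk
    return g

def cars_scores_sketch(X, time, event, lam=0.1):
    """Illustrative correlation-adjusted scores for right-censored data.
    lam is a hypothetical fixed shrinkage intensity in (0, 1]."""
    X = np.asarray(X, dtype=float)
    time = np.asarray(time, dtype=float)
    event = np.asarray(event, dtype=int)
    y = np.log(time)  # log-transformed observed times

    # Inverse probability of censoring weights: censored subjects get weight 0.
    w = event / censoring_survival_left(time, event)
    w = w / w.sum()

    # Weighted marginal correlations between each marker and log time.
    xc = X - w @ X
    yc = y - w @ y
    r_xy = ((w * yc) @ xc) / (np.sqrt(w @ xc**2) * np.sqrt(w @ yc**2))

    # Marker correlation matrix, shrunk toward the identity so that it
    # stays invertible when there are more markers than subjects.
    R = np.corrcoef(X, rowvar=False)
    R = (1.0 - lam) * R + lam * np.eye(X.shape[1])

    # Decorrelate: multiply by the inverse matrix square root of R.
    vals, vecs = np.linalg.eigh(R)
    R_inv_sqrt = (vecs / np.sqrt(vals)) @ vecs.T
    return R_inv_sqrt @ r_xy

# Usage on synthetic data (all values are made up for the example).
rng = np.random.default_rng(0)
X_demo = rng.normal(size=(50, 4))
time_demo = np.exp(0.8 * X_demo[:, 0] + rng.normal(size=50))
event_demo = (rng.random(50) < 0.7).astype(int)
scores = cars_scores_sketch(X_demo, time_demo, event_demo)
ranking = np.argsort(-np.abs(scores))  # markers ordered by absolute score
```

With `lam=1.0` the correlation matrix is replaced by the identity, so the sketch reduces to ranking markers by their weighted marginal correlations with log time, which is the kind of univariate screening the abstract contrasts with.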