Cargando…
Model-based optimization of subgroup weights for survival analysis
MOTIVATION: To obtain a reliable prediction model for a specific cancer subgroup or cohort is often difficult due to limited sample size and, in survival analysis, due to potentially high censoring rates. Sometimes similar data from other patient subgroups are available, e.g. from other clinical cen...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6612842/ https://www.ncbi.nlm.nih.gov/pubmed/31510644 http://dx.doi.org/10.1093/bioinformatics/btz361 |
_version_ | 1783432949422620672 |
---|---|
author | Richter, Jakob Madjar, Katrin Rahnenführer, Jörg |
author_facet | Richter, Jakob Madjar, Katrin Rahnenführer, Jörg |
author_sort | Richter, Jakob |
collection | PubMed |
description | MOTIVATION: To obtain a reliable prediction model for a specific cancer subgroup or cohort is often difficult due to limited sample size and, in survival analysis, due to potentially high censoring rates. Sometimes similar data from other patient subgroups are available, e.g. from other clinical centers. Simple pooling of all subgroups can decrease the variance of the predicted parameters of the prediction models, but also increase the bias due to heterogeneity between the cohorts. A promising compromise is to identify those subgroups with a similar relationship between covariates and target variable and then include only these for model building. RESULTS: We propose a subgroup-based weighted likelihood approach for survival prediction with high-dimensional genetic covariates. When predicting survival for a specific subgroup, for every other subgroup an individual weight determines the strength with which its observations enter into model building. MBO (model-based optimization) can be used to quickly find a good prediction model in the presence of a large number of hyperparameters. We use MBO to identify the best model for survival prediction of a specific subgroup by optimizing the weights for additional subgroups for a Cox model. The approach is evaluated on a set of lung cancer cohorts with gene expression measurements. The resulting models have competitive prediction quality, and they reflect the similarity of the corresponding cancer subgroups, with both weights close to 0 and close to 1 and medium weights. AVAILABILITY AND IMPLEMENTATION: mlrMBO is implemented as an R-package and is freely available at http://github.com/mlr-org/mlrMBO. |
format | Online Article Text |
id | pubmed-6612842 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-66128422019-07-12 Model-based optimization of subgroup weights for survival analysis Richter, Jakob Madjar, Katrin Rahnenführer, Jörg Bioinformatics Ismb/Eccb 2019 Conference Proceedings MOTIVATION: To obtain a reliable prediction model for a specific cancer subgroup or cohort is often difficult due to limited sample size and, in survival analysis, due to potentially high censoring rates. Sometimes similar data from other patient subgroups are available, e.g. from other clinical centers. Simple pooling of all subgroups can decrease the variance of the predicted parameters of the prediction models, but also increase the bias due to heterogeneity between the cohorts. A promising compromise is to identify those subgroups with a similar relationship between covariates and target variable and then include only these for model building. RESULTS: We propose a subgroup-based weighted likelihood approach for survival prediction with high-dimensional genetic covariates. When predicting survival for a specific subgroup, for every other subgroup an individual weight determines the strength with which its observations enter into model building. MBO (model-based optimization) can be used to quickly find a good prediction model in the presence of a large number of hyperparameters. We use MBO to identify the best model for survival prediction of a specific subgroup by optimizing the weights for additional subgroups for a Cox model. The approach is evaluated on a set of lung cancer cohorts with gene expression measurements. The resulting models have competitive prediction quality, and they reflect the similarity of the corresponding cancer subgroups, with both weights close to 0 and close to 1 and medium weights. AVAILABILITY AND IMPLEMENTATION: mlrMBO is implemented as an R-package and is freely available at http://github.com/mlr-org/mlrMBO. Oxford University Press 2019-07 2019-07-05 /pmc/articles/PMC6612842/ /pubmed/31510644 http://dx.doi.org/10.1093/bioinformatics/btz361 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Ismb/Eccb 2019 Conference Proceedings Richter, Jakob Madjar, Katrin Rahnenführer, Jörg Model-based optimization of subgroup weights for survival analysis |
title | Model-based optimization of subgroup weights for survival analysis |
title_full | Model-based optimization of subgroup weights for survival analysis |
title_fullStr | Model-based optimization of subgroup weights for survival analysis |
title_full_unstemmed | Model-based optimization of subgroup weights for survival analysis |
title_short | Model-based optimization of subgroup weights for survival analysis |
title_sort | model-based optimization of subgroup weights for survival analysis |
topic | Ismb/Eccb 2019 Conference Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6612842/ https://www.ncbi.nlm.nih.gov/pubmed/31510644 http://dx.doi.org/10.1093/bioinformatics/btz361 |
work_keys_str_mv | AT richterjakob modelbasedoptimizationofsubgroupweightsforsurvivalanalysis AT madjarkatrin modelbasedoptimizationofsubgroupweightsforsurvivalanalysis AT rahnenfuhrerjorg modelbasedoptimizationofsubgroupweightsforsurvivalanalysis |