Cargando…

Model-based optimization of subgroup weights for survival analysis

MOTIVATION: To obtain a reliable prediction model for a specific cancer subgroup or cohort is often difficult due to limited sample size and, in survival analysis, due to potentially high censoring rates. Sometimes similar data from other patient subgroups are available, e.g. from other clinical cen...

Descripción completa

Detalles Bibliográficos
Autores principales: Richter, Jakob, Madjar, Katrin, Rahnenführer, Jörg
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6612842/
https://www.ncbi.nlm.nih.gov/pubmed/31510644
http://dx.doi.org/10.1093/bioinformatics/btz361
_version_ 1783432949422620672
author Richter, Jakob
Madjar, Katrin
Rahnenführer, Jörg
author_facet Richter, Jakob
Madjar, Katrin
Rahnenführer, Jörg
author_sort Richter, Jakob
collection PubMed
description MOTIVATION: To obtain a reliable prediction model for a specific cancer subgroup or cohort is often difficult due to limited sample size and, in survival analysis, due to potentially high censoring rates. Sometimes similar data from other patient subgroups are available, e.g. from other clinical centers. Simple pooling of all subgroups can decrease the variance of the predicted parameters of the prediction models, but also increase the bias due to heterogeneity between the cohorts. A promising compromise is to identify those subgroups with a similar relationship between covariates and target variable and then include only these for model building. RESULTS: We propose a subgroup-based weighted likelihood approach for survival prediction with high-dimensional genetic covariates. When predicting survival for a specific subgroup, for every other subgroup an individual weight determines the strength with which its observations enter into model building. MBO (model-based optimization) can be used to quickly find a good prediction model in the presence of a large number of hyperparameters. We use MBO to identify the best model for survival prediction of a specific subgroup by optimizing the weights for additional subgroups for a Cox model. The approach is evaluated on a set of lung cancer cohorts with gene expression measurements. The resulting models have competitive prediction quality, and they reflect the similarity of the corresponding cancer subgroups, with both weights close to 0 and close to 1 and medium weights. AVAILABILITY AND IMPLEMENTATION: mlrMBO is implemented as an R-package and is freely available at http://github.com/mlr-org/mlrMBO.
format Online
Article
Text
id pubmed-6612842
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-66128422019-07-12 Model-based optimization of subgroup weights for survival analysis Richter, Jakob Madjar, Katrin Rahnenführer, Jörg Bioinformatics Ismb/Eccb 2019 Conference Proceedings MOTIVATION: To obtain a reliable prediction model for a specific cancer subgroup or cohort is often difficult due to limited sample size and, in survival analysis, due to potentially high censoring rates. Sometimes similar data from other patient subgroups are available, e.g. from other clinical centers. Simple pooling of all subgroups can decrease the variance of the predicted parameters of the prediction models, but also increase the bias due to heterogeneity between the cohorts. A promising compromise is to identify those subgroups with a similar relationship between covariates and target variable and then include only these for model building. RESULTS: We propose a subgroup-based weighted likelihood approach for survival prediction with high-dimensional genetic covariates. When predicting survival for a specific subgroup, for every other subgroup an individual weight determines the strength with which its observations enter into model building. MBO (model-based optimization) can be used to quickly find a good prediction model in the presence of a large number of hyperparameters. We use MBO to identify the best model for survival prediction of a specific subgroup by optimizing the weights for additional subgroups for a Cox model. The approach is evaluated on a set of lung cancer cohorts with gene expression measurements. The resulting models have competitive prediction quality, and they reflect the similarity of the corresponding cancer subgroups, with both weights close to 0 and close to 1 and medium weights. AVAILABILITY AND IMPLEMENTATION: mlrMBO is implemented as an R-package and is freely available at http://github.com/mlr-org/mlrMBO. Oxford University Press 2019-07 2019-07-05 /pmc/articles/PMC6612842/ /pubmed/31510644 http://dx.doi.org/10.1093/bioinformatics/btz361 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb/Eccb 2019 Conference Proceedings
Richter, Jakob
Madjar, Katrin
Rahnenführer, Jörg
Model-based optimization of subgroup weights for survival analysis
title Model-based optimization of subgroup weights for survival analysis
title_full Model-based optimization of subgroup weights for survival analysis
title_fullStr Model-based optimization of subgroup weights for survival analysis
title_full_unstemmed Model-based optimization of subgroup weights for survival analysis
title_short Model-based optimization of subgroup weights for survival analysis
title_sort model-based optimization of subgroup weights for survival analysis
topic Ismb/Eccb 2019 Conference Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6612842/
https://www.ncbi.nlm.nih.gov/pubmed/31510644
http://dx.doi.org/10.1093/bioinformatics/btz361
work_keys_str_mv AT richterjakob modelbasedoptimizationofsubgroupweightsforsurvivalanalysis
AT madjarkatrin modelbasedoptimizationofsubgroupweightsforsurvivalanalysis
AT rahnenfuhrerjorg modelbasedoptimizationofsubgroupweightsforsurvivalanalysis