Cargando…
Survival prediction models: an introduction to discrete-time modeling
BACKGROUND: Prediction models for time-to-event outcomes are commonly used in biomedical research to obtain subject-specific probabilities that aid in making important clinical care decisions. There are several regression and machine learning methods for building these models that have been designed...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9316420/ https://www.ncbi.nlm.nih.gov/pubmed/35883032 http://dx.doi.org/10.1186/s12874-022-01679-6 |
_version_ | 1784754807493361664 |
---|---|
author | Suresh, Krithika Severn, Cameron Ghosh, Debashis |
author_facet | Suresh, Krithika Severn, Cameron Ghosh, Debashis |
author_sort | Suresh, Krithika |
collection | PubMed |
description | BACKGROUND: Prediction models for time-to-event outcomes are commonly used in biomedical research to obtain subject-specific probabilities that aid in making important clinical care decisions. There are several regression and machine learning methods for building these models that have been designed or modified to account for the censoring that occurs in time-to-event data. Discrete-time survival models, which have often been overlooked in the literature, provide an alternative approach for predictive modeling in the presence of censoring with limited loss in predictive accuracy. These models can take advantage of the range of nonparametric machine learning classification algorithms and their available software to predict survival outcomes. METHODS: Discrete-time survival models are applied to a person-period data set to predict the hazard of experiencing the failure event in pre-specified time intervals. This framework allows for any binary classification method to be applied to predict these conditional survival probabilities. Using time-dependent performance metrics that account for censoring, we compare the predictions from parametric and machine learning classification approaches applied within the discrete time-to-event framework to those from continuous-time survival prediction models. We outline the process for training and validating discrete-time prediction models, and demonstrate its application using the open-source R statistical programming environment. RESULTS: Using publicly available data sets, we show that some discrete-time prediction models achieve better prediction performance than the continuous-time Cox proportional hazards model. Random survival forests, a machine learning algorithm adapted to survival data, also had improved performance compared to the Cox model, but was sometimes outperformed by the discrete-time approaches. In comparing the binary classification methods in the discrete time-to-event framework, the relative performance of the different methods varied depending on the data set. CONCLUSIONS: We present a guide for developing survival prediction models using discrete-time methods and assessing their predictive performance with the aim of encouraging their use in medical research settings. These methods can be applied to data sets that have continuous time-to-event outcomes and multiple clinical predictors. They can also be extended to accommodate new binary classification algorithms as they become available. We provide R code for fitting discrete-time survival prediction models in a github repository. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s12874-022-01679-6). |
format | Online Article Text |
id | pubmed-9316420 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-93164202022-07-27 Survival prediction models: an introduction to discrete-time modeling Suresh, Krithika Severn, Cameron Ghosh, Debashis BMC Med Res Methodol Research Article BACKGROUND: Prediction models for time-to-event outcomes are commonly used in biomedical research to obtain subject-specific probabilities that aid in making important clinical care decisions. There are several regression and machine learning methods for building these models that have been designed or modified to account for the censoring that occurs in time-to-event data. Discrete-time survival models, which have often been overlooked in the literature, provide an alternative approach for predictive modeling in the presence of censoring with limited loss in predictive accuracy. These models can take advantage of the range of nonparametric machine learning classification algorithms and their available software to predict survival outcomes. METHODS: Discrete-time survival models are applied to a person-period data set to predict the hazard of experiencing the failure event in pre-specified time intervals. This framework allows for any binary classification method to be applied to predict these conditional survival probabilities. Using time-dependent performance metrics that account for censoring, we compare the predictions from parametric and machine learning classification approaches applied within the discrete time-to-event framework to those from continuous-time survival prediction models. We outline the process for training and validating discrete-time prediction models, and demonstrate its application using the open-source R statistical programming environment. RESULTS: Using publicly available data sets, we show that some discrete-time prediction models achieve better prediction performance than the continuous-time Cox proportional hazards model. Random survival forests, a machine learning algorithm adapted to survival data, also had improved performance compared to the Cox model, but was sometimes outperformed by the discrete-time approaches. In comparing the binary classification methods in the discrete time-to-event framework, the relative performance of the different methods varied depending on the data set. CONCLUSIONS: We present a guide for developing survival prediction models using discrete-time methods and assessing their predictive performance with the aim of encouraging their use in medical research settings. These methods can be applied to data sets that have continuous time-to-event outcomes and multiple clinical predictors. They can also be extended to accommodate new binary classification algorithms as they become available. We provide R code for fitting discrete-time survival prediction models in a github repository. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s12874-022-01679-6). BioMed Central 2022-07-26 /pmc/articles/PMC9316420/ /pubmed/35883032 http://dx.doi.org/10.1186/s12874-022-01679-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Suresh, Krithika Severn, Cameron Ghosh, Debashis Survival prediction models: an introduction to discrete-time modeling |
title | Survival prediction models: an introduction to discrete-time modeling |
title_full | Survival prediction models: an introduction to discrete-time modeling |
title_fullStr | Survival prediction models: an introduction to discrete-time modeling |
title_full_unstemmed | Survival prediction models: an introduction to discrete-time modeling |
title_short | Survival prediction models: an introduction to discrete-time modeling |
title_sort | survival prediction models: an introduction to discrete-time modeling |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9316420/ https://www.ncbi.nlm.nih.gov/pubmed/35883032 http://dx.doi.org/10.1186/s12874-022-01679-6 |
work_keys_str_mv | AT sureshkrithika survivalpredictionmodelsanintroductiontodiscretetimemodeling AT severncameron survivalpredictionmodelsanintroductiontodiscretetimemodeling AT ghoshdebashis survivalpredictionmodelsanintroductiontodiscretetimemodeling |