Cargando…
Comparing machine learning approaches to incorporate time-varying covariates in predicting cancer survival time
The Cox proportional hazards model is commonly used in evaluating risk factors in cancer survival data. The model assumes an additive, linear relationship between the risk factors and the log hazard. However, this assumption may be too simplistic. Further, failure to take time-varying covariates int...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9877029/ https://www.ncbi.nlm.nih.gov/pubmed/36697455 http://dx.doi.org/10.1038/s41598-023-28393-7 |
_version_ | 1784878296535662592 |
---|---|
author | Cygu, Steve Seow, Hsien Dushoff, Jonathan Bolker, Benjamin M. |
author_facet | Cygu, Steve Seow, Hsien Dushoff, Jonathan Bolker, Benjamin M. |
author_sort | Cygu, Steve |
collection | PubMed |
description | The Cox proportional hazards model is commonly used in evaluating risk factors in cancer survival data. The model assumes an additive, linear relationship between the risk factors and the log hazard. However, this assumption may be too simplistic. Further, failure to take time-varying covariates into account, if present, may lower prediction accuracy. In this retrospective, population-based, prognostic study of data from patients diagnosed with cancer from 2008 to 2015 in Ontario, Canada, we applied machine learning-based time-to-event prediction methods and compared their predictive performance in two sets of analyses: (1) yearly-cohort-based time-invariant and (2) fully time-varying covariates analysis. Machine learning-based methods—gradient boosting model (gbm), random survival forest (rsf), elastic net (enet), lasso and ridge—were compared to the traditional Cox proportional hazards (coxph) model and the prior study which used the yearly-cohort-based time-invariant analysis. Using Harrell’s C index as our primary measure, we found that using both machine learning techniques and incorporating time-dependent covariates can improve predictive performance. Gradient boosting machine showed the best performance on test data in both time-invariant and time-varying covariates analysis. |
format | Online Article Text |
id | pubmed-9877029 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-98770292023-01-27 Comparing machine learning approaches to incorporate time-varying covariates in predicting cancer survival time Cygu, Steve Seow, Hsien Dushoff, Jonathan Bolker, Benjamin M. Sci Rep Article The Cox proportional hazards model is commonly used in evaluating risk factors in cancer survival data. The model assumes an additive, linear relationship between the risk factors and the log hazard. However, this assumption may be too simplistic. Further, failure to take time-varying covariates into account, if present, may lower prediction accuracy. In this retrospective, population-based, prognostic study of data from patients diagnosed with cancer from 2008 to 2015 in Ontario, Canada, we applied machine learning-based time-to-event prediction methods and compared their predictive performance in two sets of analyses: (1) yearly-cohort-based time-invariant and (2) fully time-varying covariates analysis. Machine learning-based methods—gradient boosting model (gbm), random survival forest (rsf), elastic net (enet), lasso and ridge—were compared to the traditional Cox proportional hazards (coxph) model and the prior study which used the yearly-cohort-based time-invariant analysis. Using Harrell’s C index as our primary measure, we found that using both machine learning techniques and incorporating time-dependent covariates can improve predictive performance. Gradient boosting machine showed the best performance on test data in both time-invariant and time-varying covariates analysis. Nature Publishing Group UK 2023-01-25 /pmc/articles/PMC9877029/ /pubmed/36697455 http://dx.doi.org/10.1038/s41598-023-28393-7 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Cygu, Steve Seow, Hsien Dushoff, Jonathan Bolker, Benjamin M. Comparing machine learning approaches to incorporate time-varying covariates in predicting cancer survival time |
title | Comparing machine learning approaches to incorporate time-varying covariates in predicting cancer survival time |
title_full | Comparing machine learning approaches to incorporate time-varying covariates in predicting cancer survival time |
title_fullStr | Comparing machine learning approaches to incorporate time-varying covariates in predicting cancer survival time |
title_full_unstemmed | Comparing machine learning approaches to incorporate time-varying covariates in predicting cancer survival time |
title_short | Comparing machine learning approaches to incorporate time-varying covariates in predicting cancer survival time |
title_sort | comparing machine learning approaches to incorporate time-varying covariates in predicting cancer survival time |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9877029/ https://www.ncbi.nlm.nih.gov/pubmed/36697455 http://dx.doi.org/10.1038/s41598-023-28393-7 |
work_keys_str_mv | AT cygusteve comparingmachinelearningapproachestoincorporatetimevaryingcovariatesinpredictingcancersurvivaltime AT seowhsien comparingmachinelearningapproachestoincorporatetimevaryingcovariatesinpredictingcancersurvivaltime AT dushoffjonathan comparingmachinelearningapproachestoincorporatetimevaryingcovariatesinpredictingcancersurvivaltime AT bolkerbenjaminm comparingmachinelearningapproachestoincorporatetimevaryingcovariatesinpredictingcancersurvivaltime |