Cargando…

A comparison of time to event analysis methods, using weight status and breast cancer as a case study

Survival analysis with cohort study data has been traditionally performed using Cox proportional hazards models. Random survival forests (RSFs), a machine learning method, now present an alternative method. Using the UK Women’s Cohort Study (n = 34,493) we evaluate two methods: a Cox model and an RS...

Descripción completa

Detalles Bibliográficos
Autores principales: Aivaliotis, Georgios, Palczewski, Jan, Atkinson, Rebecca, Cade, Janet E., Morris, Michelle A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8263588/
https://www.ncbi.nlm.nih.gov/pubmed/34234154
http://dx.doi.org/10.1038/s41598-021-92944-z
_version_ 1783719416274354176
author Aivaliotis, Georgios
Palczewski, Jan
Atkinson, Rebecca
Cade, Janet E.
Morris, Michelle A.
author_facet Aivaliotis, Georgios
Palczewski, Jan
Atkinson, Rebecca
Cade, Janet E.
Morris, Michelle A.
author_sort Aivaliotis, Georgios
collection PubMed
description Survival analysis with cohort study data has been traditionally performed using Cox proportional hazards models. Random survival forests (RSFs), a machine learning method, now present an alternative method. Using the UK Women’s Cohort Study (n = 34,493) we evaluate two methods: a Cox model and an RSF, to investigate the association between Body Mass Index and time to breast cancer incidence. Robustness of the models were assessed by cross validation and bootstraping. Histograms of bootstrap coefficients are reported. C-Indices and Integrated Brier Scores are reported for all models. In post-menopausal women, the Cox model Hazard Ratios (HR) for Overweight (OW) and Obese (O) were 1.25 (1.04, 1.51) and 1.28 (0.98, 1.68) respectively and the RSF Odds Ratios (OR) with partial dependence on menopause for OW and O were 1.34 (1.31, 1.70) and 1.45 (1.42, 1.48). HR are non-significant results. Only the RSF appears confident about the effect of weight status on time to event. Bootstrapping demonstrated Cox model coefficients can vary significantly, weakening interpretation potential. An RSF was used to produce partial dependence plots (PDPs) showing OW and O weight status increase the probability of breast cancer incidence in post-menopausal women. All models have relatively low C-Index and high Integrated Brier Score. The RSF overfits the data. In our study, RSF can identify complex non-proportional hazard type patterns in the data, and allow more complicated relationships to be investigated using PDPs, but it overfits limiting extrapolation of results to new instances. Moreover, it is less easily interpreted than Cox models. The value of survival analysis remains paramount and therefore machine learning techniques like RSF should be considered as another method for analysis.
format Online
Article
Text
id pubmed-8263588
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-82635882021-07-09 A comparison of time to event analysis methods, using weight status and breast cancer as a case study Aivaliotis, Georgios Palczewski, Jan Atkinson, Rebecca Cade, Janet E. Morris, Michelle A. Sci Rep Article Survival analysis with cohort study data has been traditionally performed using Cox proportional hazards models. Random survival forests (RSFs), a machine learning method, now present an alternative method. Using the UK Women’s Cohort Study (n = 34,493) we evaluate two methods: a Cox model and an RSF, to investigate the association between Body Mass Index and time to breast cancer incidence. Robustness of the models were assessed by cross validation and bootstraping. Histograms of bootstrap coefficients are reported. C-Indices and Integrated Brier Scores are reported for all models. In post-menopausal women, the Cox model Hazard Ratios (HR) for Overweight (OW) and Obese (O) were 1.25 (1.04, 1.51) and 1.28 (0.98, 1.68) respectively and the RSF Odds Ratios (OR) with partial dependence on menopause for OW and O were 1.34 (1.31, 1.70) and 1.45 (1.42, 1.48). HR are non-significant results. Only the RSF appears confident about the effect of weight status on time to event. Bootstrapping demonstrated Cox model coefficients can vary significantly, weakening interpretation potential. An RSF was used to produce partial dependence plots (PDPs) showing OW and O weight status increase the probability of breast cancer incidence in post-menopausal women. All models have relatively low C-Index and high Integrated Brier Score. The RSF overfits the data. In our study, RSF can identify complex non-proportional hazard type patterns in the data, and allow more complicated relationships to be investigated using PDPs, but it overfits limiting extrapolation of results to new instances. Moreover, it is less easily interpreted than Cox models. The value of survival analysis remains paramount and therefore machine learning techniques like RSF should be considered as another method for analysis. Nature Publishing Group UK 2021-07-07 /pmc/articles/PMC8263588/ /pubmed/34234154 http://dx.doi.org/10.1038/s41598-021-92944-z Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Aivaliotis, Georgios
Palczewski, Jan
Atkinson, Rebecca
Cade, Janet E.
Morris, Michelle A.
A comparison of time to event analysis methods, using weight status and breast cancer as a case study
title A comparison of time to event analysis methods, using weight status and breast cancer as a case study
title_full A comparison of time to event analysis methods, using weight status and breast cancer as a case study
title_fullStr A comparison of time to event analysis methods, using weight status and breast cancer as a case study
title_full_unstemmed A comparison of time to event analysis methods, using weight status and breast cancer as a case study
title_short A comparison of time to event analysis methods, using weight status and breast cancer as a case study
title_sort comparison of time to event analysis methods, using weight status and breast cancer as a case study
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8263588/
https://www.ncbi.nlm.nih.gov/pubmed/34234154
http://dx.doi.org/10.1038/s41598-021-92944-z
work_keys_str_mv AT aivaliotisgeorgios acomparisonoftimetoeventanalysismethodsusingweightstatusandbreastcancerasacasestudy
AT palczewskijan acomparisonoftimetoeventanalysismethodsusingweightstatusandbreastcancerasacasestudy
AT atkinsonrebecca acomparisonoftimetoeventanalysismethodsusingweightstatusandbreastcancerasacasestudy
AT cadejanete acomparisonoftimetoeventanalysismethodsusingweightstatusandbreastcancerasacasestudy
AT morrismichellea acomparisonoftimetoeventanalysismethodsusingweightstatusandbreastcancerasacasestudy
AT aivaliotisgeorgios comparisonoftimetoeventanalysismethodsusingweightstatusandbreastcancerasacasestudy
AT palczewskijan comparisonoftimetoeventanalysismethodsusingweightstatusandbreastcancerasacasestudy
AT atkinsonrebecca comparisonoftimetoeventanalysismethodsusingweightstatusandbreastcancerasacasestudy
AT cadejanete comparisonoftimetoeventanalysismethodsusingweightstatusandbreastcancerasacasestudy
AT morrismichellea comparisonoftimetoeventanalysismethodsusingweightstatusandbreastcancerasacasestudy