Cargando…

Comparison of model-building strategies for excess hazard regression models in the context of cancer epidemiology

BACKGROUND: Large and complex population-based cancer data are becoming broadly available, thanks to purposeful linkage between cancer registry data and health electronic records. Aiming at understanding the explanatory power of factors on cancer survival, the modelling and selection of variables ne...

Descripción completa

Detalles Bibliográficos
Autores principales: Maringe, Camille, Belot, Aurélien, Rubio, Francisco Javier, Rachet, Bernard
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6869178/
https://www.ncbi.nlm.nih.gov/pubmed/31747928
http://dx.doi.org/10.1186/s12874-019-0830-9
_version_ 1783472379637268480
author Maringe, Camille
Belot, Aurélien
Rubio, Francisco Javier
Rachet, Bernard
author_facet Maringe, Camille
Belot, Aurélien
Rubio, Francisco Javier
Rachet, Bernard
author_sort Maringe, Camille
collection PubMed
description BACKGROUND: Large and complex population-based cancer data are becoming broadly available, thanks to purposeful linkage between cancer registry data and health electronic records. Aiming at understanding the explanatory power of factors on cancer survival, the modelling and selection of variables need to be understood and exploited properly for improving model-based estimates of cancer survival. METHOD: We assess the performances of well-known model selection strategies developed by Royston and Sauerbrei and Wynant and Abrahamowicz that we adapt to the relative survival data setting and to test for interaction terms. RESULTS: We apply these to all male patients diagnosed with lung cancer in England in 2012 (N = 15,688), and followed-up until 31/12/2015. We model the effects of age at diagnosis, tumour stage, deprivation, comorbidity and emergency presentation, as well as interactions between age and all of the above. Given the size of the dataset, all model selection strategies favoured virtually the same model, except for a non-linear effect of age at diagnosis selected by the backward-based selection strategies (versus a linear effect selected otherwise). CONCLUSION: The results from extensive simulations evaluating varying model complexity and sample sizes provide guidelines on a model selection strategy in the context of excess hazard modelling.
format Online
Article
Text
id pubmed-6869178
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-68691782019-12-12 Comparison of model-building strategies for excess hazard regression models in the context of cancer epidemiology Maringe, Camille Belot, Aurélien Rubio, Francisco Javier Rachet, Bernard BMC Med Res Methodol Research Article BACKGROUND: Large and complex population-based cancer data are becoming broadly available, thanks to purposeful linkage between cancer registry data and health electronic records. Aiming at understanding the explanatory power of factors on cancer survival, the modelling and selection of variables need to be understood and exploited properly for improving model-based estimates of cancer survival. METHOD: We assess the performances of well-known model selection strategies developed by Royston and Sauerbrei and Wynant and Abrahamowicz that we adapt to the relative survival data setting and to test for interaction terms. RESULTS: We apply these to all male patients diagnosed with lung cancer in England in 2012 (N = 15,688), and followed-up until 31/12/2015. We model the effects of age at diagnosis, tumour stage, deprivation, comorbidity and emergency presentation, as well as interactions between age and all of the above. Given the size of the dataset, all model selection strategies favoured virtually the same model, except for a non-linear effect of age at diagnosis selected by the backward-based selection strategies (versus a linear effect selected otherwise). CONCLUSION: The results from extensive simulations evaluating varying model complexity and sample sizes provide guidelines on a model selection strategy in the context of excess hazard modelling. BioMed Central 2019-11-20 /pmc/articles/PMC6869178/ /pubmed/31747928 http://dx.doi.org/10.1186/s12874-019-0830-9 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Maringe, Camille
Belot, Aurélien
Rubio, Francisco Javier
Rachet, Bernard
Comparison of model-building strategies for excess hazard regression models in the context of cancer epidemiology
title Comparison of model-building strategies for excess hazard regression models in the context of cancer epidemiology
title_full Comparison of model-building strategies for excess hazard regression models in the context of cancer epidemiology
title_fullStr Comparison of model-building strategies for excess hazard regression models in the context of cancer epidemiology
title_full_unstemmed Comparison of model-building strategies for excess hazard regression models in the context of cancer epidemiology
title_short Comparison of model-building strategies for excess hazard regression models in the context of cancer epidemiology
title_sort comparison of model-building strategies for excess hazard regression models in the context of cancer epidemiology
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6869178/
https://www.ncbi.nlm.nih.gov/pubmed/31747928
http://dx.doi.org/10.1186/s12874-019-0830-9
work_keys_str_mv AT maringecamille comparisonofmodelbuildingstrategiesforexcesshazardregressionmodelsinthecontextofcancerepidemiology
AT belotaurelien comparisonofmodelbuildingstrategiesforexcesshazardregressionmodelsinthecontextofcancerepidemiology
AT rubiofranciscojavier comparisonofmodelbuildingstrategiesforexcesshazardregressionmodelsinthecontextofcancerepidemiology
AT rachetbernard comparisonofmodelbuildingstrategiesforexcesshazardregressionmodelsinthecontextofcancerepidemiology