Cargando…

A scoping methodological review of simulation studies comparing statistical and machine learning approaches to risk prediction for time-to-event data

BACKGROUND: There is substantial interest in the adaptation and application of so-called machine learning approaches to prognostic modelling of censored time-to-event data. These methods must be compared and evaluated against existing methods in a variety of scenarios to determine their predictive p...

Descripción completa

Detalles Bibliográficos
Autores principales: Smith, Hayley, Sweeting, Michael, Morris, Tim, Crowther, Michael J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9161606/
https://www.ncbi.nlm.nih.gov/pubmed/35650647
http://dx.doi.org/10.1186/s41512-022-00124-y
_version_ 1784719521034010624
author Smith, Hayley
Sweeting, Michael
Morris, Tim
Crowther, Michael J.
author_facet Smith, Hayley
Sweeting, Michael
Morris, Tim
Crowther, Michael J.
author_sort Smith, Hayley
collection PubMed
description BACKGROUND: There is substantial interest in the adaptation and application of so-called machine learning approaches to prognostic modelling of censored time-to-event data. These methods must be compared and evaluated against existing methods in a variety of scenarios to determine their predictive performance. A scoping review of how machine learning methods have been compared to traditional survival models is important to identify the comparisons that have been made and issues where they are lacking, biased towards one approach or misleading. METHODS: We conducted a scoping review of research articles published between 1 January 2000 and 2 December 2020 using PubMed. Eligible articles were those that used simulation studies to compare statistical and machine learning methods for risk prediction with a time-to-event outcome in a medical/healthcare setting. We focus on data-generating mechanisms (DGMs), the methods that have been compared, the estimands of the simulation studies, and the performance measures used to evaluate them. RESULTS: A total of ten articles were identified as eligible for the review. Six of the articles evaluated a method that was developed by the authors, four of which were machine learning methods, and the results almost always stated that this developed method’s performance was equivalent to or better than the other methods compared. Comparisons were often biased towards the novel approach, with the majority only comparing against a basic Cox proportional hazards model, and in scenarios where it is clear it would not perform well. In many of the articles reviewed, key information was unclear, such as the number of simulation repetitions and how performance measures were calculated. CONCLUSION: It is vital that method comparisons are unbiased and comprehensive, and this should be the goal even if realising it is difficult. Fully assessing how newly developed methods perform and how they compare to a variety of traditional statistical methods for prognostic modelling is imperative as these methods are already being applied in clinical contexts. Evaluations of the performance and usefulness of recently developed methods for risk prediction should be continued and reporting standards improved as these methods become increasingly popular. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s41512-022-00124-y.
format Online
Article
Text
id pubmed-9161606
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-91616062022-06-03 A scoping methodological review of simulation studies comparing statistical and machine learning approaches to risk prediction for time-to-event data Smith, Hayley Sweeting, Michael Morris, Tim Crowther, Michael J. Diagn Progn Res Review BACKGROUND: There is substantial interest in the adaptation and application of so-called machine learning approaches to prognostic modelling of censored time-to-event data. These methods must be compared and evaluated against existing methods in a variety of scenarios to determine their predictive performance. A scoping review of how machine learning methods have been compared to traditional survival models is important to identify the comparisons that have been made and issues where they are lacking, biased towards one approach or misleading. METHODS: We conducted a scoping review of research articles published between 1 January 2000 and 2 December 2020 using PubMed. Eligible articles were those that used simulation studies to compare statistical and machine learning methods for risk prediction with a time-to-event outcome in a medical/healthcare setting. We focus on data-generating mechanisms (DGMs), the methods that have been compared, the estimands of the simulation studies, and the performance measures used to evaluate them. RESULTS: A total of ten articles were identified as eligible for the review. Six of the articles evaluated a method that was developed by the authors, four of which were machine learning methods, and the results almost always stated that this developed method’s performance was equivalent to or better than the other methods compared. Comparisons were often biased towards the novel approach, with the majority only comparing against a basic Cox proportional hazards model, and in scenarios where it is clear it would not perform well. In many of the articles reviewed, key information was unclear, such as the number of simulation repetitions and how performance measures were calculated. CONCLUSION: It is vital that method comparisons are unbiased and comprehensive, and this should be the goal even if realising it is difficult. Fully assessing how newly developed methods perform and how they compare to a variety of traditional statistical methods for prognostic modelling is imperative as these methods are already being applied in clinical contexts. Evaluations of the performance and usefulness of recently developed methods for risk prediction should be continued and reporting standards improved as these methods become increasingly popular. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s41512-022-00124-y. BioMed Central 2022-06-02 /pmc/articles/PMC9161606/ /pubmed/35650647 http://dx.doi.org/10.1186/s41512-022-00124-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Review
Smith, Hayley
Sweeting, Michael
Morris, Tim
Crowther, Michael J.
A scoping methodological review of simulation studies comparing statistical and machine learning approaches to risk prediction for time-to-event data
title A scoping methodological review of simulation studies comparing statistical and machine learning approaches to risk prediction for time-to-event data
title_full A scoping methodological review of simulation studies comparing statistical and machine learning approaches to risk prediction for time-to-event data
title_fullStr A scoping methodological review of simulation studies comparing statistical and machine learning approaches to risk prediction for time-to-event data
title_full_unstemmed A scoping methodological review of simulation studies comparing statistical and machine learning approaches to risk prediction for time-to-event data
title_short A scoping methodological review of simulation studies comparing statistical and machine learning approaches to risk prediction for time-to-event data
title_sort scoping methodological review of simulation studies comparing statistical and machine learning approaches to risk prediction for time-to-event data
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9161606/
https://www.ncbi.nlm.nih.gov/pubmed/35650647
http://dx.doi.org/10.1186/s41512-022-00124-y
work_keys_str_mv AT smithhayley ascopingmethodologicalreviewofsimulationstudiescomparingstatisticalandmachinelearningapproachestoriskpredictionfortimetoeventdata
AT sweetingmichael ascopingmethodologicalreviewofsimulationstudiescomparingstatisticalandmachinelearningapproachestoriskpredictionfortimetoeventdata
AT morristim ascopingmethodologicalreviewofsimulationstudiescomparingstatisticalandmachinelearningapproachestoriskpredictionfortimetoeventdata
AT crowthermichaelj ascopingmethodologicalreviewofsimulationstudiescomparingstatisticalandmachinelearningapproachestoriskpredictionfortimetoeventdata
AT smithhayley scopingmethodologicalreviewofsimulationstudiescomparingstatisticalandmachinelearningapproachestoriskpredictionfortimetoeventdata
AT sweetingmichael scopingmethodologicalreviewofsimulationstudiescomparingstatisticalandmachinelearningapproachestoriskpredictionfortimetoeventdata
AT morristim scopingmethodologicalreviewofsimulationstudiescomparingstatisticalandmachinelearningapproachestoriskpredictionfortimetoeventdata
AT crowthermichaelj scopingmethodologicalreviewofsimulationstudiescomparingstatisticalandmachinelearningapproachestoriskpredictionfortimetoeventdata