
SurvBenchmark: comprehensive benchmarking study of survival analysis methods using both omics data and clinical data

Survival analysis is a branch of statistics that deals with both the tracking of time and the survival status simultaneously as the dependent response. Current comparisons of survival model performance mostly center on clinical data with classic statistical survival models, with prediction accuracy...


Bibliographic Details
Main authors:	Zhang, Yunwei; Wong, Germaine; Mann, Graham; Muller, Samuel; Yang, Jean Y H
Format:	Online Article Text
Language:	English
Published:	Oxford University Press 2022
Subjects:	Research
Online access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9338425/ https://www.ncbi.nlm.nih.gov/pubmed/35906887 http://dx.doi.org/10.1093/gigascience/giac071

author	Zhang, Yunwei; Wong, Germaine; Mann, Graham; Muller, Samuel; Yang, Jean Y H
collection	PubMed
description	Survival analysis is a branch of statistics that deals with both the tracking of time and the survival status simultaneously as the dependent response. Current comparisons of survival model performance mostly center on clinical data with classic statistical survival models, with prediction accuracy often serving as the sole metric of model performance. Moreover, survival analysis approaches for censored omics data have not been thoroughly investigated. The common approach is to binarize the survival time and perform a classification analysis. Here, we develop a benchmarking design, SurvBenchmark, that evaluates a diverse collection of survival models for both clinical and omics data sets. SurvBenchmark not only focuses on classical approaches such as the Cox model but also evaluates state-of-the-art machine learning survival models. All approaches were assessed using multiple performance metrics; these include model predictability, stability, flexibility, and computational issues. Our systematic comparison design with 320 comparisons (20 methods over 16 data sets) shows that the performances of survival models vary in practice over real-world data sets and over the choice of the evaluation metric. In particular, we highlight that using multiple performance metrics is critical in providing a balanced assessment of various models. The results in our study will provide practical guidelines for translational scientists and clinicians, as well as define possible areas of investigation in both survival technique and benchmarking strategies.
format	Online Article Text
id	pubmed-9338425
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-9338425 2022-08-01. SurvBenchmark: comprehensive benchmarking study of survival analysis methods using both omics data and clinical data. Zhang, Yunwei; Wong, Germaine; Mann, Graham; Muller, Samuel; Yang, Jean Y H. Gigascience. Research. Oxford University Press 2022-07-30. /pmc/articles/PMC9338425/ /pubmed/35906887 http://dx.doi.org/10.1093/gigascience/giac071 Text en. © The Author(s) 2022. Published by Oxford University Press GigaScience. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
