Cargando…

Estimation of Heterogeneous Restricted Mean Survival Time Using Random Forest

Estimation and prediction of heterogeneous restricted mean survival time (hRMST) is of great clinical importance, which can provide an easily interpretable and clinically meaningful summary of the survival function in the presence of censoring and individual covariates. The existing methods for the...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Mingyang, Li, Hongzhe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7873855/
https://www.ncbi.nlm.nih.gov/pubmed/33584791
http://dx.doi.org/10.3389/fgene.2020.587378
_version_ 1783649461579284480
author Liu, Mingyang
Li, Hongzhe
author_facet Liu, Mingyang
Li, Hongzhe
author_sort Liu, Mingyang
collection PubMed
description Estimation and prediction of heterogeneous restricted mean survival time (hRMST) is of great clinical importance, which can provide an easily interpretable and clinically meaningful summary of the survival function in the presence of censoring and individual covariates. The existing methods for the modeling of hRMST rely on proportional hazards or other parametric assumptions on the survival distribution. In this paper, we propose a random forest based estimation of hRMST for right-censored survival data with covariates and prove a central limit theorem for the resulting estimator. In addition, we present a computationally efficient construction for the confidence interval of hRMST. Our simulations show that the resulting confidence intervals have the correct coverage probability of the hRMST, and the random forest based estimate of hRMST has smaller prediction errors than the parametric models when the models are mis-specified. We apply the method to the ovarian cancer data set from The Cancer Genome Atlas (TCGA) project to predict hRMST and show an improved prediction performance over the existing methods. A software implementation, srf using R and C++, is available at https://github.com/lmy1019/SRF.
format Online
Article
Text
id pubmed-7873855
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-78738552021-02-11 Estimation of Heterogeneous Restricted Mean Survival Time Using Random Forest Liu, Mingyang Li, Hongzhe Front Genet Genetics Estimation and prediction of heterogeneous restricted mean survival time (hRMST) is of great clinical importance, which can provide an easily interpretable and clinically meaningful summary of the survival function in the presence of censoring and individual covariates. The existing methods for the modeling of hRMST rely on proportional hazards or other parametric assumptions on the survival distribution. In this paper, we propose a random forest based estimation of hRMST for right-censored survival data with covariates and prove a central limit theorem for the resulting estimator. In addition, we present a computationally efficient construction for the confidence interval of hRMST. Our simulations show that the resulting confidence intervals have the correct coverage probability of the hRMST, and the random forest based estimate of hRMST has smaller prediction errors than the parametric models when the models are mis-specified. We apply the method to the ovarian cancer data set from The Cancer Genome Atlas (TCGA) project to predict hRMST and show an improved prediction performance over the existing methods. A software implementation, srf using R and C++, is available at https://github.com/lmy1019/SRF. Frontiers Media S.A. 2021-01-07 /pmc/articles/PMC7873855/ /pubmed/33584791 http://dx.doi.org/10.3389/fgene.2020.587378 Text en Copyright © 2021 Liu and Li. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Liu, Mingyang
Li, Hongzhe
Estimation of Heterogeneous Restricted Mean Survival Time Using Random Forest
title Estimation of Heterogeneous Restricted Mean Survival Time Using Random Forest
title_full Estimation of Heterogeneous Restricted Mean Survival Time Using Random Forest
title_fullStr Estimation of Heterogeneous Restricted Mean Survival Time Using Random Forest
title_full_unstemmed Estimation of Heterogeneous Restricted Mean Survival Time Using Random Forest
title_short Estimation of Heterogeneous Restricted Mean Survival Time Using Random Forest
title_sort estimation of heterogeneous restricted mean survival time using random forest
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7873855/
https://www.ncbi.nlm.nih.gov/pubmed/33584791
http://dx.doi.org/10.3389/fgene.2020.587378
work_keys_str_mv AT liumingyang estimationofheterogeneousrestrictedmeansurvivaltimeusingrandomforest
AT lihongzhe estimationofheterogeneousrestrictedmeansurvivaltimeusingrandomforest