Cargando…

Application of Random Forest Survival Models to Increase Generalizability of Decision Trees: A Case Study in Acute Myocardial Infarction

Background. Tree models provide easily interpretable prognostic tool, but instable results. Two approaches to enhance the generalizability of the results are pruning and random survival forest (RSF). The aim of this study is to assess the generalizability of saturated tree (ST), pruned tree (PT), an...

Descripción completa

Detalles Bibliográficos
Autores principales: Yosefian, Iman, Mosa Farkhani, Ehsan, Baneshi, Mohammad Reza
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4698527/
https://www.ncbi.nlm.nih.gov/pubmed/26858773
http://dx.doi.org/10.1155/2015/576413
_version_ 1782408038699761664
author Yosefian, Iman
Mosa Farkhani, Ehsan
Baneshi, Mohammad Reza
author_facet Yosefian, Iman
Mosa Farkhani, Ehsan
Baneshi, Mohammad Reza
author_sort Yosefian, Iman
collection PubMed
description Background. Tree models provide easily interpretable prognostic tool, but instable results. Two approaches to enhance the generalizability of the results are pruning and random survival forest (RSF). The aim of this study is to assess the generalizability of saturated tree (ST), pruned tree (PT), and RSF. Methods. Data of 607 patients was randomly divided into training and test set applying 10-fold cross-validation. Using training sets, all three models were applied. Using Log-Rank test, ST was constructed by searching for optimal cutoffs. PT was selected plotting error rate versus minimum sample size in terminal nodes. In construction of RSF, 1000 bootstrap samples were drawn from the training set. C-index and integrated Brier score (IBS) statistic were used to compare models. Results. ST provides the most overoptimized statistics. Mean difference between C-index in training and test set was 0.237. Corresponding figure in PT and RSF was 0.054 and 0.007. In terms of IBS, the difference was 0.136 in ST, 0.021 in PT, and 0.0003 in RSF. Conclusion. Pruning of tree and assessment of its performance of a test set partially improve the generalizability of decision trees. RSF provides results that are highly generalizable.
format Online
Article
Text
id pubmed-4698527
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-46985272016-02-08 Application of Random Forest Survival Models to Increase Generalizability of Decision Trees: A Case Study in Acute Myocardial Infarction Yosefian, Iman Mosa Farkhani, Ehsan Baneshi, Mohammad Reza Comput Math Methods Med Research Article Background. Tree models provide easily interpretable prognostic tool, but instable results. Two approaches to enhance the generalizability of the results are pruning and random survival forest (RSF). The aim of this study is to assess the generalizability of saturated tree (ST), pruned tree (PT), and RSF. Methods. Data of 607 patients was randomly divided into training and test set applying 10-fold cross-validation. Using training sets, all three models were applied. Using Log-Rank test, ST was constructed by searching for optimal cutoffs. PT was selected plotting error rate versus minimum sample size in terminal nodes. In construction of RSF, 1000 bootstrap samples were drawn from the training set. C-index and integrated Brier score (IBS) statistic were used to compare models. Results. ST provides the most overoptimized statistics. Mean difference between C-index in training and test set was 0.237. Corresponding figure in PT and RSF was 0.054 and 0.007. In terms of IBS, the difference was 0.136 in ST, 0.021 in PT, and 0.0003 in RSF. Conclusion. Pruning of tree and assessment of its performance of a test set partially improve the generalizability of decision trees. RSF provides results that are highly generalizable. Hindawi Publishing Corporation 2015 2015-12-21 /pmc/articles/PMC4698527/ /pubmed/26858773 http://dx.doi.org/10.1155/2015/576413 Text en Copyright © 2015 Iman Yosefian et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Yosefian, Iman
Mosa Farkhani, Ehsan
Baneshi, Mohammad Reza
Application of Random Forest Survival Models to Increase Generalizability of Decision Trees: A Case Study in Acute Myocardial Infarction
title Application of Random Forest Survival Models to Increase Generalizability of Decision Trees: A Case Study in Acute Myocardial Infarction
title_full Application of Random Forest Survival Models to Increase Generalizability of Decision Trees: A Case Study in Acute Myocardial Infarction
title_fullStr Application of Random Forest Survival Models to Increase Generalizability of Decision Trees: A Case Study in Acute Myocardial Infarction
title_full_unstemmed Application of Random Forest Survival Models to Increase Generalizability of Decision Trees: A Case Study in Acute Myocardial Infarction
title_short Application of Random Forest Survival Models to Increase Generalizability of Decision Trees: A Case Study in Acute Myocardial Infarction
title_sort application of random forest survival models to increase generalizability of decision trees: a case study in acute myocardial infarction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4698527/
https://www.ncbi.nlm.nih.gov/pubmed/26858773
http://dx.doi.org/10.1155/2015/576413
work_keys_str_mv AT yosefianiman applicationofrandomforestsurvivalmodelstoincreasegeneralizabilityofdecisiontreesacasestudyinacutemyocardialinfarction
AT mosafarkhaniehsan applicationofrandomforestsurvivalmodelstoincreasegeneralizabilityofdecisiontreesacasestudyinacutemyocardialinfarction
AT baneshimohammadreza applicationofrandomforestsurvivalmodelstoincreasegeneralizabilityofdecisiontreesacasestudyinacutemyocardialinfarction