Cargando…
Application of Random Forest Survival Models to Increase Generalizability of Decision Trees: A Case Study in Acute Myocardial Infarction
Background. Tree models provide easily interpretable prognostic tool, but instable results. Two approaches to enhance the generalizability of the results are pruning and random survival forest (RSF). The aim of this study is to assess the generalizability of saturated tree (ST), pruned tree (PT), an...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4698527/ https://www.ncbi.nlm.nih.gov/pubmed/26858773 http://dx.doi.org/10.1155/2015/576413 |
_version_ | 1782408038699761664 |
---|---|
author | Yosefian, Iman Mosa Farkhani, Ehsan Baneshi, Mohammad Reza |
author_facet | Yosefian, Iman Mosa Farkhani, Ehsan Baneshi, Mohammad Reza |
author_sort | Yosefian, Iman |
collection | PubMed |
description | Background. Tree models provide easily interpretable prognostic tool, but instable results. Two approaches to enhance the generalizability of the results are pruning and random survival forest (RSF). The aim of this study is to assess the generalizability of saturated tree (ST), pruned tree (PT), and RSF. Methods. Data of 607 patients was randomly divided into training and test set applying 10-fold cross-validation. Using training sets, all three models were applied. Using Log-Rank test, ST was constructed by searching for optimal cutoffs. PT was selected plotting error rate versus minimum sample size in terminal nodes. In construction of RSF, 1000 bootstrap samples were drawn from the training set. C-index and integrated Brier score (IBS) statistic were used to compare models. Results. ST provides the most overoptimized statistics. Mean difference between C-index in training and test set was 0.237. Corresponding figure in PT and RSF was 0.054 and 0.007. In terms of IBS, the difference was 0.136 in ST, 0.021 in PT, and 0.0003 in RSF. Conclusion. Pruning of tree and assessment of its performance of a test set partially improve the generalizability of decision trees. RSF provides results that are highly generalizable. |
format | Online Article Text |
id | pubmed-4698527 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-46985272016-02-08 Application of Random Forest Survival Models to Increase Generalizability of Decision Trees: A Case Study in Acute Myocardial Infarction Yosefian, Iman Mosa Farkhani, Ehsan Baneshi, Mohammad Reza Comput Math Methods Med Research Article Background. Tree models provide easily interpretable prognostic tool, but instable results. Two approaches to enhance the generalizability of the results are pruning and random survival forest (RSF). The aim of this study is to assess the generalizability of saturated tree (ST), pruned tree (PT), and RSF. Methods. Data of 607 patients was randomly divided into training and test set applying 10-fold cross-validation. Using training sets, all three models were applied. Using Log-Rank test, ST was constructed by searching for optimal cutoffs. PT was selected plotting error rate versus minimum sample size in terminal nodes. In construction of RSF, 1000 bootstrap samples were drawn from the training set. C-index and integrated Brier score (IBS) statistic were used to compare models. Results. ST provides the most overoptimized statistics. Mean difference between C-index in training and test set was 0.237. Corresponding figure in PT and RSF was 0.054 and 0.007. In terms of IBS, the difference was 0.136 in ST, 0.021 in PT, and 0.0003 in RSF. Conclusion. Pruning of tree and assessment of its performance of a test set partially improve the generalizability of decision trees. RSF provides results that are highly generalizable. Hindawi Publishing Corporation 2015 2015-12-21 /pmc/articles/PMC4698527/ /pubmed/26858773 http://dx.doi.org/10.1155/2015/576413 Text en Copyright © 2015 Iman Yosefian et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Yosefian, Iman Mosa Farkhani, Ehsan Baneshi, Mohammad Reza Application of Random Forest Survival Models to Increase Generalizability of Decision Trees: A Case Study in Acute Myocardial Infarction |
title | Application of Random Forest Survival Models to Increase Generalizability of Decision Trees: A Case Study in Acute Myocardial Infarction |
title_full | Application of Random Forest Survival Models to Increase Generalizability of Decision Trees: A Case Study in Acute Myocardial Infarction |
title_fullStr | Application of Random Forest Survival Models to Increase Generalizability of Decision Trees: A Case Study in Acute Myocardial Infarction |
title_full_unstemmed | Application of Random Forest Survival Models to Increase Generalizability of Decision Trees: A Case Study in Acute Myocardial Infarction |
title_short | Application of Random Forest Survival Models to Increase Generalizability of Decision Trees: A Case Study in Acute Myocardial Infarction |
title_sort | application of random forest survival models to increase generalizability of decision trees: a case study in acute myocardial infarction |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4698527/ https://www.ncbi.nlm.nih.gov/pubmed/26858773 http://dx.doi.org/10.1155/2015/576413 |
work_keys_str_mv | AT yosefianiman applicationofrandomforestsurvivalmodelstoincreasegeneralizabilityofdecisiontreesacasestudyinacutemyocardialinfarction AT mosafarkhaniehsan applicationofrandomforestsurvivalmodelstoincreasegeneralizabilityofdecisiontreesacasestudyinacutemyocardialinfarction AT baneshimohammadreza applicationofrandomforestsurvivalmodelstoincreasegeneralizabilityofdecisiontreesacasestudyinacutemyocardialinfarction |