Cargando…
Decision tree methods: applications for classification and prediction
Decision tree methodology is a commonly used data mining method for establishing classification systems based on multiple covariates or for developing prediction algorithms for a target variable. This method classifies a population into branch-like segments that construct an inverted tree with a roo...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Shanghai Municipal Bureau of Publishing
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4466856/ https://www.ncbi.nlm.nih.gov/pubmed/26120265 http://dx.doi.org/10.11919/j.issn.1002-0829.215044 |
_version_ | 1782376295323140096 |
---|---|
author | SONG, Yan-yan LU, Ying |
author_facet | SONG, Yan-yan LU, Ying |
author_sort | SONG, Yan-yan |
collection | PubMed |
description | Decision tree methodology is a commonly used data mining method for establishing classification systems based on multiple covariates or for developing prediction algorithms for a target variable. This method classifies a population into branch-like segments that construct an inverted tree with a root node, internal nodes, and leaf nodes. The algorithm is non-parametric and can efficiently deal with large, complicated datasets without imposing a complicated parametric structure. When the sample size is large enough, study data can be divided into training and validation datasets. Using the training dataset to build a decision tree model and a validation dataset to decide on the appropriate tree size needed to achieve the optimal final model. This paper introduces frequently used algorithms used to develop decision trees (including CART, C4.5, CHAID, and QUEST) and describes the SPSS and SAS programs that can be used to visualize tree structure. |
format | Online Article Text |
id | pubmed-4466856 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Shanghai Municipal Bureau of Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-44668562015-06-26 Decision tree methods: applications for classification and prediction SONG, Yan-yan LU, Ying Shanghai Arch Psychiatry Biostatistics in Psychiatry (26) Decision tree methodology is a commonly used data mining method for establishing classification systems based on multiple covariates or for developing prediction algorithms for a target variable. This method classifies a population into branch-like segments that construct an inverted tree with a root node, internal nodes, and leaf nodes. The algorithm is non-parametric and can efficiently deal with large, complicated datasets without imposing a complicated parametric structure. When the sample size is large enough, study data can be divided into training and validation datasets. Using the training dataset to build a decision tree model and a validation dataset to decide on the appropriate tree size needed to achieve the optimal final model. This paper introduces frequently used algorithms used to develop decision trees (including CART, C4.5, CHAID, and QUEST) and describes the SPSS and SAS programs that can be used to visualize tree structure. Shanghai Municipal Bureau of Publishing 2015-04-25 /pmc/articles/PMC4466856/ /pubmed/26120265 http://dx.doi.org/10.11919/j.issn.1002-0829.215044 Text en Copyright © 2015 by Shanghai Municipal Bureau of Publishing http://creativecommons.org/licenses/by-nc-sa/4.0/ This work is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 4.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/4.0/ |
spellingShingle | Biostatistics in Psychiatry (26) SONG, Yan-yan LU, Ying Decision tree methods: applications for classification and prediction |
title | Decision tree methods: applications for classification and prediction |
title_full | Decision tree methods: applications for classification and prediction |
title_fullStr | Decision tree methods: applications for classification and prediction |
title_full_unstemmed | Decision tree methods: applications for classification and prediction |
title_short | Decision tree methods: applications for classification and prediction |
title_sort | decision tree methods: applications for classification and prediction |
topic | Biostatistics in Psychiatry (26) |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4466856/ https://www.ncbi.nlm.nih.gov/pubmed/26120265 http://dx.doi.org/10.11919/j.issn.1002-0829.215044 |
work_keys_str_mv | AT songyanyan decisiontreemethodsapplicationsforclassificationandprediction AT luying decisiontreemethodsapplicationsforclassificationandprediction |