Cargando…

Decision tree methods: applications for classification and prediction

Decision tree methodology is a commonly used data mining method for establishing classification systems based on multiple covariates or for developing prediction algorithms for a target variable. This method classifies a population into branch-like segments that construct an inverted tree with a roo...

Descripción completa

Detalles Bibliográficos
Autores principales: SONG, Yan-yan, LU, Ying
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Shanghai Municipal Bureau of Publishing 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4466856/
https://www.ncbi.nlm.nih.gov/pubmed/26120265
http://dx.doi.org/10.11919/j.issn.1002-0829.215044
_version_ 1782376295323140096
author SONG, Yan-yan
LU, Ying
author_facet SONG, Yan-yan
LU, Ying
author_sort SONG, Yan-yan
collection PubMed
description Decision tree methodology is a commonly used data mining method for establishing classification systems based on multiple covariates or for developing prediction algorithms for a target variable. This method classifies a population into branch-like segments that construct an inverted tree with a root node, internal nodes, and leaf nodes. The algorithm is non-parametric and can efficiently deal with large, complicated datasets without imposing a complicated parametric structure. When the sample size is large enough, study data can be divided into training and validation datasets. Using the training dataset to build a decision tree model and a validation dataset to decide on the appropriate tree size needed to achieve the optimal final model. This paper introduces frequently used algorithms used to develop decision trees (including CART, C4.5, CHAID, and QUEST) and describes the SPSS and SAS programs that can be used to visualize tree structure.
format Online
Article
Text
id pubmed-4466856
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Shanghai Municipal Bureau of Publishing
record_format MEDLINE/PubMed
spelling pubmed-44668562015-06-26 Decision tree methods: applications for classification and prediction SONG, Yan-yan LU, Ying Shanghai Arch Psychiatry Biostatistics in Psychiatry (26) Decision tree methodology is a commonly used data mining method for establishing classification systems based on multiple covariates or for developing prediction algorithms for a target variable. This method classifies a population into branch-like segments that construct an inverted tree with a root node, internal nodes, and leaf nodes. The algorithm is non-parametric and can efficiently deal with large, complicated datasets without imposing a complicated parametric structure. When the sample size is large enough, study data can be divided into training and validation datasets. Using the training dataset to build a decision tree model and a validation dataset to decide on the appropriate tree size needed to achieve the optimal final model. This paper introduces frequently used algorithms used to develop decision trees (including CART, C4.5, CHAID, and QUEST) and describes the SPSS and SAS programs that can be used to visualize tree structure. Shanghai Municipal Bureau of Publishing 2015-04-25 /pmc/articles/PMC4466856/ /pubmed/26120265 http://dx.doi.org/10.11919/j.issn.1002-0829.215044 Text en Copyright © 2015 by Shanghai Municipal Bureau of Publishing http://creativecommons.org/licenses/by-nc-sa/4.0/ This work is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 4.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/4.0/
spellingShingle Biostatistics in Psychiatry (26)
SONG, Yan-yan
LU, Ying
Decision tree methods: applications for classification and prediction
title Decision tree methods: applications for classification and prediction
title_full Decision tree methods: applications for classification and prediction
title_fullStr Decision tree methods: applications for classification and prediction
title_full_unstemmed Decision tree methods: applications for classification and prediction
title_short Decision tree methods: applications for classification and prediction
title_sort decision tree methods: applications for classification and prediction
topic Biostatistics in Psychiatry (26)
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4466856/
https://www.ncbi.nlm.nih.gov/pubmed/26120265
http://dx.doi.org/10.11919/j.issn.1002-0829.215044
work_keys_str_mv AT songyanyan decisiontreemethodsapplicationsforclassificationandprediction
AT luying decisiontreemethodsapplicationsforclassificationandprediction