Cargando…

Forest Pruning Based on Branch Importance

A forest is an ensemble with decision trees as members. This paper proposes a novel strategy to pruning forest to enhance ensemble generalization ability and reduce ensemble size. Unlike conventional ensemble pruning approaches, the proposed method tries to evaluate the importance of branches of tre...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiang, Xiangkui, Wu, Chang-an, Guo, Huaping
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5474283/
https://www.ncbi.nlm.nih.gov/pubmed/28659973
http://dx.doi.org/10.1155/2017/3162571
_version_ 1783244420831772672
author Jiang, Xiangkui
Wu, Chang-an
Guo, Huaping
author_facet Jiang, Xiangkui
Wu, Chang-an
Guo, Huaping
author_sort Jiang, Xiangkui
collection PubMed
description A forest is an ensemble with decision trees as members. This paper proposes a novel strategy to pruning forest to enhance ensemble generalization ability and reduce ensemble size. Unlike conventional ensemble pruning approaches, the proposed method tries to evaluate the importance of branches of trees with respect to the whole ensemble using a novel proposed metric called importance gain. The importance of a branch is designed by considering ensemble accuracy and the diversity of ensemble members, and thus the metric reasonably evaluates how much improvement of the ensemble accuracy can be achieved when a branch is pruned. Our experiments show that the proposed method can significantly reduce ensemble size and improve ensemble accuracy, no matter whether ensembles are constructed by a certain algorithm such as bagging or obtained by an ensemble selection algorithm, no matter whether each decision tree is pruned or unpruned.
format Online
Article
Text
id pubmed-5474283
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-54742832017-06-28 Forest Pruning Based on Branch Importance Jiang, Xiangkui Wu, Chang-an Guo, Huaping Comput Intell Neurosci Research Article A forest is an ensemble with decision trees as members. This paper proposes a novel strategy to pruning forest to enhance ensemble generalization ability and reduce ensemble size. Unlike conventional ensemble pruning approaches, the proposed method tries to evaluate the importance of branches of trees with respect to the whole ensemble using a novel proposed metric called importance gain. The importance of a branch is designed by considering ensemble accuracy and the diversity of ensemble members, and thus the metric reasonably evaluates how much improvement of the ensemble accuracy can be achieved when a branch is pruned. Our experiments show that the proposed method can significantly reduce ensemble size and improve ensemble accuracy, no matter whether ensembles are constructed by a certain algorithm such as bagging or obtained by an ensemble selection algorithm, no matter whether each decision tree is pruned or unpruned. Hindawi 2017 2017-06-01 /pmc/articles/PMC5474283/ /pubmed/28659973 http://dx.doi.org/10.1155/2017/3162571 Text en Copyright © 2017 Xiangkui Jiang et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Jiang, Xiangkui
Wu, Chang-an
Guo, Huaping
Forest Pruning Based on Branch Importance
title Forest Pruning Based on Branch Importance
title_full Forest Pruning Based on Branch Importance
title_fullStr Forest Pruning Based on Branch Importance
title_full_unstemmed Forest Pruning Based on Branch Importance
title_short Forest Pruning Based on Branch Importance
title_sort forest pruning based on branch importance
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5474283/
https://www.ncbi.nlm.nih.gov/pubmed/28659973
http://dx.doi.org/10.1155/2017/3162571
work_keys_str_mv AT jiangxiangkui forestpruningbasedonbranchimportance
AT wuchangan forestpruningbasedonbranchimportance
AT guohuaping forestpruningbasedonbranchimportance