Cargando…

Random forest methodology for model-based recursive partitioning: the mobForest package for R

BACKGROUND: Recursive partitioning is a non-parametric modeling technique, widely used in regression and classification problems. Model-based recursive partitioning is used to identify groups of observations with similar values of parameters of the model of interest. The mob() function in the party...

Descripción completa

Detalles Bibliográficos
Autores principales: Garge, Nikhil R, Bobashev, Georgiy, Eggleston, Barry
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3626834/
https://www.ncbi.nlm.nih.gov/pubmed/23577585
http://dx.doi.org/10.1186/1471-2105-14-125
_version_ 1782266252125798400
author Garge, Nikhil R
Bobashev, Georgiy
Eggleston, Barry
author_facet Garge, Nikhil R
Bobashev, Georgiy
Eggleston, Barry
author_sort Garge, Nikhil R
collection PubMed
description BACKGROUND: Recursive partitioning is a non-parametric modeling technique, widely used in regression and classification problems. Model-based recursive partitioning is used to identify groups of observations with similar values of parameters of the model of interest. The mob() function in the party package in R implements model-based recursive partitioning method. This method produces predictions based on single tree models. Predictions obtained through single tree models are very sensitive to small changes to the learning sample. We extend the model-based recursive partition method to produce predictions based on multiple tree models constructed on random samples achieved either through bootstrapping (random sampling with replacement) or subsampling (random sampling without replacement) on learning data. RESULTS: Here we present an R package called “mobForest” that implements bagging and random forests methodology for model-based recursive partitioning. The mobForest package constructs large number of model-based trees and the predictions are aggregated across these trees resulting in more stable predictions. The package also includes functions for computing predictive accuracy estimates and plots, residuals plot, and variable importance plot. CONCLUSION: The mobForest package implements a random forest type approach for model-based recursive partitioning. The R package along with it source code is available at http://CRAN.R-project.org/package=mobForest.
format Online
Article
Text
id pubmed-3626834
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36268342013-04-24 Random forest methodology for model-based recursive partitioning: the mobForest package for R Garge, Nikhil R Bobashev, Georgiy Eggleston, Barry BMC Bioinformatics Software BACKGROUND: Recursive partitioning is a non-parametric modeling technique, widely used in regression and classification problems. Model-based recursive partitioning is used to identify groups of observations with similar values of parameters of the model of interest. The mob() function in the party package in R implements model-based recursive partitioning method. This method produces predictions based on single tree models. Predictions obtained through single tree models are very sensitive to small changes to the learning sample. We extend the model-based recursive partition method to produce predictions based on multiple tree models constructed on random samples achieved either through bootstrapping (random sampling with replacement) or subsampling (random sampling without replacement) on learning data. RESULTS: Here we present an R package called “mobForest” that implements bagging and random forests methodology for model-based recursive partitioning. The mobForest package constructs large number of model-based trees and the predictions are aggregated across these trees resulting in more stable predictions. The package also includes functions for computing predictive accuracy estimates and plots, residuals plot, and variable importance plot. CONCLUSION: The mobForest package implements a random forest type approach for model-based recursive partitioning. The R package along with it source code is available at http://CRAN.R-project.org/package=mobForest. BioMed Central 2013-04-11 /pmc/articles/PMC3626834/ /pubmed/23577585 http://dx.doi.org/10.1186/1471-2105-14-125 Text en Copyright © 2013 Garge et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Garge, Nikhil R
Bobashev, Georgiy
Eggleston, Barry
Random forest methodology for model-based recursive partitioning: the mobForest package for R
title Random forest methodology for model-based recursive partitioning: the mobForest package for R
title_full Random forest methodology for model-based recursive partitioning: the mobForest package for R
title_fullStr Random forest methodology for model-based recursive partitioning: the mobForest package for R
title_full_unstemmed Random forest methodology for model-based recursive partitioning: the mobForest package for R
title_short Random forest methodology for model-based recursive partitioning: the mobForest package for R
title_sort random forest methodology for model-based recursive partitioning: the mobforest package for r
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3626834/
https://www.ncbi.nlm.nih.gov/pubmed/23577585
http://dx.doi.org/10.1186/1471-2105-14-125
work_keys_str_mv AT gargenikhilr randomforestmethodologyformodelbasedrecursivepartitioningthemobforestpackageforr
AT bobashevgeorgiy randomforestmethodologyformodelbasedrecursivepartitioningthemobforestpackageforr
AT egglestonbarry randomforestmethodologyformodelbasedrecursivepartitioningthemobforestpackageforr