Cargando…

Synthetic learning machines

BACKGROUND: Using a collection of different terminal nodesize constructed random forests, each generating a synthetic feature, a synthetic random forest is defined as a kind of hyperforest, calculated using the new input synthetic features, along with the original features. RESULTS: Using a large co...

Descripción completa

Detalles Bibliográficos
Autores principales: Ishwaran, Hemant, Malley, James D
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4279689/
https://www.ncbi.nlm.nih.gov/pubmed/25614764
http://dx.doi.org/10.1186/s13040-014-0028-y
_version_ 1782350746089422848
author Ishwaran, Hemant
Malley, James D
author_facet Ishwaran, Hemant
Malley, James D
author_sort Ishwaran, Hemant
collection PubMed
description BACKGROUND: Using a collection of different terminal nodesize constructed random forests, each generating a synthetic feature, a synthetic random forest is defined as a kind of hyperforest, calculated using the new input synthetic features, along with the original features. RESULTS: Using a large collection of regression and multiclass datasets we show that synthetic random forests outperforms both conventional random forests and the optimized forest from the regresssion portfolio. CONCLUSIONS: Synthetic forests removes the need for tuning random forests with no additional effort on the part of the researcher. Importantly, the synthetic forest does this with evidently no loss in prediction compared to a well-optimized single random forest.
format Online
Article
Text
id pubmed-4279689
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42796892015-01-22 Synthetic learning machines Ishwaran, Hemant Malley, James D BioData Min Methodology BACKGROUND: Using a collection of different terminal nodesize constructed random forests, each generating a synthetic feature, a synthetic random forest is defined as a kind of hyperforest, calculated using the new input synthetic features, along with the original features. RESULTS: Using a large collection of regression and multiclass datasets we show that synthetic random forests outperforms both conventional random forests and the optimized forest from the regresssion portfolio. CONCLUSIONS: Synthetic forests removes the need for tuning random forests with no additional effort on the part of the researcher. Importantly, the synthetic forest does this with evidently no loss in prediction compared to a well-optimized single random forest. BioMed Central 2014-12-18 /pmc/articles/PMC4279689/ /pubmed/25614764 http://dx.doi.org/10.1186/s13040-014-0028-y Text en © Ishwaran and Malley; licensee BioMed Central. 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology
Ishwaran, Hemant
Malley, James D
Synthetic learning machines
title Synthetic learning machines
title_full Synthetic learning machines
title_fullStr Synthetic learning machines
title_full_unstemmed Synthetic learning machines
title_short Synthetic learning machines
title_sort synthetic learning machines
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4279689/
https://www.ncbi.nlm.nih.gov/pubmed/25614764
http://dx.doi.org/10.1186/s13040-014-0028-y
work_keys_str_mv AT ishwaranhemant syntheticlearningmachines
AT malleyjamesd syntheticlearningmachines