Cargando…
Synthetic learning machines
BACKGROUND: Using a collection of different terminal nodesize constructed random forests, each generating a synthetic feature, a synthetic random forest is defined as a kind of hyperforest, calculated using the new input synthetic features, along with the original features. RESULTS: Using a large co...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4279689/ https://www.ncbi.nlm.nih.gov/pubmed/25614764 http://dx.doi.org/10.1186/s13040-014-0028-y |
_version_ | 1782350746089422848 |
---|---|
author | Ishwaran, Hemant Malley, James D |
author_facet | Ishwaran, Hemant Malley, James D |
author_sort | Ishwaran, Hemant |
collection | PubMed |
description | BACKGROUND: Using a collection of different terminal nodesize constructed random forests, each generating a synthetic feature, a synthetic random forest is defined as a kind of hyperforest, calculated using the new input synthetic features, along with the original features. RESULTS: Using a large collection of regression and multiclass datasets we show that synthetic random forests outperforms both conventional random forests and the optimized forest from the regresssion portfolio. CONCLUSIONS: Synthetic forests removes the need for tuning random forests with no additional effort on the part of the researcher. Importantly, the synthetic forest does this with evidently no loss in prediction compared to a well-optimized single random forest. |
format | Online Article Text |
id | pubmed-4279689 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-42796892015-01-22 Synthetic learning machines Ishwaran, Hemant Malley, James D BioData Min Methodology BACKGROUND: Using a collection of different terminal nodesize constructed random forests, each generating a synthetic feature, a synthetic random forest is defined as a kind of hyperforest, calculated using the new input synthetic features, along with the original features. RESULTS: Using a large collection of regression and multiclass datasets we show that synthetic random forests outperforms both conventional random forests and the optimized forest from the regresssion portfolio. CONCLUSIONS: Synthetic forests removes the need for tuning random forests with no additional effort on the part of the researcher. Importantly, the synthetic forest does this with evidently no loss in prediction compared to a well-optimized single random forest. BioMed Central 2014-12-18 /pmc/articles/PMC4279689/ /pubmed/25614764 http://dx.doi.org/10.1186/s13040-014-0028-y Text en © Ishwaran and Malley; licensee BioMed Central. 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Ishwaran, Hemant Malley, James D Synthetic learning machines |
title | Synthetic learning machines |
title_full | Synthetic learning machines |
title_fullStr | Synthetic learning machines |
title_full_unstemmed | Synthetic learning machines |
title_short | Synthetic learning machines |
title_sort | synthetic learning machines |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4279689/ https://www.ncbi.nlm.nih.gov/pubmed/25614764 http://dx.doi.org/10.1186/s13040-014-0028-y |
work_keys_str_mv | AT ishwaranhemant syntheticlearningmachines AT malleyjamesd syntheticlearningmachines |