Cargando…

An R package for ensemble learning stacking

SUMMARY: Supervised learning is widely used in biology for prediction, and ensemble learning, including stacking, is a promising technique for increasing and stabilizing the prediction accuracy. In this study, we developed an R package for stacking. This package depends on the R package caret and ca...

Descripción completa

Detalles Bibliográficos
Autores principales: Nukui, Taichi, Onogi, Akio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10561990/
https://www.ncbi.nlm.nih.gov/pubmed/37818175
http://dx.doi.org/10.1093/bioadv/vbad139
_version_ 1785118031676964864
author Nukui, Taichi
Onogi, Akio
author_facet Nukui, Taichi
Onogi, Akio
author_sort Nukui, Taichi
collection PubMed
description SUMMARY: Supervised learning is widely used in biology for prediction, and ensemble learning, including stacking, is a promising technique for increasing and stabilizing the prediction accuracy. In this study, we developed an R package for stacking. This package depends on the R package caret and can handle models supported by caret. Stacking involves cross-validation of training data with multiple base learners, and the predicted values are used as explanatory variables for the meta-learner. In the prediction, the testing data were fed into the base models, and the returned values were averaged for each base learner. The averaged values were then fed into the meta-model, and the final predictions were returned. Using this package, the training and prediction procedures for stacking can be conducted using one-row scripts. AVAILABILITY AND IMPLEMENTATION: The R package stacking is available at the Comprehensive R Archive Network (CRAN) (https://cran.r-project.org/) and GitHub (https://github.com/Onogi/stacking). R scripts to reproduce the presented results are also reposited at GitHub.
format Online
Article
Text
id pubmed-10561990
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-105619902023-10-10 An R package for ensemble learning stacking Nukui, Taichi Onogi, Akio Bioinform Adv Application Note SUMMARY: Supervised learning is widely used in biology for prediction, and ensemble learning, including stacking, is a promising technique for increasing and stabilizing the prediction accuracy. In this study, we developed an R package for stacking. This package depends on the R package caret and can handle models supported by caret. Stacking involves cross-validation of training data with multiple base learners, and the predicted values are used as explanatory variables for the meta-learner. In the prediction, the testing data were fed into the base models, and the returned values were averaged for each base learner. The averaged values were then fed into the meta-model, and the final predictions were returned. Using this package, the training and prediction procedures for stacking can be conducted using one-row scripts. AVAILABILITY AND IMPLEMENTATION: The R package stacking is available at the Comprehensive R Archive Network (CRAN) (https://cran.r-project.org/) and GitHub (https://github.com/Onogi/stacking). R scripts to reproduce the presented results are also reposited at GitHub. Oxford University Press 2023-09-29 /pmc/articles/PMC10561990/ /pubmed/37818175 http://dx.doi.org/10.1093/bioadv/vbad139 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Application Note
Nukui, Taichi
Onogi, Akio
An R package for ensemble learning stacking
title An R package for ensemble learning stacking
title_full An R package for ensemble learning stacking
title_fullStr An R package for ensemble learning stacking
title_full_unstemmed An R package for ensemble learning stacking
title_short An R package for ensemble learning stacking
title_sort r package for ensemble learning stacking
topic Application Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10561990/
https://www.ncbi.nlm.nih.gov/pubmed/37818175
http://dx.doi.org/10.1093/bioadv/vbad139
work_keys_str_mv AT nukuitaichi anrpackageforensemblelearningstacking
AT onogiakio anrpackageforensemblelearningstacking
AT nukuitaichi rpackageforensemblelearningstacking
AT onogiakio rpackageforensemblelearningstacking