Cargando…

TrainSel: An R Package for Selection of Training Populations

A major barrier to the wider use of supervised learning in emerging applications, such as genomic selection, is the lack of sufficient and representative labeled data to train prediction models. The amount and quality of labeled training data in many applications is usually limited and therefore car...

Descripción completa

Detalles Bibliográficos
Autores principales: Akdemir, Deniz, Rio, Simon, Isidro y Sánchez, Julio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8138169/
https://www.ncbi.nlm.nih.gov/pubmed/34025720
http://dx.doi.org/10.3389/fgene.2021.655287
_version_ 1783695756097486848
author Akdemir, Deniz
Rio, Simon
Isidro y Sánchez, Julio
author_facet Akdemir, Deniz
Rio, Simon
Isidro y Sánchez, Julio
author_sort Akdemir, Deniz
collection PubMed
description A major barrier to the wider use of supervised learning in emerging applications, such as genomic selection, is the lack of sufficient and representative labeled data to train prediction models. The amount and quality of labeled training data in many applications is usually limited and therefore careful selection of the training examples to be labeled can be useful for improving the accuracies in predictive learning tasks. In this paper, we present an R package, TrainSel, which provides flexible, efficient, and easy-to-use tools that can be used for the selection of training populations (STP). We illustrate its use, performance, and potentials in four different supervised learning applications within and outside of the plant breeding area.
format Online
Article
Text
id pubmed-8138169
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-81381692021-05-22 TrainSel: An R Package for Selection of Training Populations Akdemir, Deniz Rio, Simon Isidro y Sánchez, Julio Front Genet Genetics A major barrier to the wider use of supervised learning in emerging applications, such as genomic selection, is the lack of sufficient and representative labeled data to train prediction models. The amount and quality of labeled training data in many applications is usually limited and therefore careful selection of the training examples to be labeled can be useful for improving the accuracies in predictive learning tasks. In this paper, we present an R package, TrainSel, which provides flexible, efficient, and easy-to-use tools that can be used for the selection of training populations (STP). We illustrate its use, performance, and potentials in four different supervised learning applications within and outside of the plant breeding area. Frontiers Media S.A. 2021-05-07 /pmc/articles/PMC8138169/ /pubmed/34025720 http://dx.doi.org/10.3389/fgene.2021.655287 Text en Copyright © 2021 Akdemir, Rio and Isidro y Sánchez. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Akdemir, Deniz
Rio, Simon
Isidro y Sánchez, Julio
TrainSel: An R Package for Selection of Training Populations
title TrainSel: An R Package for Selection of Training Populations
title_full TrainSel: An R Package for Selection of Training Populations
title_fullStr TrainSel: An R Package for Selection of Training Populations
title_full_unstemmed TrainSel: An R Package for Selection of Training Populations
title_short TrainSel: An R Package for Selection of Training Populations
title_sort trainsel: an r package for selection of training populations
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8138169/
https://www.ncbi.nlm.nih.gov/pubmed/34025720
http://dx.doi.org/10.3389/fgene.2021.655287
work_keys_str_mv AT akdemirdeniz trainselanrpackageforselectionoftrainingpopulations
AT riosimon trainselanrpackageforselectionoftrainingpopulations
AT isidroysanchezjulio trainselanrpackageforselectionoftrainingpopulations