Cargando…
TrainSel: An R Package for Selection of Training Populations
A major barrier to the wider use of supervised learning in emerging applications, such as genomic selection, is the lack of sufficient and representative labeled data to train prediction models. The amount and quality of labeled training data in many applications is usually limited and therefore car...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8138169/ https://www.ncbi.nlm.nih.gov/pubmed/34025720 http://dx.doi.org/10.3389/fgene.2021.655287 |
_version_ | 1783695756097486848 |
---|---|
author | Akdemir, Deniz Rio, Simon Isidro y Sánchez, Julio |
author_facet | Akdemir, Deniz Rio, Simon Isidro y Sánchez, Julio |
author_sort | Akdemir, Deniz |
collection | PubMed |
description | A major barrier to the wider use of supervised learning in emerging applications, such as genomic selection, is the lack of sufficient and representative labeled data to train prediction models. The amount and quality of labeled training data in many applications is usually limited and therefore careful selection of the training examples to be labeled can be useful for improving the accuracies in predictive learning tasks. In this paper, we present an R package, TrainSel, which provides flexible, efficient, and easy-to-use tools that can be used for the selection of training populations (STP). We illustrate its use, performance, and potentials in four different supervised learning applications within and outside of the plant breeding area. |
format | Online Article Text |
id | pubmed-8138169 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-81381692021-05-22 TrainSel: An R Package for Selection of Training Populations Akdemir, Deniz Rio, Simon Isidro y Sánchez, Julio Front Genet Genetics A major barrier to the wider use of supervised learning in emerging applications, such as genomic selection, is the lack of sufficient and representative labeled data to train prediction models. The amount and quality of labeled training data in many applications is usually limited and therefore careful selection of the training examples to be labeled can be useful for improving the accuracies in predictive learning tasks. In this paper, we present an R package, TrainSel, which provides flexible, efficient, and easy-to-use tools that can be used for the selection of training populations (STP). We illustrate its use, performance, and potentials in four different supervised learning applications within and outside of the plant breeding area. Frontiers Media S.A. 2021-05-07 /pmc/articles/PMC8138169/ /pubmed/34025720 http://dx.doi.org/10.3389/fgene.2021.655287 Text en Copyright © 2021 Akdemir, Rio and Isidro y Sánchez. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Akdemir, Deniz Rio, Simon Isidro y Sánchez, Julio TrainSel: An R Package for Selection of Training Populations |
title | TrainSel: An R Package for Selection of Training Populations |
title_full | TrainSel: An R Package for Selection of Training Populations |
title_fullStr | TrainSel: An R Package for Selection of Training Populations |
title_full_unstemmed | TrainSel: An R Package for Selection of Training Populations |
title_short | TrainSel: An R Package for Selection of Training Populations |
title_sort | trainsel: an r package for selection of training populations |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8138169/ https://www.ncbi.nlm.nih.gov/pubmed/34025720 http://dx.doi.org/10.3389/fgene.2021.655287 |
work_keys_str_mv | AT akdemirdeniz trainselanrpackageforselectionoftrainingpopulations AT riosimon trainselanrpackageforselectionoftrainingpopulations AT isidroysanchezjulio trainselanrpackageforselectionoftrainingpopulations |