Cargando…
learnMET: an R package to apply machine learning methods for genomic prediction using multi-environment trial data
We introduce the R-package learnMET, developed as a flexible framework to enable a collection of analyses on multi-environment trial breeding data with machine learning-based models. learnMET allows the combination of genomic information with environmental data such as climate and/or soil characteri...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9635651/ https://www.ncbi.nlm.nih.gov/pubmed/36124944 http://dx.doi.org/10.1093/g3journal/jkac226 |
_version_ | 1784824755056017408 |
---|---|
author | Westhues, Cathy C Simianer, Henner Beissinger, Timothy M |
author_facet | Westhues, Cathy C Simianer, Henner Beissinger, Timothy M |
author_sort | Westhues, Cathy C |
collection | PubMed |
description | We introduce the R-package learnMET, developed as a flexible framework to enable a collection of analyses on multi-environment trial breeding data with machine learning-based models. learnMET allows the combination of genomic information with environmental data such as climate and/or soil characteristics. Notably, the package offers the possibility of incorporating weather data from field weather stations, or to retrieve global meteorological datasets from a NASA database. Daily weather data can be aggregated over specific periods of time based on naive (for instance, nonoverlapping 10-day windows) or phenological approaches. Different machine learning methods for genomic prediction are implemented, including gradient-boosted decision trees, random forests, stacked ensemble models, and multilayer perceptrons. These prediction models can be evaluated via a collection of cross-validation schemes that mimic typical scenarios encountered by plant breeders working with multi-environment trial experimental data in a user-friendly way. The package is published under an MIT license and accessible on GitHub. |
format | Online Article Text |
id | pubmed-9635651 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-96356512022-11-07 learnMET: an R package to apply machine learning methods for genomic prediction using multi-environment trial data Westhues, Cathy C Simianer, Henner Beissinger, Timothy M G3 (Bethesda) Software and Data Resources We introduce the R-package learnMET, developed as a flexible framework to enable a collection of analyses on multi-environment trial breeding data with machine learning-based models. learnMET allows the combination of genomic information with environmental data such as climate and/or soil characteristics. Notably, the package offers the possibility of incorporating weather data from field weather stations, or to retrieve global meteorological datasets from a NASA database. Daily weather data can be aggregated over specific periods of time based on naive (for instance, nonoverlapping 10-day windows) or phenological approaches. Different machine learning methods for genomic prediction are implemented, including gradient-boosted decision trees, random forests, stacked ensemble models, and multilayer perceptrons. These prediction models can be evaluated via a collection of cross-validation schemes that mimic typical scenarios encountered by plant breeders working with multi-environment trial experimental data in a user-friendly way. The package is published under an MIT license and accessible on GitHub. Oxford University Press 2022-09-19 /pmc/articles/PMC9635651/ /pubmed/36124944 http://dx.doi.org/10.1093/g3journal/jkac226 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Genetics Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Software and Data Resources Westhues, Cathy C Simianer, Henner Beissinger, Timothy M learnMET: an R package to apply machine learning methods for genomic prediction using multi-environment trial data |
title | learnMET: an R package to apply machine learning methods for genomic prediction using multi-environment trial data |
title_full | learnMET: an R package to apply machine learning methods for genomic prediction using multi-environment trial data |
title_fullStr | learnMET: an R package to apply machine learning methods for genomic prediction using multi-environment trial data |
title_full_unstemmed | learnMET: an R package to apply machine learning methods for genomic prediction using multi-environment trial data |
title_short | learnMET: an R package to apply machine learning methods for genomic prediction using multi-environment trial data |
title_sort | learnmet: an r package to apply machine learning methods for genomic prediction using multi-environment trial data |
topic | Software and Data Resources |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9635651/ https://www.ncbi.nlm.nih.gov/pubmed/36124944 http://dx.doi.org/10.1093/g3journal/jkac226 |
work_keys_str_mv | AT westhuescathyc learnmetanrpackagetoapplymachinelearningmethodsforgenomicpredictionusingmultienvironmenttrialdata AT simianerhenner learnmetanrpackagetoapplymachinelearningmethodsforgenomicpredictionusingmultienvironmenttrialdata AT beissingertimothym learnmetanrpackagetoapplymachinelearningmethodsforgenomicpredictionusingmultienvironmenttrialdata |