Cargando…
An Assessment of the Factors Influencing the Prediction Accuracy of Genomic Prediction Models Across Multiple Environments
The effects of climate change create formidable challenges for breeders striving to produce sufficient food quantities in rapidly changing environments. It is therefore critical to investigate the ability of multi-environment genomic prediction (GP) models to predict genomic estimated breeding value...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8343134/ https://www.ncbi.nlm.nih.gov/pubmed/34367248 http://dx.doi.org/10.3389/fgene.2021.689319 |
_version_ | 1783734213703368704 |
---|---|
author | Widener, Sarah Graef, George Lipka, Alexander E. Jarquin, Diego |
author_facet | Widener, Sarah Graef, George Lipka, Alexander E. Jarquin, Diego |
author_sort | Widener, Sarah |
collection | PubMed |
description | The effects of climate change create formidable challenges for breeders striving to produce sufficient food quantities in rapidly changing environments. It is therefore critical to investigate the ability of multi-environment genomic prediction (GP) models to predict genomic estimated breeding values (GEBVs) in extreme environments. Exploration of the impact of training set composition on the accuracy of such GEBVs is also essential. Accordingly, we examined the influence of the number of training environments and the use of environmental covariates (ECs) in GS models on four subsets of n = 500 lines of the soybean nested association mapping (SoyNAM) panel grown in nine environments in the US-North Central Region. The ensuing analyses provided insights into the influence of both of these factors for predicting grain yield in the most and the least extreme of these environments. We found that only a subset of the available environments was needed to obtain the highest observed prediction accuracies. The inclusion of ECs in the GP model did not substantially increase prediction accuracies relative to competing models, and instead more often resulted in negative prediction accuracies. Combined with the overall low prediction accuracies for grain yield in the most extreme environment, our findings highlight weaknesses in current GP approaches for prediction in extreme environments, and point to specific areas on which to focus future research efforts. |
format | Online Article Text |
id | pubmed-8343134 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-83431342021-08-07 An Assessment of the Factors Influencing the Prediction Accuracy of Genomic Prediction Models Across Multiple Environments Widener, Sarah Graef, George Lipka, Alexander E. Jarquin, Diego Front Genet Genetics The effects of climate change create formidable challenges for breeders striving to produce sufficient food quantities in rapidly changing environments. It is therefore critical to investigate the ability of multi-environment genomic prediction (GP) models to predict genomic estimated breeding values (GEBVs) in extreme environments. Exploration of the impact of training set composition on the accuracy of such GEBVs is also essential. Accordingly, we examined the influence of the number of training environments and the use of environmental covariates (ECs) in GS models on four subsets of n = 500 lines of the soybean nested association mapping (SoyNAM) panel grown in nine environments in the US-North Central Region. The ensuing analyses provided insights into the influence of both of these factors for predicting grain yield in the most and the least extreme of these environments. We found that only a subset of the available environments was needed to obtain the highest observed prediction accuracies. The inclusion of ECs in the GP model did not substantially increase prediction accuracies relative to competing models, and instead more often resulted in negative prediction accuracies. Combined with the overall low prediction accuracies for grain yield in the most extreme environment, our findings highlight weaknesses in current GP approaches for prediction in extreme environments, and point to specific areas on which to focus future research efforts. Frontiers Media S.A. 2021-07-23 /pmc/articles/PMC8343134/ /pubmed/34367248 http://dx.doi.org/10.3389/fgene.2021.689319 Text en Copyright © 2021 Widener, Graef, Lipka and Jarquin. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Widener, Sarah Graef, George Lipka, Alexander E. Jarquin, Diego An Assessment of the Factors Influencing the Prediction Accuracy of Genomic Prediction Models Across Multiple Environments |
title | An Assessment of the Factors Influencing the Prediction Accuracy of Genomic Prediction Models Across Multiple Environments |
title_full | An Assessment of the Factors Influencing the Prediction Accuracy of Genomic Prediction Models Across Multiple Environments |
title_fullStr | An Assessment of the Factors Influencing the Prediction Accuracy of Genomic Prediction Models Across Multiple Environments |
title_full_unstemmed | An Assessment of the Factors Influencing the Prediction Accuracy of Genomic Prediction Models Across Multiple Environments |
title_short | An Assessment of the Factors Influencing the Prediction Accuracy of Genomic Prediction Models Across Multiple Environments |
title_sort | assessment of the factors influencing the prediction accuracy of genomic prediction models across multiple environments |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8343134/ https://www.ncbi.nlm.nih.gov/pubmed/34367248 http://dx.doi.org/10.3389/fgene.2021.689319 |
work_keys_str_mv | AT widenersarah anassessmentofthefactorsinfluencingthepredictionaccuracyofgenomicpredictionmodelsacrossmultipleenvironments AT graefgeorge anassessmentofthefactorsinfluencingthepredictionaccuracyofgenomicpredictionmodelsacrossmultipleenvironments AT lipkaalexandere anassessmentofthefactorsinfluencingthepredictionaccuracyofgenomicpredictionmodelsacrossmultipleenvironments AT jarquindiego anassessmentofthefactorsinfluencingthepredictionaccuracyofgenomicpredictionmodelsacrossmultipleenvironments AT widenersarah assessmentofthefactorsinfluencingthepredictionaccuracyofgenomicpredictionmodelsacrossmultipleenvironments AT graefgeorge assessmentofthefactorsinfluencingthepredictionaccuracyofgenomicpredictionmodelsacrossmultipleenvironments AT lipkaalexandere assessmentofthefactorsinfluencingthepredictionaccuracyofgenomicpredictionmodelsacrossmultipleenvironments AT jarquindiego assessmentofthefactorsinfluencingthepredictionaccuracyofgenomicpredictionmodelsacrossmultipleenvironments |