Cargando…

An Assessment of the Factors Influencing the Prediction Accuracy of Genomic Prediction Models Across Multiple Environments

The effects of climate change create formidable challenges for breeders striving to produce sufficient food quantities in rapidly changing environments. It is therefore critical to investigate the ability of multi-environment genomic prediction (GP) models to predict genomic estimated breeding value...

Descripción completa

Detalles Bibliográficos
Autores principales: Widener, Sarah, Graef, George, Lipka, Alexander E., Jarquin, Diego
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8343134/
https://www.ncbi.nlm.nih.gov/pubmed/34367248
http://dx.doi.org/10.3389/fgene.2021.689319
_version_ 1783734213703368704
author Widener, Sarah
Graef, George
Lipka, Alexander E.
Jarquin, Diego
author_facet Widener, Sarah
Graef, George
Lipka, Alexander E.
Jarquin, Diego
author_sort Widener, Sarah
collection PubMed
description The effects of climate change create formidable challenges for breeders striving to produce sufficient food quantities in rapidly changing environments. It is therefore critical to investigate the ability of multi-environment genomic prediction (GP) models to predict genomic estimated breeding values (GEBVs) in extreme environments. Exploration of the impact of training set composition on the accuracy of such GEBVs is also essential. Accordingly, we examined the influence of the number of training environments and the use of environmental covariates (ECs) in GS models on four subsets of n = 500 lines of the soybean nested association mapping (SoyNAM) panel grown in nine environments in the US-North Central Region. The ensuing analyses provided insights into the influence of both of these factors for predicting grain yield in the most and the least extreme of these environments. We found that only a subset of the available environments was needed to obtain the highest observed prediction accuracies. The inclusion of ECs in the GP model did not substantially increase prediction accuracies relative to competing models, and instead more often resulted in negative prediction accuracies. Combined with the overall low prediction accuracies for grain yield in the most extreme environment, our findings highlight weaknesses in current GP approaches for prediction in extreme environments, and point to specific areas on which to focus future research efforts.
format Online
Article
Text
id pubmed-8343134
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-83431342021-08-07 An Assessment of the Factors Influencing the Prediction Accuracy of Genomic Prediction Models Across Multiple Environments Widener, Sarah Graef, George Lipka, Alexander E. Jarquin, Diego Front Genet Genetics The effects of climate change create formidable challenges for breeders striving to produce sufficient food quantities in rapidly changing environments. It is therefore critical to investigate the ability of multi-environment genomic prediction (GP) models to predict genomic estimated breeding values (GEBVs) in extreme environments. Exploration of the impact of training set composition on the accuracy of such GEBVs is also essential. Accordingly, we examined the influence of the number of training environments and the use of environmental covariates (ECs) in GS models on four subsets of n = 500 lines of the soybean nested association mapping (SoyNAM) panel grown in nine environments in the US-North Central Region. The ensuing analyses provided insights into the influence of both of these factors for predicting grain yield in the most and the least extreme of these environments. We found that only a subset of the available environments was needed to obtain the highest observed prediction accuracies. The inclusion of ECs in the GP model did not substantially increase prediction accuracies relative to competing models, and instead more often resulted in negative prediction accuracies. Combined with the overall low prediction accuracies for grain yield in the most extreme environment, our findings highlight weaknesses in current GP approaches for prediction in extreme environments, and point to specific areas on which to focus future research efforts. Frontiers Media S.A. 2021-07-23 /pmc/articles/PMC8343134/ /pubmed/34367248 http://dx.doi.org/10.3389/fgene.2021.689319 Text en Copyright © 2021 Widener, Graef, Lipka and Jarquin. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Widener, Sarah
Graef, George
Lipka, Alexander E.
Jarquin, Diego
An Assessment of the Factors Influencing the Prediction Accuracy of Genomic Prediction Models Across Multiple Environments
title An Assessment of the Factors Influencing the Prediction Accuracy of Genomic Prediction Models Across Multiple Environments
title_full An Assessment of the Factors Influencing the Prediction Accuracy of Genomic Prediction Models Across Multiple Environments
title_fullStr An Assessment of the Factors Influencing the Prediction Accuracy of Genomic Prediction Models Across Multiple Environments
title_full_unstemmed An Assessment of the Factors Influencing the Prediction Accuracy of Genomic Prediction Models Across Multiple Environments
title_short An Assessment of the Factors Influencing the Prediction Accuracy of Genomic Prediction Models Across Multiple Environments
title_sort assessment of the factors influencing the prediction accuracy of genomic prediction models across multiple environments
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8343134/
https://www.ncbi.nlm.nih.gov/pubmed/34367248
http://dx.doi.org/10.3389/fgene.2021.689319
work_keys_str_mv AT widenersarah anassessmentofthefactorsinfluencingthepredictionaccuracyofgenomicpredictionmodelsacrossmultipleenvironments
AT graefgeorge anassessmentofthefactorsinfluencingthepredictionaccuracyofgenomicpredictionmodelsacrossmultipleenvironments
AT lipkaalexandere anassessmentofthefactorsinfluencingthepredictionaccuracyofgenomicpredictionmodelsacrossmultipleenvironments
AT jarquindiego anassessmentofthefactorsinfluencingthepredictionaccuracyofgenomicpredictionmodelsacrossmultipleenvironments
AT widenersarah assessmentofthefactorsinfluencingthepredictionaccuracyofgenomicpredictionmodelsacrossmultipleenvironments
AT graefgeorge assessmentofthefactorsinfluencingthepredictionaccuracyofgenomicpredictionmodelsacrossmultipleenvironments
AT lipkaalexandere assessmentofthefactorsinfluencingthepredictionaccuracyofgenomicpredictionmodelsacrossmultipleenvironments
AT jarquindiego assessmentofthefactorsinfluencingthepredictionaccuracyofgenomicpredictionmodelsacrossmultipleenvironments