Cargando…

Training population selection and use of fixed effects to optimize genomic predictions in a historical USA winter wheat panel

KEY MESSAGE: The optimization of training populations and the use of diagnostic markers as fixed effects increase the predictive ability of genomic prediction models in a cooperative wheat breeding panel. ABSTRACT: Plant breeding programs often have access to a large amount of historical data that i...

Descripción completa

Detalles Bibliográficos
Autores principales: Sarinelli, J. Martin, Murphy, J. Paul, Tyagi, Priyanka, Holland, James B., Johnson, Jerry W., Mergoum, Mohamed, Mason, Richard E., Babar, Ali, Harrison, Stephen, Sutton, Russell, Griffey, Carl A., Brown-Guedira, Gina
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6449317/
https://www.ncbi.nlm.nih.gov/pubmed/30680419
http://dx.doi.org/10.1007/s00122-019-03276-6
_version_ 1783408817809129472
author Sarinelli, J. Martin
Murphy, J. Paul
Tyagi, Priyanka
Holland, James B.
Johnson, Jerry W.
Mergoum, Mohamed
Mason, Richard E.
Babar, Ali
Harrison, Stephen
Sutton, Russell
Griffey, Carl A.
Brown-Guedira, Gina
author_facet Sarinelli, J. Martin
Murphy, J. Paul
Tyagi, Priyanka
Holland, James B.
Johnson, Jerry W.
Mergoum, Mohamed
Mason, Richard E.
Babar, Ali
Harrison, Stephen
Sutton, Russell
Griffey, Carl A.
Brown-Guedira, Gina
author_sort Sarinelli, J. Martin
collection PubMed
description KEY MESSAGE: The optimization of training populations and the use of diagnostic markers as fixed effects increase the predictive ability of genomic prediction models in a cooperative wheat breeding panel. ABSTRACT: Plant breeding programs often have access to a large amount of historical data that is highly unbalanced, particularly across years. This study examined approaches to utilize these data sets as training populations to integrate genomic selection into existing pipelines. We used cross-validation to evaluate predictive ability in an unbalanced data set of 467 winter wheat (Triticum aestivum L.) genotypes evaluated in the Gulf Atlantic Wheat Nursery from 2008 to 2016. We evaluated the impact of different training population sizes and training population selection methods (Random, Clustering, PEVmean and PEVmean1) on predictive ability. We also evaluated inclusion of markers associated with major genes as fixed effects in prediction models for heading date, plant height, and resistance to powdery mildew (caused by Blumeria graminis f. sp. tritici). Increases in predictive ability as the size of the training population increased were more evident for Random and Clustering training population selection methods than for PEVmean and PEVmean1. The selection methods based on minimization of the prediction error variance (PEV) outperformed the Random and Clustering methods across all the population sizes. Major genes added as fixed effects always improved model predictive ability, with the greatest gains coming from combinations of multiple genes. Maximum predictabilities among all prediction methods were 0.64 for grain yield, 0.56 for test weight, 0.71 for heading date, 0.73 for plant height, and 0.60 for powdery mildew resistance. Our results demonstrate the utility of combining unbalanced phenotypic records with genome-wide SNP marker data for predicting the performance of untested genotypes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1007/s00122-019-03276-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6449317
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-64493172019-04-17 Training population selection and use of fixed effects to optimize genomic predictions in a historical USA winter wheat panel Sarinelli, J. Martin Murphy, J. Paul Tyagi, Priyanka Holland, James B. Johnson, Jerry W. Mergoum, Mohamed Mason, Richard E. Babar, Ali Harrison, Stephen Sutton, Russell Griffey, Carl A. Brown-Guedira, Gina Theor Appl Genet Original Article KEY MESSAGE: The optimization of training populations and the use of diagnostic markers as fixed effects increase the predictive ability of genomic prediction models in a cooperative wheat breeding panel. ABSTRACT: Plant breeding programs often have access to a large amount of historical data that is highly unbalanced, particularly across years. This study examined approaches to utilize these data sets as training populations to integrate genomic selection into existing pipelines. We used cross-validation to evaluate predictive ability in an unbalanced data set of 467 winter wheat (Triticum aestivum L.) genotypes evaluated in the Gulf Atlantic Wheat Nursery from 2008 to 2016. We evaluated the impact of different training population sizes and training population selection methods (Random, Clustering, PEVmean and PEVmean1) on predictive ability. We also evaluated inclusion of markers associated with major genes as fixed effects in prediction models for heading date, plant height, and resistance to powdery mildew (caused by Blumeria graminis f. sp. tritici). Increases in predictive ability as the size of the training population increased were more evident for Random and Clustering training population selection methods than for PEVmean and PEVmean1. The selection methods based on minimization of the prediction error variance (PEV) outperformed the Random and Clustering methods across all the population sizes. Major genes added as fixed effects always improved model predictive ability, with the greatest gains coming from combinations of multiple genes. Maximum predictabilities among all prediction methods were 0.64 for grain yield, 0.56 for test weight, 0.71 for heading date, 0.73 for plant height, and 0.60 for powdery mildew resistance. Our results demonstrate the utility of combining unbalanced phenotypic records with genome-wide SNP marker data for predicting the performance of untested genotypes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1007/s00122-019-03276-6) contains supplementary material, which is available to authorized users. Springer Berlin Heidelberg 2019-01-24 2019 /pmc/articles/PMC6449317/ /pubmed/30680419 http://dx.doi.org/10.1007/s00122-019-03276-6 Text en © The Author(s) 2019 OpenAccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Original Article
Sarinelli, J. Martin
Murphy, J. Paul
Tyagi, Priyanka
Holland, James B.
Johnson, Jerry W.
Mergoum, Mohamed
Mason, Richard E.
Babar, Ali
Harrison, Stephen
Sutton, Russell
Griffey, Carl A.
Brown-Guedira, Gina
Training population selection and use of fixed effects to optimize genomic predictions in a historical USA winter wheat panel
title Training population selection and use of fixed effects to optimize genomic predictions in a historical USA winter wheat panel
title_full Training population selection and use of fixed effects to optimize genomic predictions in a historical USA winter wheat panel
title_fullStr Training population selection and use of fixed effects to optimize genomic predictions in a historical USA winter wheat panel
title_full_unstemmed Training population selection and use of fixed effects to optimize genomic predictions in a historical USA winter wheat panel
title_short Training population selection and use of fixed effects to optimize genomic predictions in a historical USA winter wheat panel
title_sort training population selection and use of fixed effects to optimize genomic predictions in a historical usa winter wheat panel
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6449317/
https://www.ncbi.nlm.nih.gov/pubmed/30680419
http://dx.doi.org/10.1007/s00122-019-03276-6
work_keys_str_mv AT sarinellijmartin trainingpopulationselectionanduseoffixedeffectstooptimizegenomicpredictionsinahistoricalusawinterwheatpanel
AT murphyjpaul trainingpopulationselectionanduseoffixedeffectstooptimizegenomicpredictionsinahistoricalusawinterwheatpanel
AT tyagipriyanka trainingpopulationselectionanduseoffixedeffectstooptimizegenomicpredictionsinahistoricalusawinterwheatpanel
AT hollandjamesb trainingpopulationselectionanduseoffixedeffectstooptimizegenomicpredictionsinahistoricalusawinterwheatpanel
AT johnsonjerryw trainingpopulationselectionanduseoffixedeffectstooptimizegenomicpredictionsinahistoricalusawinterwheatpanel
AT mergoummohamed trainingpopulationselectionanduseoffixedeffectstooptimizegenomicpredictionsinahistoricalusawinterwheatpanel
AT masonricharde trainingpopulationselectionanduseoffixedeffectstooptimizegenomicpredictionsinahistoricalusawinterwheatpanel
AT babarali trainingpopulationselectionanduseoffixedeffectstooptimizegenomicpredictionsinahistoricalusawinterwheatpanel
AT harrisonstephen trainingpopulationselectionanduseoffixedeffectstooptimizegenomicpredictionsinahistoricalusawinterwheatpanel
AT suttonrussell trainingpopulationselectionanduseoffixedeffectstooptimizegenomicpredictionsinahistoricalusawinterwheatpanel
AT griffeycarla trainingpopulationselectionanduseoffixedeffectstooptimizegenomicpredictionsinahistoricalusawinterwheatpanel
AT brownguediragina trainingpopulationselectionanduseoffixedeffectstooptimizegenomicpredictionsinahistoricalusawinterwheatpanel