Cargando…

Genomic Prediction Enhanced Sparse Testing for Multi-environment Trials

“Sparse testing” refers to reduced multi-environment breeding trials in which not all genotypes of interest are grown in each environment. Using genomic-enabled prediction and a model embracing genotype × environment interaction (GE), the non-observed genotype-in-environment combinations can be pred...

Descripción completa

Detalles Bibliográficos
Autores principales: Jarquin, Diego, Howard, Reka, Crossa, Jose, Beyene, Yoseph, Gowda, Manje, Martini, Johannes W. R., Covarrubias Pazaran, Giovanny, Burgueño, Juan, Pacheco, Angela, Grondona, Martin, Wimmer, Valentin, Prasanna, Boddupalli M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7407457/
https://www.ncbi.nlm.nih.gov/pubmed/32527748
http://dx.doi.org/10.1534/g3.120.401349
_version_ 1783567625230483456
author Jarquin, Diego
Howard, Reka
Crossa, Jose
Beyene, Yoseph
Gowda, Manje
Martini, Johannes W. R.
Covarrubias Pazaran, Giovanny
Burgueño, Juan
Pacheco, Angela
Grondona, Martin
Wimmer, Valentin
Prasanna, Boddupalli M.
author_facet Jarquin, Diego
Howard, Reka
Crossa, Jose
Beyene, Yoseph
Gowda, Manje
Martini, Johannes W. R.
Covarrubias Pazaran, Giovanny
Burgueño, Juan
Pacheco, Angela
Grondona, Martin
Wimmer, Valentin
Prasanna, Boddupalli M.
author_sort Jarquin, Diego
collection PubMed
description “Sparse testing” refers to reduced multi-environment breeding trials in which not all genotypes of interest are grown in each environment. Using genomic-enabled prediction and a model embracing genotype × environment interaction (GE), the non-observed genotype-in-environment combinations can be predicted. Consequently, the overall costs can be reduced and the testing capacities can be increased. The accuracy of predicting the unobserved data depends on different factors including (1) how many genotypes overlap between environments, (2) in how many environments each genotype is grown, and (3) which prediction method is used. In this research, we studied the predictive ability obtained when using a fixed number of plots and different sparse testing designs. The considered designs included the extreme cases of (1) no overlap of genotypes between environments, and (2) complete overlap of the genotypes between environments. In the latter case, the prediction set fully consists of genotypes that have not been tested at all. Moreover, we gradually go from one extreme to the other considering (3) intermediates between the two previous cases with varying numbers of different or non-overlapping (NO)/overlapping (O) genotypes. The empirical study is built upon two different maize hybrid data sets consisting of different genotypes crossed to two different testers (T1 and T2) and each data set was analyzed separately. For each set, phenotypic records on yield from three different environments are available. Three different prediction models were implemented, two main effects models (M1 and M2), and a model (M3) including GE. The results showed that the genome-based model including GE (M3) captured more phenotypic variation than the models that did not include this component. Also, M3 provided higher prediction accuracy than models M1 and M2 for the different allocation scenarios. Reducing the size of the calibration sets decreased the prediction accuracy under all allocation designs with M3 being the less affected model; however, using the genome-enabled models (i.e., M2 and M3) the predictive ability is recovered when more genotypes are tested across environments. Our results indicate that a substantial part of the testing resources can be saved when using genome-based models including GE for optimizing sparse testing designs.
format Online
Article
Text
id pubmed-7407457
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-74074572020-08-19 Genomic Prediction Enhanced Sparse Testing for Multi-environment Trials Jarquin, Diego Howard, Reka Crossa, Jose Beyene, Yoseph Gowda, Manje Martini, Johannes W. R. Covarrubias Pazaran, Giovanny Burgueño, Juan Pacheco, Angela Grondona, Martin Wimmer, Valentin Prasanna, Boddupalli M. G3 (Bethesda) Genomic Prediction “Sparse testing” refers to reduced multi-environment breeding trials in which not all genotypes of interest are grown in each environment. Using genomic-enabled prediction and a model embracing genotype × environment interaction (GE), the non-observed genotype-in-environment combinations can be predicted. Consequently, the overall costs can be reduced and the testing capacities can be increased. The accuracy of predicting the unobserved data depends on different factors including (1) how many genotypes overlap between environments, (2) in how many environments each genotype is grown, and (3) which prediction method is used. In this research, we studied the predictive ability obtained when using a fixed number of plots and different sparse testing designs. The considered designs included the extreme cases of (1) no overlap of genotypes between environments, and (2) complete overlap of the genotypes between environments. In the latter case, the prediction set fully consists of genotypes that have not been tested at all. Moreover, we gradually go from one extreme to the other considering (3) intermediates between the two previous cases with varying numbers of different or non-overlapping (NO)/overlapping (O) genotypes. The empirical study is built upon two different maize hybrid data sets consisting of different genotypes crossed to two different testers (T1 and T2) and each data set was analyzed separately. For each set, phenotypic records on yield from three different environments are available. Three different prediction models were implemented, two main effects models (M1 and M2), and a model (M3) including GE. The results showed that the genome-based model including GE (M3) captured more phenotypic variation than the models that did not include this component. Also, M3 provided higher prediction accuracy than models M1 and M2 for the different allocation scenarios. Reducing the size of the calibration sets decreased the prediction accuracy under all allocation designs with M3 being the less affected model; however, using the genome-enabled models (i.e., M2 and M3) the predictive ability is recovered when more genotypes are tested across environments. Our results indicate that a substantial part of the testing resources can be saved when using genome-based models including GE for optimizing sparse testing designs. Genetics Society of America 2020-06-11 /pmc/articles/PMC7407457/ /pubmed/32527748 http://dx.doi.org/10.1534/g3.120.401349 Text en Copyright © 2020 Jarquin et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Genomic Prediction
Jarquin, Diego
Howard, Reka
Crossa, Jose
Beyene, Yoseph
Gowda, Manje
Martini, Johannes W. R.
Covarrubias Pazaran, Giovanny
Burgueño, Juan
Pacheco, Angela
Grondona, Martin
Wimmer, Valentin
Prasanna, Boddupalli M.
Genomic Prediction Enhanced Sparse Testing for Multi-environment Trials
title Genomic Prediction Enhanced Sparse Testing for Multi-environment Trials
title_full Genomic Prediction Enhanced Sparse Testing for Multi-environment Trials
title_fullStr Genomic Prediction Enhanced Sparse Testing for Multi-environment Trials
title_full_unstemmed Genomic Prediction Enhanced Sparse Testing for Multi-environment Trials
title_short Genomic Prediction Enhanced Sparse Testing for Multi-environment Trials
title_sort genomic prediction enhanced sparse testing for multi-environment trials
topic Genomic Prediction
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7407457/
https://www.ncbi.nlm.nih.gov/pubmed/32527748
http://dx.doi.org/10.1534/g3.120.401349
work_keys_str_mv AT jarquindiego genomicpredictionenhancedsparsetestingformultienvironmenttrials
AT howardreka genomicpredictionenhancedsparsetestingformultienvironmenttrials
AT crossajose genomicpredictionenhancedsparsetestingformultienvironmenttrials
AT beyeneyoseph genomicpredictionenhancedsparsetestingformultienvironmenttrials
AT gowdamanje genomicpredictionenhancedsparsetestingformultienvironmenttrials
AT martinijohanneswr genomicpredictionenhancedsparsetestingformultienvironmenttrials
AT covarrubiaspazarangiovanny genomicpredictionenhancedsparsetestingformultienvironmenttrials
AT burguenojuan genomicpredictionenhancedsparsetestingformultienvironmenttrials
AT pachecoangela genomicpredictionenhancedsparsetestingformultienvironmenttrials
AT grondonamartin genomicpredictionenhancedsparsetestingformultienvironmenttrials
AT wimmervalentin genomicpredictionenhancedsparsetestingformultienvironmenttrials
AT prasannaboddupallim genomicpredictionenhancedsparsetestingformultienvironmenttrials