Cargando…
Genomic Prediction Enhanced Sparse Testing for Multi-environment Trials
“Sparse testing” refers to reduced multi-environment breeding trials in which not all genotypes of interest are grown in each environment. Using genomic-enabled prediction and a model embracing genotype × environment interaction (GE), the non-observed genotype-in-environment combinations can be pred...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Genetics Society of America
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7407457/ https://www.ncbi.nlm.nih.gov/pubmed/32527748 http://dx.doi.org/10.1534/g3.120.401349 |
_version_ | 1783567625230483456 |
---|---|
author | Jarquin, Diego Howard, Reka Crossa, Jose Beyene, Yoseph Gowda, Manje Martini, Johannes W. R. Covarrubias Pazaran, Giovanny Burgueño, Juan Pacheco, Angela Grondona, Martin Wimmer, Valentin Prasanna, Boddupalli M. |
author_facet | Jarquin, Diego Howard, Reka Crossa, Jose Beyene, Yoseph Gowda, Manje Martini, Johannes W. R. Covarrubias Pazaran, Giovanny Burgueño, Juan Pacheco, Angela Grondona, Martin Wimmer, Valentin Prasanna, Boddupalli M. |
author_sort | Jarquin, Diego |
collection | PubMed |
description | “Sparse testing” refers to reduced multi-environment breeding trials in which not all genotypes of interest are grown in each environment. Using genomic-enabled prediction and a model embracing genotype × environment interaction (GE), the non-observed genotype-in-environment combinations can be predicted. Consequently, the overall costs can be reduced and the testing capacities can be increased. The accuracy of predicting the unobserved data depends on different factors including (1) how many genotypes overlap between environments, (2) in how many environments each genotype is grown, and (3) which prediction method is used. In this research, we studied the predictive ability obtained when using a fixed number of plots and different sparse testing designs. The considered designs included the extreme cases of (1) no overlap of genotypes between environments, and (2) complete overlap of the genotypes between environments. In the latter case, the prediction set fully consists of genotypes that have not been tested at all. Moreover, we gradually go from one extreme to the other considering (3) intermediates between the two previous cases with varying numbers of different or non-overlapping (NO)/overlapping (O) genotypes. The empirical study is built upon two different maize hybrid data sets consisting of different genotypes crossed to two different testers (T1 and T2) and each data set was analyzed separately. For each set, phenotypic records on yield from three different environments are available. Three different prediction models were implemented, two main effects models (M1 and M2), and a model (M3) including GE. The results showed that the genome-based model including GE (M3) captured more phenotypic variation than the models that did not include this component. Also, M3 provided higher prediction accuracy than models M1 and M2 for the different allocation scenarios. Reducing the size of the calibration sets decreased the prediction accuracy under all allocation designs with M3 being the less affected model; however, using the genome-enabled models (i.e., M2 and M3) the predictive ability is recovered when more genotypes are tested across environments. Our results indicate that a substantial part of the testing resources can be saved when using genome-based models including GE for optimizing sparse testing designs. |
format | Online Article Text |
id | pubmed-7407457 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Genetics Society of America |
record_format | MEDLINE/PubMed |
spelling | pubmed-74074572020-08-19 Genomic Prediction Enhanced Sparse Testing for Multi-environment Trials Jarquin, Diego Howard, Reka Crossa, Jose Beyene, Yoseph Gowda, Manje Martini, Johannes W. R. Covarrubias Pazaran, Giovanny Burgueño, Juan Pacheco, Angela Grondona, Martin Wimmer, Valentin Prasanna, Boddupalli M. G3 (Bethesda) Genomic Prediction “Sparse testing” refers to reduced multi-environment breeding trials in which not all genotypes of interest are grown in each environment. Using genomic-enabled prediction and a model embracing genotype × environment interaction (GE), the non-observed genotype-in-environment combinations can be predicted. Consequently, the overall costs can be reduced and the testing capacities can be increased. The accuracy of predicting the unobserved data depends on different factors including (1) how many genotypes overlap between environments, (2) in how many environments each genotype is grown, and (3) which prediction method is used. In this research, we studied the predictive ability obtained when using a fixed number of plots and different sparse testing designs. The considered designs included the extreme cases of (1) no overlap of genotypes between environments, and (2) complete overlap of the genotypes between environments. In the latter case, the prediction set fully consists of genotypes that have not been tested at all. Moreover, we gradually go from one extreme to the other considering (3) intermediates between the two previous cases with varying numbers of different or non-overlapping (NO)/overlapping (O) genotypes. The empirical study is built upon two different maize hybrid data sets consisting of different genotypes crossed to two different testers (T1 and T2) and each data set was analyzed separately. For each set, phenotypic records on yield from three different environments are available. Three different prediction models were implemented, two main effects models (M1 and M2), and a model (M3) including GE. The results showed that the genome-based model including GE (M3) captured more phenotypic variation than the models that did not include this component. Also, M3 provided higher prediction accuracy than models M1 and M2 for the different allocation scenarios. Reducing the size of the calibration sets decreased the prediction accuracy under all allocation designs with M3 being the less affected model; however, using the genome-enabled models (i.e., M2 and M3) the predictive ability is recovered when more genotypes are tested across environments. Our results indicate that a substantial part of the testing resources can be saved when using genome-based models including GE for optimizing sparse testing designs. Genetics Society of America 2020-06-11 /pmc/articles/PMC7407457/ /pubmed/32527748 http://dx.doi.org/10.1534/g3.120.401349 Text en Copyright © 2020 Jarquin et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Genomic Prediction Jarquin, Diego Howard, Reka Crossa, Jose Beyene, Yoseph Gowda, Manje Martini, Johannes W. R. Covarrubias Pazaran, Giovanny Burgueño, Juan Pacheco, Angela Grondona, Martin Wimmer, Valentin Prasanna, Boddupalli M. Genomic Prediction Enhanced Sparse Testing for Multi-environment Trials |
title | Genomic Prediction Enhanced Sparse Testing for Multi-environment Trials |
title_full | Genomic Prediction Enhanced Sparse Testing for Multi-environment Trials |
title_fullStr | Genomic Prediction Enhanced Sparse Testing for Multi-environment Trials |
title_full_unstemmed | Genomic Prediction Enhanced Sparse Testing for Multi-environment Trials |
title_short | Genomic Prediction Enhanced Sparse Testing for Multi-environment Trials |
title_sort | genomic prediction enhanced sparse testing for multi-environment trials |
topic | Genomic Prediction |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7407457/ https://www.ncbi.nlm.nih.gov/pubmed/32527748 http://dx.doi.org/10.1534/g3.120.401349 |
work_keys_str_mv | AT jarquindiego genomicpredictionenhancedsparsetestingformultienvironmenttrials AT howardreka genomicpredictionenhancedsparsetestingformultienvironmenttrials AT crossajose genomicpredictionenhancedsparsetestingformultienvironmenttrials AT beyeneyoseph genomicpredictionenhancedsparsetestingformultienvironmenttrials AT gowdamanje genomicpredictionenhancedsparsetestingformultienvironmenttrials AT martinijohanneswr genomicpredictionenhancedsparsetestingformultienvironmenttrials AT covarrubiaspazarangiovanny genomicpredictionenhancedsparsetestingformultienvironmenttrials AT burguenojuan genomicpredictionenhancedsparsetestingformultienvironmenttrials AT pachecoangela genomicpredictionenhancedsparsetestingformultienvironmenttrials AT grondonamartin genomicpredictionenhancedsparsetestingformultienvironmenttrials AT wimmervalentin genomicpredictionenhancedsparsetestingformultienvironmenttrials AT prasannaboddupallim genomicpredictionenhancedsparsetestingformultienvironmenttrials |