Cargando…
Computing strategies for multi-population genomic evaluation
BACKGROUND: Multiple breed evaluation using genomic prediction includes the use of data from multiple populations, or from parental breeds and crosses, and is expected to lead to better genomic predictions. Increased complexity comes from the need to fit non-additive effects such as dominance and/or...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8815165/ https://www.ncbi.nlm.nih.gov/pubmed/35114933 http://dx.doi.org/10.1186/s12711-022-00705-x |
_version_ | 1784645226750541824 |
---|---|
author | Legarra, Andrés González-Diéguez, David Vitezica, Zulma G. |
author_facet | Legarra, Andrés González-Diéguez, David Vitezica, Zulma G. |
author_sort | Legarra, Andrés |
collection | PubMed |
description | BACKGROUND: Multiple breed evaluation using genomic prediction includes the use of data from multiple populations, or from parental breeds and crosses, and is expected to lead to better genomic predictions. Increased complexity comes from the need to fit non-additive effects such as dominance and/or genotype-by-environment interactions. In these models, marker effects (and breeding values) are modelled as correlated between breeds, which leads to multiple trait formulations that are based either on markers [single nucleotide polymorphism best linear unbiased prediction (SNP-BLUP)] or on individuals [genomic(G)BLUP)]. As an alternative, we propose the use of generalized least squares (GLS) followed by backsolving of marker effects using selection index (SI) theory. RESULTS: All investigated options have advantages and inconveniences. The SNP-BLUP yields marker effects directly, which are useful for indirect prediction and for planned matings, but is very large in number of equations and is structured in dense and sparse blocks that do not allow for simple solving. GBLUP uses a multiple trait formulation and is very general, but results in many equations that are not used, which increase memory needs, and is also structured in dense and sparse blocks. An alternative formulation of GBLUP is more compact but requires tailored programming. The alternative of solving by GLS + SI is the least consuming, both in number of operations and in memory, and it uses only single dense blocks. However, it requires dedicated programming. Computational complexity problems are exacerbated when more than additive effects are fitted, e.g. dominance effects or genotype x environment interactions. CONCLUSIONS: As multi-breed predictions become more frequent and non-additive effects are more often included, standard equations for genomic prediction based on Henderson’s mixed model equations become less practical and may need to be replaced by more efficient (although less general) approaches such as the GLS + SI approach proposed here. |
format | Online Article Text |
id | pubmed-8815165 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-88151652022-02-07 Computing strategies for multi-population genomic evaluation Legarra, Andrés González-Diéguez, David Vitezica, Zulma G. Genet Sel Evol Short Communication BACKGROUND: Multiple breed evaluation using genomic prediction includes the use of data from multiple populations, or from parental breeds and crosses, and is expected to lead to better genomic predictions. Increased complexity comes from the need to fit non-additive effects such as dominance and/or genotype-by-environment interactions. In these models, marker effects (and breeding values) are modelled as correlated between breeds, which leads to multiple trait formulations that are based either on markers [single nucleotide polymorphism best linear unbiased prediction (SNP-BLUP)] or on individuals [genomic(G)BLUP)]. As an alternative, we propose the use of generalized least squares (GLS) followed by backsolving of marker effects using selection index (SI) theory. RESULTS: All investigated options have advantages and inconveniences. The SNP-BLUP yields marker effects directly, which are useful for indirect prediction and for planned matings, but is very large in number of equations and is structured in dense and sparse blocks that do not allow for simple solving. GBLUP uses a multiple trait formulation and is very general, but results in many equations that are not used, which increase memory needs, and is also structured in dense and sparse blocks. An alternative formulation of GBLUP is more compact but requires tailored programming. The alternative of solving by GLS + SI is the least consuming, both in number of operations and in memory, and it uses only single dense blocks. However, it requires dedicated programming. Computational complexity problems are exacerbated when more than additive effects are fitted, e.g. dominance effects or genotype x environment interactions. CONCLUSIONS: As multi-breed predictions become more frequent and non-additive effects are more often included, standard equations for genomic prediction based on Henderson’s mixed model equations become less practical and may need to be replaced by more efficient (although less general) approaches such as the GLS + SI approach proposed here. BioMed Central 2022-02-04 /pmc/articles/PMC8815165/ /pubmed/35114933 http://dx.doi.org/10.1186/s12711-022-00705-x Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Short Communication Legarra, Andrés González-Diéguez, David Vitezica, Zulma G. Computing strategies for multi-population genomic evaluation |
title | Computing strategies for multi-population genomic evaluation |
title_full | Computing strategies for multi-population genomic evaluation |
title_fullStr | Computing strategies for multi-population genomic evaluation |
title_full_unstemmed | Computing strategies for multi-population genomic evaluation |
title_short | Computing strategies for multi-population genomic evaluation |
title_sort | computing strategies for multi-population genomic evaluation |
topic | Short Communication |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8815165/ https://www.ncbi.nlm.nih.gov/pubmed/35114933 http://dx.doi.org/10.1186/s12711-022-00705-x |
work_keys_str_mv | AT legarraandres computingstrategiesformultipopulationgenomicevaluation AT gonzalezdieguezdavid computingstrategiesformultipopulationgenomicevaluation AT vitezicazulmag computingstrategiesformultipopulationgenomicevaluation |