Cargando…

Computing strategies for multi-population genomic evaluation

BACKGROUND: Multiple breed evaluation using genomic prediction includes the use of data from multiple populations, or from parental breeds and crosses, and is expected to lead to better genomic predictions. Increased complexity comes from the need to fit non-additive effects such as dominance and/or...

Descripción completa

Detalles Bibliográficos
Autores principales: Legarra, Andrés, González-Diéguez, David, Vitezica, Zulma G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8815165/
https://www.ncbi.nlm.nih.gov/pubmed/35114933
http://dx.doi.org/10.1186/s12711-022-00705-x
_version_ 1784645226750541824
author Legarra, Andrés
González-Diéguez, David
Vitezica, Zulma G.
author_facet Legarra, Andrés
González-Diéguez, David
Vitezica, Zulma G.
author_sort Legarra, Andrés
collection PubMed
description BACKGROUND: Multiple breed evaluation using genomic prediction includes the use of data from multiple populations, or from parental breeds and crosses, and is expected to lead to better genomic predictions. Increased complexity comes from the need to fit non-additive effects such as dominance and/or genotype-by-environment interactions. In these models, marker effects (and breeding values) are modelled as correlated between breeds, which leads to multiple trait formulations that are based either on markers [single nucleotide polymorphism best linear unbiased prediction (SNP-BLUP)] or on individuals [genomic(G)BLUP)]. As an alternative, we propose the use of generalized least squares (GLS) followed by backsolving of marker effects using selection index (SI) theory. RESULTS: All investigated options have advantages and inconveniences. The SNP-BLUP yields marker effects directly, which are useful for indirect prediction and for planned matings, but is very large in number of equations and is structured in dense and sparse blocks that do not allow for simple solving. GBLUP uses a multiple trait formulation and is very general, but results in many equations that are not used, which increase memory needs, and is also structured in dense and sparse blocks. An alternative formulation of GBLUP is more compact but requires tailored programming. The alternative of solving by GLS + SI is the least consuming, both in number of operations and in memory, and it uses only single dense blocks. However, it requires dedicated programming. Computational complexity problems are exacerbated when more than additive effects are fitted, e.g. dominance effects or genotype x environment interactions. CONCLUSIONS: As multi-breed predictions become more frequent and non-additive effects are more often included, standard equations for genomic prediction based on Henderson’s mixed model equations become less practical and may need to be replaced by more efficient (although less general) approaches such as the GLS + SI approach proposed here.
format Online
Article
Text
id pubmed-8815165
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-88151652022-02-07 Computing strategies for multi-population genomic evaluation Legarra, Andrés González-Diéguez, David Vitezica, Zulma G. Genet Sel Evol Short Communication BACKGROUND: Multiple breed evaluation using genomic prediction includes the use of data from multiple populations, or from parental breeds and crosses, and is expected to lead to better genomic predictions. Increased complexity comes from the need to fit non-additive effects such as dominance and/or genotype-by-environment interactions. In these models, marker effects (and breeding values) are modelled as correlated between breeds, which leads to multiple trait formulations that are based either on markers [single nucleotide polymorphism best linear unbiased prediction (SNP-BLUP)] or on individuals [genomic(G)BLUP)]. As an alternative, we propose the use of generalized least squares (GLS) followed by backsolving of marker effects using selection index (SI) theory. RESULTS: All investigated options have advantages and inconveniences. The SNP-BLUP yields marker effects directly, which are useful for indirect prediction and for planned matings, but is very large in number of equations and is structured in dense and sparse blocks that do not allow for simple solving. GBLUP uses a multiple trait formulation and is very general, but results in many equations that are not used, which increase memory needs, and is also structured in dense and sparse blocks. An alternative formulation of GBLUP is more compact but requires tailored programming. The alternative of solving by GLS + SI is the least consuming, both in number of operations and in memory, and it uses only single dense blocks. However, it requires dedicated programming. Computational complexity problems are exacerbated when more than additive effects are fitted, e.g. dominance effects or genotype x environment interactions. CONCLUSIONS: As multi-breed predictions become more frequent and non-additive effects are more often included, standard equations for genomic prediction based on Henderson’s mixed model equations become less practical and may need to be replaced by more efficient (although less general) approaches such as the GLS + SI approach proposed here. BioMed Central 2022-02-04 /pmc/articles/PMC8815165/ /pubmed/35114933 http://dx.doi.org/10.1186/s12711-022-00705-x Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Short Communication
Legarra, Andrés
González-Diéguez, David
Vitezica, Zulma G.
Computing strategies for multi-population genomic evaluation
title Computing strategies for multi-population genomic evaluation
title_full Computing strategies for multi-population genomic evaluation
title_fullStr Computing strategies for multi-population genomic evaluation
title_full_unstemmed Computing strategies for multi-population genomic evaluation
title_short Computing strategies for multi-population genomic evaluation
title_sort computing strategies for multi-population genomic evaluation
topic Short Communication
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8815165/
https://www.ncbi.nlm.nih.gov/pubmed/35114933
http://dx.doi.org/10.1186/s12711-022-00705-x
work_keys_str_mv AT legarraandres computingstrategiesformultipopulationgenomicevaluation
AT gonzalezdieguezdavid computingstrategiesformultipopulationgenomicevaluation
AT vitezicazulmag computingstrategiesformultipopulationgenomicevaluation