Cargando…

Computing strategies for multi-population genomic evaluation

BACKGROUND: Multiple breed evaluation using genomic prediction includes the use of data from multiple populations, or from parental breeds and crosses, and is expected to lead to better genomic predictions. Increased complexity comes from the need to fit non-additive effects such as dominance and/or...

Descripción completa

Detalles Bibliográficos
Autores principales:	Legarra, Andrés, González-Diéguez, David, Vitezica, Zulma G.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2022
Materias:	Short Communication
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8815165/ https://www.ncbi.nlm.nih.gov/pubmed/35114933 http://dx.doi.org/10.1186/s12711-022-00705-x

_version_	1784645226750541824
author	Legarra, Andrés González-Diéguez, David Vitezica, Zulma G.
author_facet	Legarra, Andrés González-Diéguez, David Vitezica, Zulma G.
author_sort	Legarra, Andrés
collection	PubMed
description	BACKGROUND: Multiple breed evaluation using genomic prediction includes the use of data from multiple populations, or from parental breeds and crosses, and is expected to lead to better genomic predictions. Increased complexity comes from the need to fit non-additive effects such as dominance and/or genotype-by-environment interactions. In these models, marker effects (and breeding values) are modelled as correlated between breeds, which leads to multiple trait formulations that are based either on markers [single nucleotide polymorphism best linear unbiased prediction (SNP-BLUP)] or on individuals [genomic(G)BLUP)]. As an alternative, we propose the use of generalized least squares (GLS) followed by backsolving of marker effects using selection index (SI) theory. RESULTS: All investigated options have advantages and inconveniences. The SNP-BLUP yields marker effects directly, which are useful for indirect prediction and for planned matings, but is very large in number of equations and is structured in dense and sparse blocks that do not allow for simple solving. GBLUP uses a multiple trait formulation and is very general, but results in many equations that are not used, which increase memory needs, and is also structured in dense and sparse blocks. An alternative formulation of GBLUP is more compact but requires tailored programming. The alternative of solving by GLS + SI is the least consuming, both in number of operations and in memory, and it uses only single dense blocks. However, it requires dedicated programming. Computational complexity problems are exacerbated when more than additive effects are fitted, e.g. dominance effects or genotype x environment interactions. CONCLUSIONS: As multi-breed predictions become more frequent and non-additive effects are more often included, standard equations for genomic prediction based on Henderson’s mixed model equations become less practical and may need to be replaced by more efficient (although less general) approaches such as the GLS + SI approach proposed here.
format	Online Article Text
id	pubmed-8815165
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-88151652022-02-07 Computing strategies for multi-population genomic evaluation Legarra, Andrés González-Diéguez, David Vitezica, Zulma G. Genet Sel Evol Short Communication BACKGROUND: Multiple breed evaluation using genomic prediction includes the use of data from multiple populations, or from parental breeds and crosses, and is expected to lead to better genomic predictions. Increased complexity comes from the need to fit non-additive effects such as dominance and/or genotype-by-environment interactions. In these models, marker effects (and breeding values) are modelled as correlated between breeds, which leads to multiple trait formulations that are based either on markers [single nucleotide polymorphism best linear unbiased prediction (SNP-BLUP)] or on individuals [genomic(G)BLUP)]. As an alternative, we propose the use of generalized least squares (GLS) followed by backsolving of marker effects using selection index (SI) theory. RESULTS: All investigated options have advantages and inconveniences. The SNP-BLUP yields marker effects directly, which are useful for indirect prediction and for planned matings, but is very large in number of equations and is structured in dense and sparse blocks that do not allow for simple solving. GBLUP uses a multiple trait formulation and is very general, but results in many equations that are not used, which increase memory needs, and is also structured in dense and sparse blocks. An alternative formulation of GBLUP is more compact but requires tailored programming. The alternative of solving by GLS + SI is the least consuming, both in number of operations and in memory, and it uses only single dense blocks. However, it requires dedicated programming. Computational complexity problems are exacerbated when more than additive effects are fitted, e.g. dominance effects or genotype x environment interactions. CONCLUSIONS: As multi-breed predictions become more frequent and non-additive effects are more often included, standard equations for genomic prediction based on Henderson’s mixed model equations become less practical and may need to be replaced by more efficient (although less general) approaches such as the GLS + SI approach proposed here. BioMed Central 2022-02-04 /pmc/articles/PMC8815165/ /pubmed/35114933 http://dx.doi.org/10.1186/s12711-022-00705-x Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Short Communication Legarra, Andrés González-Diéguez, David Vitezica, Zulma G. Computing strategies for multi-population genomic evaluation
title	Computing strategies for multi-population genomic evaluation
title_full	Computing strategies for multi-population genomic evaluation
title_fullStr	Computing strategies for multi-population genomic evaluation
title_full_unstemmed	Computing strategies for multi-population genomic evaluation
title_short	Computing strategies for multi-population genomic evaluation
title_sort	computing strategies for multi-population genomic evaluation
topic	Short Communication
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8815165/ https://www.ncbi.nlm.nih.gov/pubmed/35114933 http://dx.doi.org/10.1186/s12711-022-00705-x
work_keys_str_mv	AT legarraandres computingstrategiesformultipopulationgenomicevaluation AT gonzalezdieguezdavid computingstrategiesformultipopulationgenomicevaluation AT vitezicazulmag computingstrategiesformultipopulationgenomicevaluation

Computing strategies for multi-population genomic evaluation

Ejemplares similares