Cargando…

Efficient large-scale single-step evaluations and indirect genomic prediction of genotyped selection candidates

BACKGROUND: Single-step genomic best linear unbiased prediction (ssGBLUP) models allow the combination of genomic, pedigree, and phenotypic data into a single model, which is computationally challenging for large genotyped populations. In practice, genotypes of animals without their own phenotype an...

Descripción completa

Detalles Bibliográficos
Autores principales: Vandenplas, Jeremie, ten Napel, Jan, Darbaghshahi, Saeid Naderi, Evans, Ross, Calus, Mario P. L., Veerkamp, Roel, Cromie, Andrew, Mäntysaari, Esa A., Strandén, Ismo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10251624/
https://www.ncbi.nlm.nih.gov/pubmed/37291510
http://dx.doi.org/10.1186/s12711-023-00808-z
_version_ 1785055984569286656
author Vandenplas, Jeremie
ten Napel, Jan
Darbaghshahi, Saeid Naderi
Evans, Ross
Calus, Mario P. L.
Veerkamp, Roel
Cromie, Andrew
Mäntysaari, Esa A.
Strandén, Ismo
author_facet Vandenplas, Jeremie
ten Napel, Jan
Darbaghshahi, Saeid Naderi
Evans, Ross
Calus, Mario P. L.
Veerkamp, Roel
Cromie, Andrew
Mäntysaari, Esa A.
Strandén, Ismo
author_sort Vandenplas, Jeremie
collection PubMed
description BACKGROUND: Single-step genomic best linear unbiased prediction (ssGBLUP) models allow the combination of genomic, pedigree, and phenotypic data into a single model, which is computationally challenging for large genotyped populations. In practice, genotypes of animals without their own phenotype and progeny, so-called genotyped selection candidates, can become available after genomic breeding values have been estimated by ssGBLUP. In some breeding programmes, genomic estimated breeding values (GEBV) for these animals should be known shortly after obtaining genotype information but recomputing GEBV using the full ssGBLUP takes too much time. In this study, first we compare two equivalent formulations of ssGBLUP models, i.e. one that is based on the Woodbury matrix identity applied to the inverse of the genomic relationship matrix, and one that is based on marker equations. Second, we present computationally-fast approaches to indirectly compute GEBV for genotyped selection candidates, without the need to do the full ssGBLUP evaluation. RESULTS: The indirect approaches use information from the latest ssGBLUP evaluation and rely on the decomposition of GEBV into its components. The two equivalent ssGBLUP models and indirect approaches were tested on a six-trait calving difficulty model using Irish dairy and beef cattle data that include 2.6 million genotyped animals of which about 500,000 were considered as genotyped selection candidates. When using the same computational approaches, the solving phase of the two equivalent ssGBLUP models showed similar requirements for memory and time per iteration. The computational differences between them were due to the preprocessing phase of the genomic information. Regarding the indirect approaches, compared to GEBV obtained from single-step evaluations including all genotypes, indirect GEBV had correlations higher than 0.99 for all traits while showing little dispersion and level bias. CONCLUSIONS: In conclusion, ssGBLUP predictions for the genotyped selection candidates were accurately approximated using the presented indirect approaches, which are more memory efficient and computationally fast, compared to solving a full ssGBLUP evaluation. Thus, indirect approaches can be used even on a weekly basis to estimate GEBV for newly genotyped animals, while the full single-step evaluation is done only a few times within a year. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12711-023-00808-z.
format Online
Article
Text
id pubmed-10251624
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-102516242023-06-10 Efficient large-scale single-step evaluations and indirect genomic prediction of genotyped selection candidates Vandenplas, Jeremie ten Napel, Jan Darbaghshahi, Saeid Naderi Evans, Ross Calus, Mario P. L. Veerkamp, Roel Cromie, Andrew Mäntysaari, Esa A. Strandén, Ismo Genet Sel Evol Research Article BACKGROUND: Single-step genomic best linear unbiased prediction (ssGBLUP) models allow the combination of genomic, pedigree, and phenotypic data into a single model, which is computationally challenging for large genotyped populations. In practice, genotypes of animals without their own phenotype and progeny, so-called genotyped selection candidates, can become available after genomic breeding values have been estimated by ssGBLUP. In some breeding programmes, genomic estimated breeding values (GEBV) for these animals should be known shortly after obtaining genotype information but recomputing GEBV using the full ssGBLUP takes too much time. In this study, first we compare two equivalent formulations of ssGBLUP models, i.e. one that is based on the Woodbury matrix identity applied to the inverse of the genomic relationship matrix, and one that is based on marker equations. Second, we present computationally-fast approaches to indirectly compute GEBV for genotyped selection candidates, without the need to do the full ssGBLUP evaluation. RESULTS: The indirect approaches use information from the latest ssGBLUP evaluation and rely on the decomposition of GEBV into its components. The two equivalent ssGBLUP models and indirect approaches were tested on a six-trait calving difficulty model using Irish dairy and beef cattle data that include 2.6 million genotyped animals of which about 500,000 were considered as genotyped selection candidates. When using the same computational approaches, the solving phase of the two equivalent ssGBLUP models showed similar requirements for memory and time per iteration. The computational differences between them were due to the preprocessing phase of the genomic information. Regarding the indirect approaches, compared to GEBV obtained from single-step evaluations including all genotypes, indirect GEBV had correlations higher than 0.99 for all traits while showing little dispersion and level bias. CONCLUSIONS: In conclusion, ssGBLUP predictions for the genotyped selection candidates were accurately approximated using the presented indirect approaches, which are more memory efficient and computationally fast, compared to solving a full ssGBLUP evaluation. Thus, indirect approaches can be used even on a weekly basis to estimate GEBV for newly genotyped animals, while the full single-step evaluation is done only a few times within a year. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12711-023-00808-z. BioMed Central 2023-06-08 /pmc/articles/PMC10251624/ /pubmed/37291510 http://dx.doi.org/10.1186/s12711-023-00808-z Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Vandenplas, Jeremie
ten Napel, Jan
Darbaghshahi, Saeid Naderi
Evans, Ross
Calus, Mario P. L.
Veerkamp, Roel
Cromie, Andrew
Mäntysaari, Esa A.
Strandén, Ismo
Efficient large-scale single-step evaluations and indirect genomic prediction of genotyped selection candidates
title Efficient large-scale single-step evaluations and indirect genomic prediction of genotyped selection candidates
title_full Efficient large-scale single-step evaluations and indirect genomic prediction of genotyped selection candidates
title_fullStr Efficient large-scale single-step evaluations and indirect genomic prediction of genotyped selection candidates
title_full_unstemmed Efficient large-scale single-step evaluations and indirect genomic prediction of genotyped selection candidates
title_short Efficient large-scale single-step evaluations and indirect genomic prediction of genotyped selection candidates
title_sort efficient large-scale single-step evaluations and indirect genomic prediction of genotyped selection candidates
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10251624/
https://www.ncbi.nlm.nih.gov/pubmed/37291510
http://dx.doi.org/10.1186/s12711-023-00808-z
work_keys_str_mv AT vandenplasjeremie efficientlargescalesinglestepevaluationsandindirectgenomicpredictionofgenotypedselectioncandidates
AT tennapeljan efficientlargescalesinglestepevaluationsandindirectgenomicpredictionofgenotypedselectioncandidates
AT darbaghshahisaeidnaderi efficientlargescalesinglestepevaluationsandindirectgenomicpredictionofgenotypedselectioncandidates
AT evansross efficientlargescalesinglestepevaluationsandindirectgenomicpredictionofgenotypedselectioncandidates
AT calusmariopl efficientlargescalesinglestepevaluationsandindirectgenomicpredictionofgenotypedselectioncandidates
AT veerkamproel efficientlargescalesinglestepevaluationsandindirectgenomicpredictionofgenotypedselectioncandidates
AT cromieandrew efficientlargescalesinglestepevaluationsandindirectgenomicpredictionofgenotypedselectioncandidates
AT mantysaariesaa efficientlargescalesinglestepevaluationsandindirectgenomicpredictionofgenotypedselectioncandidates
AT strandenismo efficientlargescalesinglestepevaluationsandindirectgenomicpredictionofgenotypedselectioncandidates