Cargando…

Validation with single-step SNPBLUP shows that evaluations can continue using a single mean of genotyped individuals, even with multiple breeds

BACKGROUND: In genomic prediction, it is common to centre the genotypes of single nucleotide polymorphisms based on the allele frequencies in the current population, rather than those in the base generation. The mean breeding value of non-genotyped animals is conditional on the mean performance of g...

Descripción completa

Detalles Bibliográficos
Autores principales: Aldridge, Michael, Vandenplas, Jeremie, Duenk, Pascal, Henshall, John, Hawken, Rachel, Calus, Mario
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10031914/
https://www.ncbi.nlm.nih.gov/pubmed/36949392
http://dx.doi.org/10.1186/s12711-023-00787-1
_version_ 1784910690704687104
author Aldridge, Michael
Vandenplas, Jeremie
Duenk, Pascal
Henshall, John
Hawken, Rachel
Calus, Mario
author_facet Aldridge, Michael
Vandenplas, Jeremie
Duenk, Pascal
Henshall, John
Hawken, Rachel
Calus, Mario
author_sort Aldridge, Michael
collection PubMed
description BACKGROUND: In genomic prediction, it is common to centre the genotypes of single nucleotide polymorphisms based on the allele frequencies in the current population, rather than those in the base generation. The mean breeding value of non-genotyped animals is conditional on the mean performance of genotyped relatives, but can be corrected by fitting the mean performance of genotyped individuals as a fixed regression. The associated covariate vector has been referred to as a ‘J-factor’, which if fitted as a fixed effect can improve the accuracy and dispersion bias of sire genomic estimated breeding values (GEBV). To date, this has only been performed on populations with a single breed. Here, we investigated whether there was any benefit in fitting a separate J-factor for each breed in a three-way crossbred population, and in using pedigree-based expected or genome-based estimated breed fractions to define the J-factors. RESULTS: For body weight at 7 days, dispersion bias decreased when fitting multiple J-factors, but only with a low proportion of genotyped individuals with selective genotyping. On average, the mean regression coefficients of validation records on those of GEBV increased with one J-factor compared to none, and further increased with multiple J-factors. However, for body weight at 35 days this was not observed. The accuracy of GEBV remained unchanged regardless of the J-factor method used. Differences between the J-factor methods were limited with correlations approaching 1 for the estimated covariate vector, the estimated coefficients of the regression on the J-factors, and the GEBV. CONCLUSIONS: Based on our results and in the particular design analysed here, i.e. all the animals with phenotype are of the same type of crossbreds, fitting a single J-factor should be sufficient, to reduce dispersion bias. Fitting multiple J-factors may reduce dispersion bias further but this depends on the trait and genotyping rate. For the crossbred population analysed, fitting multiple J-factors has no adverse consequences and if this is done, it does not matter if the breed fractions used are based on the pedigree-expectation or the genomic estimates. Finally, when GEBV are estimated from crossbred data, any observed bias can potentially be reduced by including a straightforward regression on actual breed proportions. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12711-023-00787-1.
format Online
Article
Text
id pubmed-10031914
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-100319142023-03-23 Validation with single-step SNPBLUP shows that evaluations can continue using a single mean of genotyped individuals, even with multiple breeds Aldridge, Michael Vandenplas, Jeremie Duenk, Pascal Henshall, John Hawken, Rachel Calus, Mario Genet Sel Evol Research Article BACKGROUND: In genomic prediction, it is common to centre the genotypes of single nucleotide polymorphisms based on the allele frequencies in the current population, rather than those in the base generation. The mean breeding value of non-genotyped animals is conditional on the mean performance of genotyped relatives, but can be corrected by fitting the mean performance of genotyped individuals as a fixed regression. The associated covariate vector has been referred to as a ‘J-factor’, which if fitted as a fixed effect can improve the accuracy and dispersion bias of sire genomic estimated breeding values (GEBV). To date, this has only been performed on populations with a single breed. Here, we investigated whether there was any benefit in fitting a separate J-factor for each breed in a three-way crossbred population, and in using pedigree-based expected or genome-based estimated breed fractions to define the J-factors. RESULTS: For body weight at 7 days, dispersion bias decreased when fitting multiple J-factors, but only with a low proportion of genotyped individuals with selective genotyping. On average, the mean regression coefficients of validation records on those of GEBV increased with one J-factor compared to none, and further increased with multiple J-factors. However, for body weight at 35 days this was not observed. The accuracy of GEBV remained unchanged regardless of the J-factor method used. Differences between the J-factor methods were limited with correlations approaching 1 for the estimated covariate vector, the estimated coefficients of the regression on the J-factors, and the GEBV. CONCLUSIONS: Based on our results and in the particular design analysed here, i.e. all the animals with phenotype are of the same type of crossbreds, fitting a single J-factor should be sufficient, to reduce dispersion bias. Fitting multiple J-factors may reduce dispersion bias further but this depends on the trait and genotyping rate. For the crossbred population analysed, fitting multiple J-factors has no adverse consequences and if this is done, it does not matter if the breed fractions used are based on the pedigree-expectation or the genomic estimates. Finally, when GEBV are estimated from crossbred data, any observed bias can potentially be reduced by including a straightforward regression on actual breed proportions. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12711-023-00787-1. BioMed Central 2023-03-22 /pmc/articles/PMC10031914/ /pubmed/36949392 http://dx.doi.org/10.1186/s12711-023-00787-1 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Aldridge, Michael
Vandenplas, Jeremie
Duenk, Pascal
Henshall, John
Hawken, Rachel
Calus, Mario
Validation with single-step SNPBLUP shows that evaluations can continue using a single mean of genotyped individuals, even with multiple breeds
title Validation with single-step SNPBLUP shows that evaluations can continue using a single mean of genotyped individuals, even with multiple breeds
title_full Validation with single-step SNPBLUP shows that evaluations can continue using a single mean of genotyped individuals, even with multiple breeds
title_fullStr Validation with single-step SNPBLUP shows that evaluations can continue using a single mean of genotyped individuals, even with multiple breeds
title_full_unstemmed Validation with single-step SNPBLUP shows that evaluations can continue using a single mean of genotyped individuals, even with multiple breeds
title_short Validation with single-step SNPBLUP shows that evaluations can continue using a single mean of genotyped individuals, even with multiple breeds
title_sort validation with single-step snpblup shows that evaluations can continue using a single mean of genotyped individuals, even with multiple breeds
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10031914/
https://www.ncbi.nlm.nih.gov/pubmed/36949392
http://dx.doi.org/10.1186/s12711-023-00787-1
work_keys_str_mv AT aldridgemichael validationwithsinglestepsnpblupshowsthatevaluationscancontinueusingasinglemeanofgenotypedindividualsevenwithmultiplebreeds
AT vandenplasjeremie validationwithsinglestepsnpblupshowsthatevaluationscancontinueusingasinglemeanofgenotypedindividualsevenwithmultiplebreeds
AT duenkpascal validationwithsinglestepsnpblupshowsthatevaluationscancontinueusingasinglemeanofgenotypedindividualsevenwithmultiplebreeds
AT henshalljohn validationwithsinglestepsnpblupshowsthatevaluationscancontinueusingasinglemeanofgenotypedindividualsevenwithmultiplebreeds
AT hawkenrachel validationwithsinglestepsnpblupshowsthatevaluationscancontinueusingasinglemeanofgenotypedindividualsevenwithmultiplebreeds
AT calusmario validationwithsinglestepsnpblupshowsthatevaluationscancontinueusingasinglemeanofgenotypedindividualsevenwithmultiplebreeds