Cargando…

Genomic prediction of breeding values using previously estimated SNP variances

BACKGROUND: Genomic prediction requires estimation of variances of effects of single nucleotide polymorphisms (SNPs), which is computationally demanding, and uses these variances for prediction. We have developed models with separate estimation of SNP variances, which can be applied infrequently, an...

Descripción completa

Detalles Bibliográficos
Autores principales: Calus, Mario PL, Schrooten, Chris, Veerkamp, Roel F
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4176585/
https://www.ncbi.nlm.nih.gov/pubmed/25928875
http://dx.doi.org/10.1186/s12711-014-0052-x
_version_ 1782336655628173312
author Calus, Mario PL
Schrooten, Chris
Veerkamp, Roel F
author_facet Calus, Mario PL
Schrooten, Chris
Veerkamp, Roel F
author_sort Calus, Mario PL
collection PubMed
description BACKGROUND: Genomic prediction requires estimation of variances of effects of single nucleotide polymorphisms (SNPs), which is computationally demanding, and uses these variances for prediction. We have developed models with separate estimation of SNP variances, which can be applied infrequently, and genomic prediction, which can be applied routinely. METHODS: SNP variances were estimated with Bayes Stochastic Search Variable Selection (BSSVS) and BayesC. Genome-enhanced breeding values (GEBV) were estimated with RR-BLUP (ridge regression best linear unbiased prediction), using either variances obtained from BSSVS (BLUP-SSVS) or BayesC (BLUP-C), or assuming equal variances for each SNP. Datasets used to estimate SNP variances comprised (1) all animals, (2) 50% random animals (RAN50), (3) 50% best animals (TOP50), or (4) 50% worst animals (BOT50). Traits analysed were protein yield, udder depth, somatic cell score, interval between first and last insemination, direct longevity, and longevity including information from predictors. RESULTS: BLUP-SSVS and BLUP-C yielded similar GEBV as the equivalent Bayesian models that simultaneously estimated SNP variances. Reliabilities of these GEBV were consistently higher than from RR-BLUP, although only significantly for direct longevity. Across scenarios that used data subsets to estimate GEBV, observed reliabilities were generally higher for TOP50 than for RAN50, and much higher than for BOT50. Reliabilities of TOP50 were higher because the training data contained more ancestors of selection candidates. Using estimated SNP variances based on random or non-random subsets of the data, while using all data to estimate GEBV, did not affect reliabilities of the BLUP models. A convergence criterion of 10(−8) instead of 10(−10) for BLUP models yielded similar GEBV, while the required number of iterations decreased by 71 to 90%. Including a separate polygenic effect consistently improved reliabilities of the GEBV, but also substantially increased the required number of iterations to reach convergence with RR-BLUP. SNP variances converged faster for BayesC than for BSSVS. CONCLUSIONS: Combining Bayesian variable selection models to re-estimate SNP variances and BLUP models that use those SNP variances, yields GEBV that are similar to those from full Bayesian models. Moreover, these combined models yield predictions with higher reliability and less bias than the commonly used RR-BLUP model. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12711-014-0052-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4176585
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41765852014-10-23 Genomic prediction of breeding values using previously estimated SNP variances Calus, Mario PL Schrooten, Chris Veerkamp, Roel F Genet Sel Evol Research BACKGROUND: Genomic prediction requires estimation of variances of effects of single nucleotide polymorphisms (SNPs), which is computationally demanding, and uses these variances for prediction. We have developed models with separate estimation of SNP variances, which can be applied infrequently, and genomic prediction, which can be applied routinely. METHODS: SNP variances were estimated with Bayes Stochastic Search Variable Selection (BSSVS) and BayesC. Genome-enhanced breeding values (GEBV) were estimated with RR-BLUP (ridge regression best linear unbiased prediction), using either variances obtained from BSSVS (BLUP-SSVS) or BayesC (BLUP-C), or assuming equal variances for each SNP. Datasets used to estimate SNP variances comprised (1) all animals, (2) 50% random animals (RAN50), (3) 50% best animals (TOP50), or (4) 50% worst animals (BOT50). Traits analysed were protein yield, udder depth, somatic cell score, interval between first and last insemination, direct longevity, and longevity including information from predictors. RESULTS: BLUP-SSVS and BLUP-C yielded similar GEBV as the equivalent Bayesian models that simultaneously estimated SNP variances. Reliabilities of these GEBV were consistently higher than from RR-BLUP, although only significantly for direct longevity. Across scenarios that used data subsets to estimate GEBV, observed reliabilities were generally higher for TOP50 than for RAN50, and much higher than for BOT50. Reliabilities of TOP50 were higher because the training data contained more ancestors of selection candidates. Using estimated SNP variances based on random or non-random subsets of the data, while using all data to estimate GEBV, did not affect reliabilities of the BLUP models. A convergence criterion of 10(−8) instead of 10(−10) for BLUP models yielded similar GEBV, while the required number of iterations decreased by 71 to 90%. Including a separate polygenic effect consistently improved reliabilities of the GEBV, but also substantially increased the required number of iterations to reach convergence with RR-BLUP. SNP variances converged faster for BayesC than for BSSVS. CONCLUSIONS: Combining Bayesian variable selection models to re-estimate SNP variances and BLUP models that use those SNP variances, yields GEBV that are similar to those from full Bayesian models. Moreover, these combined models yield predictions with higher reliability and less bias than the commonly used RR-BLUP model. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12711-014-0052-x) contains supplementary material, which is available to authorized users. BioMed Central 2014-09-25 /pmc/articles/PMC4176585/ /pubmed/25928875 http://dx.doi.org/10.1186/s12711-014-0052-x Text en © Calus et al.; licensee BioMed Central Ltd. 2014
spellingShingle Research
Calus, Mario PL
Schrooten, Chris
Veerkamp, Roel F
Genomic prediction of breeding values using previously estimated SNP variances
title Genomic prediction of breeding values using previously estimated SNP variances
title_full Genomic prediction of breeding values using previously estimated SNP variances
title_fullStr Genomic prediction of breeding values using previously estimated SNP variances
title_full_unstemmed Genomic prediction of breeding values using previously estimated SNP variances
title_short Genomic prediction of breeding values using previously estimated SNP variances
title_sort genomic prediction of breeding values using previously estimated snp variances
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4176585/
https://www.ncbi.nlm.nih.gov/pubmed/25928875
http://dx.doi.org/10.1186/s12711-014-0052-x
work_keys_str_mv AT calusmariopl genomicpredictionofbreedingvaluesusingpreviouslyestimatedsnpvariances
AT schrootenchris genomicpredictionofbreedingvaluesusingpreviouslyestimatedsnpvariances
AT veerkamproelf genomicpredictionofbreedingvaluesusingpreviouslyestimatedsnpvariances