Cargando…

Genomic prediction of breeding values using previously estimated SNP variances

BACKGROUND: Genomic prediction requires estimation of variances of effects of single nucleotide polymorphisms (SNPs), which is computationally demanding, and uses these variances for prediction. We have developed models with separate estimation of SNP variances, which can be applied infrequently, an...

Descripción completa

Detalles Bibliográficos
Autores principales:	Calus, Mario PL, Schrooten, Chris, Veerkamp, Roel F
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2014
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4176585/ https://www.ncbi.nlm.nih.gov/pubmed/25928875 http://dx.doi.org/10.1186/s12711-014-0052-x

_version_	1782336655628173312
author	Calus, Mario PL Schrooten, Chris Veerkamp, Roel F
author_facet	Calus, Mario PL Schrooten, Chris Veerkamp, Roel F
author_sort	Calus, Mario PL
collection	PubMed
description	BACKGROUND: Genomic prediction requires estimation of variances of effects of single nucleotide polymorphisms (SNPs), which is computationally demanding, and uses these variances for prediction. We have developed models with separate estimation of SNP variances, which can be applied infrequently, and genomic prediction, which can be applied routinely. METHODS: SNP variances were estimated with Bayes Stochastic Search Variable Selection (BSSVS) and BayesC. Genome-enhanced breeding values (GEBV) were estimated with RR-BLUP (ridge regression best linear unbiased prediction), using either variances obtained from BSSVS (BLUP-SSVS) or BayesC (BLUP-C), or assuming equal variances for each SNP. Datasets used to estimate SNP variances comprised (1) all animals, (2) 50% random animals (RAN50), (3) 50% best animals (TOP50), or (4) 50% worst animals (BOT50). Traits analysed were protein yield, udder depth, somatic cell score, interval between first and last insemination, direct longevity, and longevity including information from predictors. RESULTS: BLUP-SSVS and BLUP-C yielded similar GEBV as the equivalent Bayesian models that simultaneously estimated SNP variances. Reliabilities of these GEBV were consistently higher than from RR-BLUP, although only significantly for direct longevity. Across scenarios that used data subsets to estimate GEBV, observed reliabilities were generally higher for TOP50 than for RAN50, and much higher than for BOT50. Reliabilities of TOP50 were higher because the training data contained more ancestors of selection candidates. Using estimated SNP variances based on random or non-random subsets of the data, while using all data to estimate GEBV, did not affect reliabilities of the BLUP models. A convergence criterion of 10(−8) instead of 10(−10) for BLUP models yielded similar GEBV, while the required number of iterations decreased by 71 to 90%. Including a separate polygenic effect consistently improved reliabilities of the GEBV, but also substantially increased the required number of iterations to reach convergence with RR-BLUP. SNP variances converged faster for BayesC than for BSSVS. CONCLUSIONS: Combining Bayesian variable selection models to re-estimate SNP variances and BLUP models that use those SNP variances, yields GEBV that are similar to those from full Bayesian models. Moreover, these combined models yield predictions with higher reliability and less bias than the commonly used RR-BLUP model. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12711-014-0052-x) contains supplementary material, which is available to authorized users.
format	Online Article Text
id	pubmed-4176585
institution	National Center for Biotechnology Information
language	English
publishDate	2014
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-41765852014-10-23 Genomic prediction of breeding values using previously estimated SNP variances Calus, Mario PL Schrooten, Chris Veerkamp, Roel F Genet Sel Evol Research BACKGROUND: Genomic prediction requires estimation of variances of effects of single nucleotide polymorphisms (SNPs), which is computationally demanding, and uses these variances for prediction. We have developed models with separate estimation of SNP variances, which can be applied infrequently, and genomic prediction, which can be applied routinely. METHODS: SNP variances were estimated with Bayes Stochastic Search Variable Selection (BSSVS) and BayesC. Genome-enhanced breeding values (GEBV) were estimated with RR-BLUP (ridge regression best linear unbiased prediction), using either variances obtained from BSSVS (BLUP-SSVS) or BayesC (BLUP-C), or assuming equal variances for each SNP. Datasets used to estimate SNP variances comprised (1) all animals, (2) 50% random animals (RAN50), (3) 50% best animals (TOP50), or (4) 50% worst animals (BOT50). Traits analysed were protein yield, udder depth, somatic cell score, interval between first and last insemination, direct longevity, and longevity including information from predictors. RESULTS: BLUP-SSVS and BLUP-C yielded similar GEBV as the equivalent Bayesian models that simultaneously estimated SNP variances. Reliabilities of these GEBV were consistently higher than from RR-BLUP, although only significantly for direct longevity. Across scenarios that used data subsets to estimate GEBV, observed reliabilities were generally higher for TOP50 than for RAN50, and much higher than for BOT50. Reliabilities of TOP50 were higher because the training data contained more ancestors of selection candidates. Using estimated SNP variances based on random or non-random subsets of the data, while using all data to estimate GEBV, did not affect reliabilities of the BLUP models. A convergence criterion of 10(−8) instead of 10(−10) for BLUP models yielded similar GEBV, while the required number of iterations decreased by 71 to 90%. Including a separate polygenic effect consistently improved reliabilities of the GEBV, but also substantially increased the required number of iterations to reach convergence with RR-BLUP. SNP variances converged faster for BayesC than for BSSVS. CONCLUSIONS: Combining Bayesian variable selection models to re-estimate SNP variances and BLUP models that use those SNP variances, yields GEBV that are similar to those from full Bayesian models. Moreover, these combined models yield predictions with higher reliability and less bias than the commonly used RR-BLUP model. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12711-014-0052-x) contains supplementary material, which is available to authorized users. BioMed Central 2014-09-25 /pmc/articles/PMC4176585/ /pubmed/25928875 http://dx.doi.org/10.1186/s12711-014-0052-x Text en © Calus et al.; licensee BioMed Central Ltd. 2014
spellingShingle	Research Calus, Mario PL Schrooten, Chris Veerkamp, Roel F Genomic prediction of breeding values using previously estimated SNP variances
title	Genomic prediction of breeding values using previously estimated SNP variances
title_full	Genomic prediction of breeding values using previously estimated SNP variances
title_fullStr	Genomic prediction of breeding values using previously estimated SNP variances
title_full_unstemmed	Genomic prediction of breeding values using previously estimated SNP variances
title_short	Genomic prediction of breeding values using previously estimated SNP variances
title_sort	genomic prediction of breeding values using previously estimated snp variances
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4176585/ https://www.ncbi.nlm.nih.gov/pubmed/25928875 http://dx.doi.org/10.1186/s12711-014-0052-x
work_keys_str_mv	AT calusmariopl genomicpredictionofbreedingvaluesusingpreviouslyestimatedsnpvariances AT schrootenchris genomicpredictionofbreedingvaluesusingpreviouslyestimatedsnpvariances AT veerkamproelf genomicpredictionofbreedingvaluesusingpreviouslyestimatedsnpvariances

Genomic prediction of breeding values using previously estimated SNP variances

Ejemplares similares