Cargando…
Genomic prediction of breeding values using previously estimated SNP variances
BACKGROUND: Genomic prediction requires estimation of variances of effects of single nucleotide polymorphisms (SNPs), which is computationally demanding, and uses these variances for prediction. We have developed models with separate estimation of SNP variances, which can be applied infrequently, an...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4176585/ https://www.ncbi.nlm.nih.gov/pubmed/25928875 http://dx.doi.org/10.1186/s12711-014-0052-x |
_version_ | 1782336655628173312 |
---|---|
author | Calus, Mario PL Schrooten, Chris Veerkamp, Roel F |
author_facet | Calus, Mario PL Schrooten, Chris Veerkamp, Roel F |
author_sort | Calus, Mario PL |
collection | PubMed |
description | BACKGROUND: Genomic prediction requires estimation of variances of effects of single nucleotide polymorphisms (SNPs), which is computationally demanding, and uses these variances for prediction. We have developed models with separate estimation of SNP variances, which can be applied infrequently, and genomic prediction, which can be applied routinely. METHODS: SNP variances were estimated with Bayes Stochastic Search Variable Selection (BSSVS) and BayesC. Genome-enhanced breeding values (GEBV) were estimated with RR-BLUP (ridge regression best linear unbiased prediction), using either variances obtained from BSSVS (BLUP-SSVS) or BayesC (BLUP-C), or assuming equal variances for each SNP. Datasets used to estimate SNP variances comprised (1) all animals, (2) 50% random animals (RAN50), (3) 50% best animals (TOP50), or (4) 50% worst animals (BOT50). Traits analysed were protein yield, udder depth, somatic cell score, interval between first and last insemination, direct longevity, and longevity including information from predictors. RESULTS: BLUP-SSVS and BLUP-C yielded similar GEBV as the equivalent Bayesian models that simultaneously estimated SNP variances. Reliabilities of these GEBV were consistently higher than from RR-BLUP, although only significantly for direct longevity. Across scenarios that used data subsets to estimate GEBV, observed reliabilities were generally higher for TOP50 than for RAN50, and much higher than for BOT50. Reliabilities of TOP50 were higher because the training data contained more ancestors of selection candidates. Using estimated SNP variances based on random or non-random subsets of the data, while using all data to estimate GEBV, did not affect reliabilities of the BLUP models. A convergence criterion of 10(−8) instead of 10(−10) for BLUP models yielded similar GEBV, while the required number of iterations decreased by 71 to 90%. Including a separate polygenic effect consistently improved reliabilities of the GEBV, but also substantially increased the required number of iterations to reach convergence with RR-BLUP. SNP variances converged faster for BayesC than for BSSVS. CONCLUSIONS: Combining Bayesian variable selection models to re-estimate SNP variances and BLUP models that use those SNP variances, yields GEBV that are similar to those from full Bayesian models. Moreover, these combined models yield predictions with higher reliability and less bias than the commonly used RR-BLUP model. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12711-014-0052-x) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4176585 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-41765852014-10-23 Genomic prediction of breeding values using previously estimated SNP variances Calus, Mario PL Schrooten, Chris Veerkamp, Roel F Genet Sel Evol Research BACKGROUND: Genomic prediction requires estimation of variances of effects of single nucleotide polymorphisms (SNPs), which is computationally demanding, and uses these variances for prediction. We have developed models with separate estimation of SNP variances, which can be applied infrequently, and genomic prediction, which can be applied routinely. METHODS: SNP variances were estimated with Bayes Stochastic Search Variable Selection (BSSVS) and BayesC. Genome-enhanced breeding values (GEBV) were estimated with RR-BLUP (ridge regression best linear unbiased prediction), using either variances obtained from BSSVS (BLUP-SSVS) or BayesC (BLUP-C), or assuming equal variances for each SNP. Datasets used to estimate SNP variances comprised (1) all animals, (2) 50% random animals (RAN50), (3) 50% best animals (TOP50), or (4) 50% worst animals (BOT50). Traits analysed were protein yield, udder depth, somatic cell score, interval between first and last insemination, direct longevity, and longevity including information from predictors. RESULTS: BLUP-SSVS and BLUP-C yielded similar GEBV as the equivalent Bayesian models that simultaneously estimated SNP variances. Reliabilities of these GEBV were consistently higher than from RR-BLUP, although only significantly for direct longevity. Across scenarios that used data subsets to estimate GEBV, observed reliabilities were generally higher for TOP50 than for RAN50, and much higher than for BOT50. Reliabilities of TOP50 were higher because the training data contained more ancestors of selection candidates. Using estimated SNP variances based on random or non-random subsets of the data, while using all data to estimate GEBV, did not affect reliabilities of the BLUP models. A convergence criterion of 10(−8) instead of 10(−10) for BLUP models yielded similar GEBV, while the required number of iterations decreased by 71 to 90%. Including a separate polygenic effect consistently improved reliabilities of the GEBV, but also substantially increased the required number of iterations to reach convergence with RR-BLUP. SNP variances converged faster for BayesC than for BSSVS. CONCLUSIONS: Combining Bayesian variable selection models to re-estimate SNP variances and BLUP models that use those SNP variances, yields GEBV that are similar to those from full Bayesian models. Moreover, these combined models yield predictions with higher reliability and less bias than the commonly used RR-BLUP model. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12711-014-0052-x) contains supplementary material, which is available to authorized users. BioMed Central 2014-09-25 /pmc/articles/PMC4176585/ /pubmed/25928875 http://dx.doi.org/10.1186/s12711-014-0052-x Text en © Calus et al.; licensee BioMed Central Ltd. 2014 |
spellingShingle | Research Calus, Mario PL Schrooten, Chris Veerkamp, Roel F Genomic prediction of breeding values using previously estimated SNP variances |
title | Genomic prediction of breeding values using previously estimated SNP variances |
title_full | Genomic prediction of breeding values using previously estimated SNP variances |
title_fullStr | Genomic prediction of breeding values using previously estimated SNP variances |
title_full_unstemmed | Genomic prediction of breeding values using previously estimated SNP variances |
title_short | Genomic prediction of breeding values using previously estimated SNP variances |
title_sort | genomic prediction of breeding values using previously estimated snp variances |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4176585/ https://www.ncbi.nlm.nih.gov/pubmed/25928875 http://dx.doi.org/10.1186/s12711-014-0052-x |
work_keys_str_mv | AT calusmariopl genomicpredictionofbreedingvaluesusingpreviouslyestimatedsnpvariances AT schrootenchris genomicpredictionofbreedingvaluesusingpreviouslyestimatedsnpvariances AT veerkamproelf genomicpredictionofbreedingvaluesusingpreviouslyestimatedsnpvariances |