Cargando…
Genomic prediction using imputed whole-genome sequence data in Holstein Friesian cattle
BACKGROUND: In contrast to currently used single nucleotide polymorphism (SNP) panels, the use of whole-genome sequence data is expected to enable the direct estimation of the effects of causal mutations on a given trait. This could lead to higher reliabilities of genomic predictions compared to tho...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4574568/ https://www.ncbi.nlm.nih.gov/pubmed/26381777 http://dx.doi.org/10.1186/s12711-015-0149-x |
_version_ | 1782390650323337216 |
---|---|
author | van Binsbergen, Rianne Calus, Mario P. L. Bink, Marco C. A. M. van Eeuwijk, Fred A. Schrooten, Chris Veerkamp, Roel F. |
author_facet | van Binsbergen, Rianne Calus, Mario P. L. Bink, Marco C. A. M. van Eeuwijk, Fred A. Schrooten, Chris Veerkamp, Roel F. |
author_sort | van Binsbergen, Rianne |
collection | PubMed |
description | BACKGROUND: In contrast to currently used single nucleotide polymorphism (SNP) panels, the use of whole-genome sequence data is expected to enable the direct estimation of the effects of causal mutations on a given trait. This could lead to higher reliabilities of genomic predictions compared to those based on SNP genotypes. Also, at each generation of selection, recombination events between a SNP and a mutation can cause decay in reliability of genomic predictions based on markers rather than on the causal variants. Our objective was to investigate the use of imputed whole-genome sequence genotypes versus high-density SNP genotypes on (the persistency of) the reliability of genomic predictions using real cattle data. METHODS: Highly accurate phenotypes based on daughter performance and Illumina BovineHD Beadchip genotypes were available for 5503 Holstein Friesian bulls. The BovineHD genotypes (631,428 SNPs) of each bull were used to impute whole-genome sequence genotypes (12,590,056 SNPs) using the Beagle software. Imputation was done using a multi-breed reference panel of 429 sequenced individuals. Genomic estimated breeding values for three traits were predicted using a Bayesian stochastic search variable selection (BSSVS) model and a genome-enabled best linear unbiased prediction model (GBLUP). Reliabilities of predictions were based on 2087 validation bulls, while the other 3416 bulls were used for training. RESULTS: Prediction reliabilities ranged from 0.37 to 0.52. BSSVS performed better than GBLUP in all cases. Reliabilities of genomic predictions were slightly lower with imputed sequence data than with BovineHD chip data. Also, the reliabilities tended to be lower for both sequence data and BovineHD chip data when relationships between training animals were low. No increase in persistency of prediction reliability using imputed sequence data was observed. CONCLUSIONS: Compared to BovineHD genotype data, using imputed sequence data for genomic prediction produced no advantage. To investigate the putative advantage of genomic prediction using (imputed) sequence data, a training set with a larger number of individuals that are distantly related to each other and genomic prediction models that incorporate biological information on the SNPs or that apply stricter SNP pre-selection should be considered. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12711-015-0149-x) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4574568 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-45745682015-09-19 Genomic prediction using imputed whole-genome sequence data in Holstein Friesian cattle van Binsbergen, Rianne Calus, Mario P. L. Bink, Marco C. A. M. van Eeuwijk, Fred A. Schrooten, Chris Veerkamp, Roel F. Genet Sel Evol Research Article BACKGROUND: In contrast to currently used single nucleotide polymorphism (SNP) panels, the use of whole-genome sequence data is expected to enable the direct estimation of the effects of causal mutations on a given trait. This could lead to higher reliabilities of genomic predictions compared to those based on SNP genotypes. Also, at each generation of selection, recombination events between a SNP and a mutation can cause decay in reliability of genomic predictions based on markers rather than on the causal variants. Our objective was to investigate the use of imputed whole-genome sequence genotypes versus high-density SNP genotypes on (the persistency of) the reliability of genomic predictions using real cattle data. METHODS: Highly accurate phenotypes based on daughter performance and Illumina BovineHD Beadchip genotypes were available for 5503 Holstein Friesian bulls. The BovineHD genotypes (631,428 SNPs) of each bull were used to impute whole-genome sequence genotypes (12,590,056 SNPs) using the Beagle software. Imputation was done using a multi-breed reference panel of 429 sequenced individuals. Genomic estimated breeding values for three traits were predicted using a Bayesian stochastic search variable selection (BSSVS) model and a genome-enabled best linear unbiased prediction model (GBLUP). Reliabilities of predictions were based on 2087 validation bulls, while the other 3416 bulls were used for training. RESULTS: Prediction reliabilities ranged from 0.37 to 0.52. BSSVS performed better than GBLUP in all cases. Reliabilities of genomic predictions were slightly lower with imputed sequence data than with BovineHD chip data. Also, the reliabilities tended to be lower for both sequence data and BovineHD chip data when relationships between training animals were low. No increase in persistency of prediction reliability using imputed sequence data was observed. CONCLUSIONS: Compared to BovineHD genotype data, using imputed sequence data for genomic prediction produced no advantage. To investigate the putative advantage of genomic prediction using (imputed) sequence data, a training set with a larger number of individuals that are distantly related to each other and genomic prediction models that incorporate biological information on the SNPs or that apply stricter SNP pre-selection should be considered. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12711-015-0149-x) contains supplementary material, which is available to authorized users. BioMed Central 2015-09-17 /pmc/articles/PMC4574568/ /pubmed/26381777 http://dx.doi.org/10.1186/s12711-015-0149-x Text en © van Binsbergen et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article van Binsbergen, Rianne Calus, Mario P. L. Bink, Marco C. A. M. van Eeuwijk, Fred A. Schrooten, Chris Veerkamp, Roel F. Genomic prediction using imputed whole-genome sequence data in Holstein Friesian cattle |
title | Genomic prediction using imputed whole-genome sequence data in Holstein Friesian cattle |
title_full | Genomic prediction using imputed whole-genome sequence data in Holstein Friesian cattle |
title_fullStr | Genomic prediction using imputed whole-genome sequence data in Holstein Friesian cattle |
title_full_unstemmed | Genomic prediction using imputed whole-genome sequence data in Holstein Friesian cattle |
title_short | Genomic prediction using imputed whole-genome sequence data in Holstein Friesian cattle |
title_sort | genomic prediction using imputed whole-genome sequence data in holstein friesian cattle |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4574568/ https://www.ncbi.nlm.nih.gov/pubmed/26381777 http://dx.doi.org/10.1186/s12711-015-0149-x |
work_keys_str_mv | AT vanbinsbergenrianne genomicpredictionusingimputedwholegenomesequencedatainholsteinfriesiancattle AT calusmariopl genomicpredictionusingimputedwholegenomesequencedatainholsteinfriesiancattle AT binkmarcocam genomicpredictionusingimputedwholegenomesequencedatainholsteinfriesiancattle AT vaneeuwijkfreda genomicpredictionusingimputedwholegenomesequencedatainholsteinfriesiancattle AT schrootenchris genomicpredictionusingimputedwholegenomesequencedatainholsteinfriesiancattle AT veerkamproelf genomicpredictionusingimputedwholegenomesequencedatainholsteinfriesiancattle |