Cargando…

Genomic prediction using imputed whole-genome sequence data in Holstein Friesian cattle

BACKGROUND: In contrast to currently used single nucleotide polymorphism (SNP) panels, the use of whole-genome sequence data is expected to enable the direct estimation of the effects of causal mutations on a given trait. This could lead to higher reliabilities of genomic predictions compared to tho...

Descripción completa

Detalles Bibliográficos
Autores principales: van Binsbergen, Rianne, Calus, Mario P. L., Bink, Marco C. A. M., van Eeuwijk, Fred A., Schrooten, Chris, Veerkamp, Roel F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4574568/
https://www.ncbi.nlm.nih.gov/pubmed/26381777
http://dx.doi.org/10.1186/s12711-015-0149-x
_version_ 1782390650323337216
author van Binsbergen, Rianne
Calus, Mario P. L.
Bink, Marco C. A. M.
van Eeuwijk, Fred A.
Schrooten, Chris
Veerkamp, Roel F.
author_facet van Binsbergen, Rianne
Calus, Mario P. L.
Bink, Marco C. A. M.
van Eeuwijk, Fred A.
Schrooten, Chris
Veerkamp, Roel F.
author_sort van Binsbergen, Rianne
collection PubMed
description BACKGROUND: In contrast to currently used single nucleotide polymorphism (SNP) panels, the use of whole-genome sequence data is expected to enable the direct estimation of the effects of causal mutations on a given trait. This could lead to higher reliabilities of genomic predictions compared to those based on SNP genotypes. Also, at each generation of selection, recombination events between a SNP and a mutation can cause decay in reliability of genomic predictions based on markers rather than on the causal variants. Our objective was to investigate the use of imputed whole-genome sequence genotypes versus high-density SNP genotypes on (the persistency of) the reliability of genomic predictions using real cattle data. METHODS: Highly accurate phenotypes based on daughter performance and Illumina BovineHD Beadchip genotypes were available for 5503 Holstein Friesian bulls. The BovineHD genotypes (631,428 SNPs) of each bull were used to impute whole-genome sequence genotypes (12,590,056 SNPs) using the Beagle software. Imputation was done using a multi-breed reference panel of 429 sequenced individuals. Genomic estimated breeding values for three traits were predicted using a Bayesian stochastic search variable selection (BSSVS) model and a genome-enabled best linear unbiased prediction model (GBLUP). Reliabilities of predictions were based on 2087 validation bulls, while the other 3416 bulls were used for training. RESULTS: Prediction reliabilities ranged from 0.37 to 0.52. BSSVS performed better than GBLUP in all cases. Reliabilities of genomic predictions were slightly lower with imputed sequence data than with BovineHD chip data. Also, the reliabilities tended to be lower for both sequence data and BovineHD chip data when relationships between training animals were low. No increase in persistency of prediction reliability using imputed sequence data was observed. CONCLUSIONS: Compared to BovineHD genotype data, using imputed sequence data for genomic prediction produced no advantage. To investigate the putative advantage of genomic prediction using (imputed) sequence data, a training set with a larger number of individuals that are distantly related to each other and genomic prediction models that incorporate biological information on the SNPs or that apply stricter SNP pre-selection should be considered. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12711-015-0149-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4574568
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-45745682015-09-19 Genomic prediction using imputed whole-genome sequence data in Holstein Friesian cattle van Binsbergen, Rianne Calus, Mario P. L. Bink, Marco C. A. M. van Eeuwijk, Fred A. Schrooten, Chris Veerkamp, Roel F. Genet Sel Evol Research Article BACKGROUND: In contrast to currently used single nucleotide polymorphism (SNP) panels, the use of whole-genome sequence data is expected to enable the direct estimation of the effects of causal mutations on a given trait. This could lead to higher reliabilities of genomic predictions compared to those based on SNP genotypes. Also, at each generation of selection, recombination events between a SNP and a mutation can cause decay in reliability of genomic predictions based on markers rather than on the causal variants. Our objective was to investigate the use of imputed whole-genome sequence genotypes versus high-density SNP genotypes on (the persistency of) the reliability of genomic predictions using real cattle data. METHODS: Highly accurate phenotypes based on daughter performance and Illumina BovineHD Beadchip genotypes were available for 5503 Holstein Friesian bulls. The BovineHD genotypes (631,428 SNPs) of each bull were used to impute whole-genome sequence genotypes (12,590,056 SNPs) using the Beagle software. Imputation was done using a multi-breed reference panel of 429 sequenced individuals. Genomic estimated breeding values for three traits were predicted using a Bayesian stochastic search variable selection (BSSVS) model and a genome-enabled best linear unbiased prediction model (GBLUP). Reliabilities of predictions were based on 2087 validation bulls, while the other 3416 bulls were used for training. RESULTS: Prediction reliabilities ranged from 0.37 to 0.52. BSSVS performed better than GBLUP in all cases. Reliabilities of genomic predictions were slightly lower with imputed sequence data than with BovineHD chip data. Also, the reliabilities tended to be lower for both sequence data and BovineHD chip data when relationships between training animals were low. No increase in persistency of prediction reliability using imputed sequence data was observed. CONCLUSIONS: Compared to BovineHD genotype data, using imputed sequence data for genomic prediction produced no advantage. To investigate the putative advantage of genomic prediction using (imputed) sequence data, a training set with a larger number of individuals that are distantly related to each other and genomic prediction models that incorporate biological information on the SNPs or that apply stricter SNP pre-selection should be considered. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12711-015-0149-x) contains supplementary material, which is available to authorized users. BioMed Central 2015-09-17 /pmc/articles/PMC4574568/ /pubmed/26381777 http://dx.doi.org/10.1186/s12711-015-0149-x Text en © van Binsbergen et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
van Binsbergen, Rianne
Calus, Mario P. L.
Bink, Marco C. A. M.
van Eeuwijk, Fred A.
Schrooten, Chris
Veerkamp, Roel F.
Genomic prediction using imputed whole-genome sequence data in Holstein Friesian cattle
title Genomic prediction using imputed whole-genome sequence data in Holstein Friesian cattle
title_full Genomic prediction using imputed whole-genome sequence data in Holstein Friesian cattle
title_fullStr Genomic prediction using imputed whole-genome sequence data in Holstein Friesian cattle
title_full_unstemmed Genomic prediction using imputed whole-genome sequence data in Holstein Friesian cattle
title_short Genomic prediction using imputed whole-genome sequence data in Holstein Friesian cattle
title_sort genomic prediction using imputed whole-genome sequence data in holstein friesian cattle
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4574568/
https://www.ncbi.nlm.nih.gov/pubmed/26381777
http://dx.doi.org/10.1186/s12711-015-0149-x
work_keys_str_mv AT vanbinsbergenrianne genomicpredictionusingimputedwholegenomesequencedatainholsteinfriesiancattle
AT calusmariopl genomicpredictionusingimputedwholegenomesequencedatainholsteinfriesiancattle
AT binkmarcocam genomicpredictionusingimputedwholegenomesequencedatainholsteinfriesiancattle
AT vaneeuwijkfreda genomicpredictionusingimputedwholegenomesequencedatainholsteinfriesiancattle
AT schrootenchris genomicpredictionusingimputedwholegenomesequencedatainholsteinfriesiancattle
AT veerkamproelf genomicpredictionusingimputedwholegenomesequencedatainholsteinfriesiancattle