Cargando…

Imputation of high-density genotypes in the Fleckvieh cattle population

BACKGROUND: Currently, genome-wide evaluation of cattle populations is based on SNP-genotyping using ~ 54 000 SNP. Increasing the number of markers might improve genomic predictions and power of genome-wide association studies. Imputation of genotypes makes it possible to extrapolate genotypes from...

Descripción completa

Detalles Bibliográficos
Autores principales: Pausch, Hubert, Aigner, Bernhard, Emmerling, Reiner, Edel, Christian, Götz, Kay-Uwe, Fries, Ruedi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3598996/
https://www.ncbi.nlm.nih.gov/pubmed/23406470
http://dx.doi.org/10.1186/1297-9686-45-3
_version_ 1782262867705200640
author Pausch, Hubert
Aigner, Bernhard
Emmerling, Reiner
Edel, Christian
Götz, Kay-Uwe
Fries, Ruedi
author_facet Pausch, Hubert
Aigner, Bernhard
Emmerling, Reiner
Edel, Christian
Götz, Kay-Uwe
Fries, Ruedi
author_sort Pausch, Hubert
collection PubMed
description BACKGROUND: Currently, genome-wide evaluation of cattle populations is based on SNP-genotyping using ~ 54 000 SNP. Increasing the number of markers might improve genomic predictions and power of genome-wide association studies. Imputation of genotypes makes it possible to extrapolate genotypes from lower to higher density arrays based on a representative reference sample for which genotypes are obtained at higher density. METHODS: Genotypes using 639 214 SNP were available for 797 bulls of the Fleckvieh cattle breed. The data set was divided into a reference and a validation population. Genotypes for all SNP except those included in the BovineSNP50 Bead chip were masked and subsequently imputed for animals of the validation population. Imputation of genotypes was performed with Beagle, findhap.f90, MaCH and Minimac. The accuracy of the imputed genotypes was assessed for four different scenarios including 50, 100, 200 and 400 animals as reference population. The reference animals were selected to account for 78.03%, 89.21%, 97.47% and > 99% of the gene pool of the genotyped population, respectively. RESULTS: Imputation accuracy increased as the number of animals and relatives in the reference population increased. Population-based algorithms provided highly reliable imputation of genotypes, even for scenarios with 50 and 100 reference animals only. Using MaCH and Minimac, the correlation between true and imputed genotypes was > 0.975 with 100 reference animals only. Pre-phasing the genotypes of both the reference and validation populations not only provided highly accurate imputed genotypes but was also computationally efficient. Genome-wide analysis of imputation accuracy led to the identification of many misplaced SNP. CONCLUSIONS: Genotyping key animals at high density and subsequent population-based genotype imputation yield high imputation accuracy. Pre-phasing the genotypes of the reference and validation populations is computationally efficient and results in high imputation accuracy, even when the reference population is small.
format Online
Article
Text
id pubmed-3598996
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35989962013-03-29 Imputation of high-density genotypes in the Fleckvieh cattle population Pausch, Hubert Aigner, Bernhard Emmerling, Reiner Edel, Christian Götz, Kay-Uwe Fries, Ruedi Genet Sel Evol Research BACKGROUND: Currently, genome-wide evaluation of cattle populations is based on SNP-genotyping using ~ 54 000 SNP. Increasing the number of markers might improve genomic predictions and power of genome-wide association studies. Imputation of genotypes makes it possible to extrapolate genotypes from lower to higher density arrays based on a representative reference sample for which genotypes are obtained at higher density. METHODS: Genotypes using 639 214 SNP were available for 797 bulls of the Fleckvieh cattle breed. The data set was divided into a reference and a validation population. Genotypes for all SNP except those included in the BovineSNP50 Bead chip were masked and subsequently imputed for animals of the validation population. Imputation of genotypes was performed with Beagle, findhap.f90, MaCH and Minimac. The accuracy of the imputed genotypes was assessed for four different scenarios including 50, 100, 200 and 400 animals as reference population. The reference animals were selected to account for 78.03%, 89.21%, 97.47% and > 99% of the gene pool of the genotyped population, respectively. RESULTS: Imputation accuracy increased as the number of animals and relatives in the reference population increased. Population-based algorithms provided highly reliable imputation of genotypes, even for scenarios with 50 and 100 reference animals only. Using MaCH and Minimac, the correlation between true and imputed genotypes was > 0.975 with 100 reference animals only. Pre-phasing the genotypes of both the reference and validation populations not only provided highly accurate imputed genotypes but was also computationally efficient. Genome-wide analysis of imputation accuracy led to the identification of many misplaced SNP. CONCLUSIONS: Genotyping key animals at high density and subsequent population-based genotype imputation yield high imputation accuracy. Pre-phasing the genotypes of the reference and validation populations is computationally efficient and results in high imputation accuracy, even when the reference population is small. BioMed Central 2013-02-13 /pmc/articles/PMC3598996/ /pubmed/23406470 http://dx.doi.org/10.1186/1297-9686-45-3 Text en Copyright ©2013 Pausch et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Pausch, Hubert
Aigner, Bernhard
Emmerling, Reiner
Edel, Christian
Götz, Kay-Uwe
Fries, Ruedi
Imputation of high-density genotypes in the Fleckvieh cattle population
title Imputation of high-density genotypes in the Fleckvieh cattle population
title_full Imputation of high-density genotypes in the Fleckvieh cattle population
title_fullStr Imputation of high-density genotypes in the Fleckvieh cattle population
title_full_unstemmed Imputation of high-density genotypes in the Fleckvieh cattle population
title_short Imputation of high-density genotypes in the Fleckvieh cattle population
title_sort imputation of high-density genotypes in the fleckvieh cattle population
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3598996/
https://www.ncbi.nlm.nih.gov/pubmed/23406470
http://dx.doi.org/10.1186/1297-9686-45-3
work_keys_str_mv AT pauschhubert imputationofhighdensitygenotypesinthefleckviehcattlepopulation
AT aignerbernhard imputationofhighdensitygenotypesinthefleckviehcattlepopulation
AT emmerlingreiner imputationofhighdensitygenotypesinthefleckviehcattlepopulation
AT edelchristian imputationofhighdensitygenotypesinthefleckviehcattlepopulation
AT gotzkayuwe imputationofhighdensitygenotypesinthefleckviehcattlepopulation
AT friesruedi imputationofhighdensitygenotypesinthefleckviehcattlepopulation