Cargando…

Comparison of different imputation methods from low- to high-density panels using Chinese Holstein cattle

Imputation of high-density genotypes from low- or medium-density platforms is a promising way to enhance the efficiency of whole-genome selection programs at low cost. In this study, we compared the efficiency of three widely used imputation algorithms (fastPHASE, BEAGLE and findhap) using Chinese H...

Descripción completa

Detalles Bibliográficos
Autores principales: Weng, Z., Zhang, Z., Zhang, Q., Fu, W., He, S., Ding, X.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cambridge University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3608330/
https://www.ncbi.nlm.nih.gov/pubmed/23228675
http://dx.doi.org/10.1017/S1751731112002224
_version_ 1782264224998752256
author Weng, Z.
Zhang, Z.
Zhang, Q.
Fu, W.
He, S.
Ding, X.
author_facet Weng, Z.
Zhang, Z.
Zhang, Q.
Fu, W.
He, S.
Ding, X.
author_sort Weng, Z.
collection PubMed
description Imputation of high-density genotypes from low- or medium-density platforms is a promising way to enhance the efficiency of whole-genome selection programs at low cost. In this study, we compared the efficiency of three widely used imputation algorithms (fastPHASE, BEAGLE and findhap) using Chinese Holstein cattle with Illumina BovineSNP50 genotypes. A total of 2108 cattle were randomly divided into a reference population and a test population to evaluate the influence of the reference population size. Three bovine chromosomes, BTA1, 16 and 28, were used to represent large, medium and small chromosome size, respectively. We simulated different scenarios by randomly masking 20%, 40%, 80% and 95% single-nucleotide polymorphisms (SNPs) on each chromosome in the test population to mimic different SNP density panels. Illumina Bovine3K and Illumina BovineLD (6909 SNPs) information was also used. We found that the three methods showed comparable accuracy when the proportion of masked SNPs was low. However, the difference became larger when more SNPs were masked. BEAGLE performed the best and was most robust with imputation accuracies >90% in almost all situations. fastPHASE was affected by the proportion of masked SNPs, especially when the masked SNP rate was high. findhap ran the fastest, whereas its accuracies were lower than those of BEAGLE but higher than those of fastPHASE. In addition, enlarging the reference population improved the imputation accuracy for BEAGLE and findhap, but did not affect fastPHASE. Considering imputation accuracy and computational requirements, BEAGLE has been found to be more reliable for imputing genotypes from low- to high-density genotyping platforms.
format Online
Article
Text
id pubmed-3608330
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Cambridge University Press
record_format MEDLINE/PubMed
spelling pubmed-36083302013-03-26 Comparison of different imputation methods from low- to high-density panels using Chinese Holstein cattle Weng, Z. Zhang, Z. Zhang, Q. Fu, W. He, S. Ding, X. Animal Breeding and Genetics Imputation of high-density genotypes from low- or medium-density platforms is a promising way to enhance the efficiency of whole-genome selection programs at low cost. In this study, we compared the efficiency of three widely used imputation algorithms (fastPHASE, BEAGLE and findhap) using Chinese Holstein cattle with Illumina BovineSNP50 genotypes. A total of 2108 cattle were randomly divided into a reference population and a test population to evaluate the influence of the reference population size. Three bovine chromosomes, BTA1, 16 and 28, were used to represent large, medium and small chromosome size, respectively. We simulated different scenarios by randomly masking 20%, 40%, 80% and 95% single-nucleotide polymorphisms (SNPs) on each chromosome in the test population to mimic different SNP density panels. Illumina Bovine3K and Illumina BovineLD (6909 SNPs) information was also used. We found that the three methods showed comparable accuracy when the proportion of masked SNPs was low. However, the difference became larger when more SNPs were masked. BEAGLE performed the best and was most robust with imputation accuracies >90% in almost all situations. fastPHASE was affected by the proportion of masked SNPs, especially when the masked SNP rate was high. findhap ran the fastest, whereas its accuracies were lower than those of BEAGLE but higher than those of fastPHASE. In addition, enlarging the reference population improved the imputation accuracy for BEAGLE and findhap, but did not affect fastPHASE. Considering imputation accuracy and computational requirements, BEAGLE has been found to be more reliable for imputing genotypes from low- to high-density genotyping platforms. Cambridge University Press 2012-12-11 2013-05 /pmc/articles/PMC3608330/ /pubmed/23228675 http://dx.doi.org/10.1017/S1751731112002224 Text en © The Animal Consortium 2012 The online version of this article is published within an Open Access environment subject to the conditions of the Creative Commons Attribution-NonCommercial-ShareAlike licence <http://creativecommons.org/licenses/by-nc-sa/2.5/>. The written permission of Cambridge University Press must be obtained for commercial re-use.
spellingShingle Breeding and Genetics
Weng, Z.
Zhang, Z.
Zhang, Q.
Fu, W.
He, S.
Ding, X.
Comparison of different imputation methods from low- to high-density panels using Chinese Holstein cattle
title Comparison of different imputation methods from low- to high-density panels using Chinese Holstein cattle
title_full Comparison of different imputation methods from low- to high-density panels using Chinese Holstein cattle
title_fullStr Comparison of different imputation methods from low- to high-density panels using Chinese Holstein cattle
title_full_unstemmed Comparison of different imputation methods from low- to high-density panels using Chinese Holstein cattle
title_short Comparison of different imputation methods from low- to high-density panels using Chinese Holstein cattle
title_sort comparison of different imputation methods from low- to high-density panels using chinese holstein cattle
topic Breeding and Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3608330/
https://www.ncbi.nlm.nih.gov/pubmed/23228675
http://dx.doi.org/10.1017/S1751731112002224
work_keys_str_mv AT wengz comparisonofdifferentimputationmethodsfromlowtohighdensitypanelsusingchineseholsteincattle
AT zhangz comparisonofdifferentimputationmethodsfromlowtohighdensitypanelsusingchineseholsteincattle
AT zhangq comparisonofdifferentimputationmethodsfromlowtohighdensitypanelsusingchineseholsteincattle
AT fuw comparisonofdifferentimputationmethodsfromlowtohighdensitypanelsusingchineseholsteincattle
AT hes comparisonofdifferentimputationmethodsfromlowtohighdensitypanelsusingchineseholsteincattle
AT dingx comparisonofdifferentimputationmethodsfromlowtohighdensitypanelsusingchineseholsteincattle