Cargando…

Imputation of sequence level genotypes in the Franches-Montagnes horse breed

BACKGROUND: A cost-effective strategy to increase the density of available markers within a population is to sequence a small proportion of the population and impute whole-genome sequence data for the remaining population. Increased densities of typed markers are advantageous for genome-wide associa...

Descripción completa

Detalles Bibliográficos
Autores principales: Frischknecht, Mirjam, Neuditschko, Markus, Jagannathan, Vidhya, Drögemüller, Cord, Tetens, Jens, Thaller, Georg, Leeb, Tosso, Rieder, Stefan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4180851/
https://www.ncbi.nlm.nih.gov/pubmed/25927638
http://dx.doi.org/10.1186/s12711-014-0063-7
_version_ 1782337271670767616
author Frischknecht, Mirjam
Neuditschko, Markus
Jagannathan, Vidhya
Drögemüller, Cord
Tetens, Jens
Thaller, Georg
Leeb, Tosso
Rieder, Stefan
author_facet Frischknecht, Mirjam
Neuditschko, Markus
Jagannathan, Vidhya
Drögemüller, Cord
Tetens, Jens
Thaller, Georg
Leeb, Tosso
Rieder, Stefan
author_sort Frischknecht, Mirjam
collection PubMed
description BACKGROUND: A cost-effective strategy to increase the density of available markers within a population is to sequence a small proportion of the population and impute whole-genome sequence data for the remaining population. Increased densities of typed markers are advantageous for genome-wide association studies (GWAS) and genomic predictions. METHODS: We obtained genotypes for 54 602 SNPs (single nucleotide polymorphisms) in 1077 Franches-Montagnes (FM) horses and Illumina paired-end whole-genome sequencing data for 30 FM horses and 14 Warmblood horses. After variant calling, the sequence-derived SNP genotypes (~13 million SNPs) were used for genotype imputation with the software programs Beagle, Impute2 and FImpute. RESULTS: The mean imputation accuracy of FM horses using Impute2 was 92.0%. Imputation accuracy using Beagle and FImpute was 74.3% and 77.2%, respectively. In addition, for Impute2 we determined the imputation accuracy of all individual horses in the validation population, which ranged from 85.7% to 99.8%. The subsequent inclusion of Warmblood sequence data further increased the correlation between true and imputed genotypes for most horses, especially for horses with a high level of admixture. The final imputation accuracy of the horses ranged from 91.2% to 99.5%. CONCLUSIONS: Using Impute2, the imputation accuracy was higher than 91% for all horses in the validation population, which indicates that direct imputation of 50k SNP-chip data to sequence level genotypes is feasible in the FM population. The individual imputation accuracy depended mainly on the applied software and the level of admixture. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12711-014-0063-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4180851
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41808512014-10-03 Imputation of sequence level genotypes in the Franches-Montagnes horse breed Frischknecht, Mirjam Neuditschko, Markus Jagannathan, Vidhya Drögemüller, Cord Tetens, Jens Thaller, Georg Leeb, Tosso Rieder, Stefan Genet Sel Evol Research BACKGROUND: A cost-effective strategy to increase the density of available markers within a population is to sequence a small proportion of the population and impute whole-genome sequence data for the remaining population. Increased densities of typed markers are advantageous for genome-wide association studies (GWAS) and genomic predictions. METHODS: We obtained genotypes for 54 602 SNPs (single nucleotide polymorphisms) in 1077 Franches-Montagnes (FM) horses and Illumina paired-end whole-genome sequencing data for 30 FM horses and 14 Warmblood horses. After variant calling, the sequence-derived SNP genotypes (~13 million SNPs) were used for genotype imputation with the software programs Beagle, Impute2 and FImpute. RESULTS: The mean imputation accuracy of FM horses using Impute2 was 92.0%. Imputation accuracy using Beagle and FImpute was 74.3% and 77.2%, respectively. In addition, for Impute2 we determined the imputation accuracy of all individual horses in the validation population, which ranged from 85.7% to 99.8%. The subsequent inclusion of Warmblood sequence data further increased the correlation between true and imputed genotypes for most horses, especially for horses with a high level of admixture. The final imputation accuracy of the horses ranged from 91.2% to 99.5%. CONCLUSIONS: Using Impute2, the imputation accuracy was higher than 91% for all horses in the validation population, which indicates that direct imputation of 50k SNP-chip data to sequence level genotypes is feasible in the FM population. The individual imputation accuracy depended mainly on the applied software and the level of admixture. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12711-014-0063-7) contains supplementary material, which is available to authorized users. BioMed Central 2014-10-01 /pmc/articles/PMC4180851/ /pubmed/25927638 http://dx.doi.org/10.1186/s12711-014-0063-7 Text en © Frischknecht et al.; licensee BioMed Central Ltd. 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Frischknecht, Mirjam
Neuditschko, Markus
Jagannathan, Vidhya
Drögemüller, Cord
Tetens, Jens
Thaller, Georg
Leeb, Tosso
Rieder, Stefan
Imputation of sequence level genotypes in the Franches-Montagnes horse breed
title Imputation of sequence level genotypes in the Franches-Montagnes horse breed
title_full Imputation of sequence level genotypes in the Franches-Montagnes horse breed
title_fullStr Imputation of sequence level genotypes in the Franches-Montagnes horse breed
title_full_unstemmed Imputation of sequence level genotypes in the Franches-Montagnes horse breed
title_short Imputation of sequence level genotypes in the Franches-Montagnes horse breed
title_sort imputation of sequence level genotypes in the franches-montagnes horse breed
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4180851/
https://www.ncbi.nlm.nih.gov/pubmed/25927638
http://dx.doi.org/10.1186/s12711-014-0063-7
work_keys_str_mv AT frischknechtmirjam imputationofsequencelevelgenotypesinthefranchesmontagneshorsebreed
AT neuditschkomarkus imputationofsequencelevelgenotypesinthefranchesmontagneshorsebreed
AT jagannathanvidhya imputationofsequencelevelgenotypesinthefranchesmontagneshorsebreed
AT drogemullercord imputationofsequencelevelgenotypesinthefranchesmontagneshorsebreed
AT tetensjens imputationofsequencelevelgenotypesinthefranchesmontagneshorsebreed
AT thallergeorg imputationofsequencelevelgenotypesinthefranchesmontagneshorsebreed
AT leebtosso imputationofsequencelevelgenotypesinthefranchesmontagneshorsebreed
AT riederstefan imputationofsequencelevelgenotypesinthefranchesmontagneshorsebreed