Cargando…

Prioritizing animals for dense genotyping in order to impute missing genotypes of sparsely genotyped animals

BACKGROUND: Genotyping accounts for a substantial part of the cost of genomic selection (GS). Using both dense and sparse SNP chips, together with imputation of missing genotypes, can reduce these costs. The aim of this study was to identify the set of candidates that are most important for dense ge...

Descripción completa

Detalles Bibliográficos
Autores principales:	Yu, Xijiang, Woolliams, John A, Meuwissen, Theo HE
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2014
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4283150/ https://www.ncbi.nlm.nih.gov/pubmed/25158690 http://dx.doi.org/10.1186/1297-9686-46-46

_version_	1782351225989103616
author	Yu, Xijiang Woolliams, John A Meuwissen, Theo HE
author_facet	Yu, Xijiang Woolliams, John A Meuwissen, Theo HE
author_sort	Yu, Xijiang
collection	PubMed
description	BACKGROUND: Genotyping accounts for a substantial part of the cost of genomic selection (GS). Using both dense and sparse SNP chips, together with imputation of missing genotypes, can reduce these costs. The aim of this study was to identify the set of candidates that are most important for dense genotyping, when they are used to impute the genotypes of sparsely genotyped animals. In a real pig pedigree, the 2500 most recently born pigs of the last generation, i.e. the target animals, were used for sparse genotyping. Their missing genotypes were imputed using either Beagle or LDMIP from T densely genotyped candidates chosen from the whole pedigree. A new optimization method was derived to identify the best animals for dense genotyping, which minimized the conditional genetic variance of the target animals, using either the pedigree-based relationship matrix (MCA), or a genotypic relationship matrix based on sparse marker genotypes (MCG). These, and five other methods for selecting the T animals were compared, using T = 100 or 200 animals, SNP genotypes were obtained assuming Ne =100 or 200, and MAF thresholds set to D = 0.01, 0.05 or 0.10. The performances of the methods were compared using the following criteria: call rate of true genotypes, accuracy of genotype prediction, and accuracy of genomic evaluations using the imputed genotypes. RESULTS: For all criteria, MCA and MCG performed better than other selection methods, significantly so for all methods other than selection of sires with the largest numbers of offspring. Methods that choose animals that have the closest average relationship or contribution to the target population gave the lowest accuracy of imputation, in some cases worse than random selection, and should be avoided in practice. CONCLUSION: Minimization of the conditional variance of the genotypes in target animals provided an effective optimization procedure for prioritizing animals for genotyping or sequencing. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1297-9686-46-46) contains supplementary material, which is available to authorized users.
format	Online Article Text
id	pubmed-4283150
institution	National Center for Biotechnology Information
language	English
publishDate	2014
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-42831502015-01-06 Prioritizing animals for dense genotyping in order to impute missing genotypes of sparsely genotyped animals Yu, Xijiang Woolliams, John A Meuwissen, Theo HE Genet Sel Evol Research BACKGROUND: Genotyping accounts for a substantial part of the cost of genomic selection (GS). Using both dense and sparse SNP chips, together with imputation of missing genotypes, can reduce these costs. The aim of this study was to identify the set of candidates that are most important for dense genotyping, when they are used to impute the genotypes of sparsely genotyped animals. In a real pig pedigree, the 2500 most recently born pigs of the last generation, i.e. the target animals, were used for sparse genotyping. Their missing genotypes were imputed using either Beagle or LDMIP from T densely genotyped candidates chosen from the whole pedigree. A new optimization method was derived to identify the best animals for dense genotyping, which minimized the conditional genetic variance of the target animals, using either the pedigree-based relationship matrix (MCA), or a genotypic relationship matrix based on sparse marker genotypes (MCG). These, and five other methods for selecting the T animals were compared, using T = 100 or 200 animals, SNP genotypes were obtained assuming Ne =100 or 200, and MAF thresholds set to D = 0.01, 0.05 or 0.10. The performances of the methods were compared using the following criteria: call rate of true genotypes, accuracy of genotype prediction, and accuracy of genomic evaluations using the imputed genotypes. RESULTS: For all criteria, MCA and MCG performed better than other selection methods, significantly so for all methods other than selection of sires with the largest numbers of offspring. Methods that choose animals that have the closest average relationship or contribution to the target population gave the lowest accuracy of imputation, in some cases worse than random selection, and should be avoided in practice. CONCLUSION: Minimization of the conditional variance of the genotypes in target animals provided an effective optimization procedure for prioritizing animals for genotyping or sequencing. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1297-9686-46-46) contains supplementary material, which is available to authorized users. BioMed Central 2014-08-26 /pmc/articles/PMC4283150/ /pubmed/25158690 http://dx.doi.org/10.1186/1297-9686-46-46 Text en © Yu et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Yu, Xijiang Woolliams, John A Meuwissen, Theo HE Prioritizing animals for dense genotyping in order to impute missing genotypes of sparsely genotyped animals
title	Prioritizing animals for dense genotyping in order to impute missing genotypes of sparsely genotyped animals
title_full	Prioritizing animals for dense genotyping in order to impute missing genotypes of sparsely genotyped animals
title_fullStr	Prioritizing animals for dense genotyping in order to impute missing genotypes of sparsely genotyped animals
title_full_unstemmed	Prioritizing animals for dense genotyping in order to impute missing genotypes of sparsely genotyped animals
title_short	Prioritizing animals for dense genotyping in order to impute missing genotypes of sparsely genotyped animals
title_sort	prioritizing animals for dense genotyping in order to impute missing genotypes of sparsely genotyped animals
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4283150/ https://www.ncbi.nlm.nih.gov/pubmed/25158690 http://dx.doi.org/10.1186/1297-9686-46-46
work_keys_str_mv	AT yuxijiang prioritizinganimalsfordensegenotypinginordertoimputemissinggenotypesofsparselygenotypedanimals AT woolliamsjohna prioritizinganimalsfordensegenotypinginordertoimputemissinggenotypesofsparselygenotypedanimals AT meuwissentheohe prioritizinganimalsfordensegenotypinginordertoimputemissinggenotypesofsparselygenotypedanimals

Prioritizing animals for dense genotyping in order to impute missing genotypes of sparsely genotyped animals

Ejemplares similares