Cargando…

Comparing SNP panels and statistical methods for estimating genomic breed composition of individual animals in ten cattle breeds

BACKGROUND: SNPs are informative to estimate genomic breed composition (GBC) of individual animals, but selected SNPs for this purpose were not made available in the commercial bovine SNP chips prior to the present study. The primary objective of the present study was to select five common SNP panel...

Descripción completa

Detalles Bibliográficos
Autores principales: He, Jun, Guo, Yage, Xu, Jiaqi, Li, Hao, Fuller, Anna, Tait, Richard G., Wu, Xiao-Lin, Bauck, Stewart
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6085684/
https://www.ncbi.nlm.nih.gov/pubmed/30092776
http://dx.doi.org/10.1186/s12863-018-0654-3
_version_ 1783346383717138432
author He, Jun
Guo, Yage
Xu, Jiaqi
Li, Hao
Fuller, Anna
Tait, Richard G.
Wu, Xiao-Lin
Bauck, Stewart
author_facet He, Jun
Guo, Yage
Xu, Jiaqi
Li, Hao
Fuller, Anna
Tait, Richard G.
Wu, Xiao-Lin
Bauck, Stewart
author_sort He, Jun
collection PubMed
description BACKGROUND: SNPs are informative to estimate genomic breed composition (GBC) of individual animals, but selected SNPs for this purpose were not made available in the commercial bovine SNP chips prior to the present study. The primary objective of the present study was to select five common SNP panels for estimating GBC of individual animals initially involving 10 cattle breeds (two dairy breeds and eight beef breeds). The performance of the five common SNP panels was evaluated based on admixture model and linear regression model, respectively. Finally, the downstream implication of GBC on genomic prediction accuracies was investigated and discussed in a Santa Gertrudis cattle population. RESULTS: There were 15,708 common SNPs across five currently-available commercial bovine SNP chips. From this set, four subsets (1,000, 3,000, 5,000, and 10,000 SNPs) were selected by maximizing average Euclidean distance (AED) of SNP allelic frequencies among the ten cattle breeds. For 198 animals presented as Akaushi, estimated GBC of the Akaushi breed (GBCA) based on the admixture model agreed very well among the five SNP panels, identifying 166 animals with GBCA = 1. Using the same SNP panels, the linear regression approach reported fewer animals with GBCA = 1. Nevertheless, estimated GBCA using both models were highly correlated (r = 0.953 to 0.992). In the genomic prediction of a Santa Gertrudis population (and crosses), the results showed that the predictability of molecular breeding values using SNP effects obtained from 1,225 animals with no less than 0.90 GBC of Santa Gertrudis (GBCSG) decreased on crossbred animals with lower GBCSG. CONCLUSIONS: Of the two statistical models used to compute GBC, the admixture model gave more consistent results among the five selected SNP panels than the linear regression model. The availability of these common SNP panels facilitates identification and estimation of breed compositions using currently-available bovine SNP chips. In view of utility, the 1 K panel is the most cost effective and it is convenient to be included as add-on content in future development of bovine SNP chips, whereas the 10 K and 16 K SNP panels can be more resourceful if used independently for imputation to intermediate or high-density genotypes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12863-018-0654-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6085684
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-60856842018-08-16 Comparing SNP panels and statistical methods for estimating genomic breed composition of individual animals in ten cattle breeds He, Jun Guo, Yage Xu, Jiaqi Li, Hao Fuller, Anna Tait, Richard G. Wu, Xiao-Lin Bauck, Stewart BMC Genet Research Article BACKGROUND: SNPs are informative to estimate genomic breed composition (GBC) of individual animals, but selected SNPs for this purpose were not made available in the commercial bovine SNP chips prior to the present study. The primary objective of the present study was to select five common SNP panels for estimating GBC of individual animals initially involving 10 cattle breeds (two dairy breeds and eight beef breeds). The performance of the five common SNP panels was evaluated based on admixture model and linear regression model, respectively. Finally, the downstream implication of GBC on genomic prediction accuracies was investigated and discussed in a Santa Gertrudis cattle population. RESULTS: There were 15,708 common SNPs across five currently-available commercial bovine SNP chips. From this set, four subsets (1,000, 3,000, 5,000, and 10,000 SNPs) were selected by maximizing average Euclidean distance (AED) of SNP allelic frequencies among the ten cattle breeds. For 198 animals presented as Akaushi, estimated GBC of the Akaushi breed (GBCA) based on the admixture model agreed very well among the five SNP panels, identifying 166 animals with GBCA = 1. Using the same SNP panels, the linear regression approach reported fewer animals with GBCA = 1. Nevertheless, estimated GBCA using both models were highly correlated (r = 0.953 to 0.992). In the genomic prediction of a Santa Gertrudis population (and crosses), the results showed that the predictability of molecular breeding values using SNP effects obtained from 1,225 animals with no less than 0.90 GBC of Santa Gertrudis (GBCSG) decreased on crossbred animals with lower GBCSG. CONCLUSIONS: Of the two statistical models used to compute GBC, the admixture model gave more consistent results among the five selected SNP panels than the linear regression model. The availability of these common SNP panels facilitates identification and estimation of breed compositions using currently-available bovine SNP chips. In view of utility, the 1 K panel is the most cost effective and it is convenient to be included as add-on content in future development of bovine SNP chips, whereas the 10 K and 16 K SNP panels can be more resourceful if used independently for imputation to intermediate or high-density genotypes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12863-018-0654-3) contains supplementary material, which is available to authorized users. BioMed Central 2018-08-09 /pmc/articles/PMC6085684/ /pubmed/30092776 http://dx.doi.org/10.1186/s12863-018-0654-3 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
He, Jun
Guo, Yage
Xu, Jiaqi
Li, Hao
Fuller, Anna
Tait, Richard G.
Wu, Xiao-Lin
Bauck, Stewart
Comparing SNP panels and statistical methods for estimating genomic breed composition of individual animals in ten cattle breeds
title Comparing SNP panels and statistical methods for estimating genomic breed composition of individual animals in ten cattle breeds
title_full Comparing SNP panels and statistical methods for estimating genomic breed composition of individual animals in ten cattle breeds
title_fullStr Comparing SNP panels and statistical methods for estimating genomic breed composition of individual animals in ten cattle breeds
title_full_unstemmed Comparing SNP panels and statistical methods for estimating genomic breed composition of individual animals in ten cattle breeds
title_short Comparing SNP panels and statistical methods for estimating genomic breed composition of individual animals in ten cattle breeds
title_sort comparing snp panels and statistical methods for estimating genomic breed composition of individual animals in ten cattle breeds
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6085684/
https://www.ncbi.nlm.nih.gov/pubmed/30092776
http://dx.doi.org/10.1186/s12863-018-0654-3
work_keys_str_mv AT hejun comparingsnppanelsandstatisticalmethodsforestimatinggenomicbreedcompositionofindividualanimalsintencattlebreeds
AT guoyage comparingsnppanelsandstatisticalmethodsforestimatinggenomicbreedcompositionofindividualanimalsintencattlebreeds
AT xujiaqi comparingsnppanelsandstatisticalmethodsforestimatinggenomicbreedcompositionofindividualanimalsintencattlebreeds
AT lihao comparingsnppanelsandstatisticalmethodsforestimatinggenomicbreedcompositionofindividualanimalsintencattlebreeds
AT fulleranna comparingsnppanelsandstatisticalmethodsforestimatinggenomicbreedcompositionofindividualanimalsintencattlebreeds
AT taitrichardg comparingsnppanelsandstatisticalmethodsforestimatinggenomicbreedcompositionofindividualanimalsintencattlebreeds
AT wuxiaolin comparingsnppanelsandstatisticalmethodsforestimatinggenomicbreedcompositionofindividualanimalsintencattlebreeds
AT bauckstewart comparingsnppanelsandstatisticalmethodsforestimatinggenomicbreedcompositionofindividualanimalsintencattlebreeds