Cargando…

Small SNP panels for breed proportion estimation in Indian crossbred dairy cattle

Reliably identifying breed proportions in crossbred cattle in smallholder farms is a crucial step to improve mating decisions and optimizing management in these systems. High‐density genotype information is able to estimate higher‐order breed proportions accurately, but, are too expensive for mass a...

Descripción completa

Detalles Bibliográficos
Autores principales: Strucken, Eva M., Swaminathan, Marimuthu, Gibson, John P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8519156/
https://www.ncbi.nlm.nih.gov/pubmed/33687116
http://dx.doi.org/10.1111/jbg.12544
Descripción
Sumario:Reliably identifying breed proportions in crossbred cattle in smallholder farms is a crucial step to improve mating decisions and optimizing management in these systems. High‐density genotype information is able to estimate higher‐order breed proportions accurately, but, are too expensive for mass application in smallholder systems. We used high‐density genotype information (777 k SNPs) of 623 crossbred cattle from India that had Holstein‐Friesian (HFX) and/or Jersey and indigenous breeds in their ancestry to select a smaller number of SNPs for breed proportion estimation. The accuracy of estimates obtained from panels with 100–500 SNP was compared to estimates based on all SNPs. Panels were selected for highest absolute allele frequency difference between exotic dairy versus indigenous Bos indicus, or between HFX versus Jersey breeds. A step‐wise pruning approach was developed showing that and increased physical distances between markers of 8.5 Mb improved breed proportion estimation compared to a standard 1 Mb distance. A panel of 500 SNPs optimized to estimate HFX versus Jersey versus indicine ancestry was able to estimate indicine breed proportions with r (2) = .991, HFX proportions with r (2) = .979 and Jersey proportions with r (2) = .949. The number of markers was a deciding factor in estimation accuracy, together with the distribution of markers across the genome.