Cargando…
Estimation of dam line composition of 3-way crossbred animals using genomic information
BACKGROUND: In genomic prediction including data of 3- or 4-way crossbred animals, line composition is usually fitted as a regression on expected line proportions, which are 0.5, 0.25 and 0.25, respectively, for 3-way crossbred animals. However, actual line proportions for the dam lines can vary bet...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9199202/ https://www.ncbi.nlm.nih.gov/pubmed/35705918 http://dx.doi.org/10.1186/s12711-022-00728-4 |
_version_ | 1784727801384927232 |
---|---|
author | Calus, Mario P. L. Henshall, John M. Hawken, Rachel Vandenplas, Jérémie |
author_facet | Calus, Mario P. L. Henshall, John M. Hawken, Rachel Vandenplas, Jérémie |
author_sort | Calus, Mario P. L. |
collection | PubMed |
description | BACKGROUND: In genomic prediction including data of 3- or 4-way crossbred animals, line composition is usually fitted as a regression on expected line proportions, which are 0.5, 0.25 and 0.25, respectively, for 3-way crossbred animals. However, actual line proportions for the dam lines can vary between ~ 0.1 and 0.4, and ignoring this variation may affect the genomic estimated breeding values of purebred selection candidates. Our aim was to validate a proposed gold standard to evaluate different approaches for estimating line proportions using simulated data, and to subsequently use this in actual 3-way crossbred broiler data to evaluate several other methods. RESULTS: Analysis of simulated data confirmed that line proportions computed from assigned breed-origin-of-alleles (BOA) provide a very accurate gold standard, even if the parental lines are closely related. Alternative investigated methods were linear regression of genotypes on line-specific allele frequencies, maximum likelihood estimation using the program ADMIXTURE, and the genomic relationship of crossbred animals with their maternal grandparents. The results from the simulated data showed that the genomic relationship with the maternal grandparent was most accurate, and least affected by closer relationships between the dam lines. Linear regression and ADMIXTURE performed similarly for unrelated lines, but their accuracy dropped considerably when the dam lines were more closely related. In almost all cases, estimates improved after adjusting them to ensure that the sum of dam line contributions within animals was equal to 0.5, and within dam line and across animals the average was equal to 0.25. Results from the broiler data were much more similar between methods. In both cases, stringent linkage disequilibrium pruning of genotype data led to a relatively low accuracy of predicted line proportions, due to the loss of too many single nucleotide polymorphisms. CONCLUSIONS: With relatively unrelated parental lines as typical in crosses in pigs and poultry, linear regression of crossbred genotypes on line-specific allele frequencies and ADMIXTURE are very competitive methods. Thus, linear regression may be the method of choice, as it does not require genotypes of grandparents, is computationally very efficient, and easily implemented and adapted for considering the specific nature of the crossbred animals analysed. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12711-022-00728-4. |
format | Online Article Text |
id | pubmed-9199202 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-91992022022-06-16 Estimation of dam line composition of 3-way crossbred animals using genomic information Calus, Mario P. L. Henshall, John M. Hawken, Rachel Vandenplas, Jérémie Genet Sel Evol Research Article BACKGROUND: In genomic prediction including data of 3- or 4-way crossbred animals, line composition is usually fitted as a regression on expected line proportions, which are 0.5, 0.25 and 0.25, respectively, for 3-way crossbred animals. However, actual line proportions for the dam lines can vary between ~ 0.1 and 0.4, and ignoring this variation may affect the genomic estimated breeding values of purebred selection candidates. Our aim was to validate a proposed gold standard to evaluate different approaches for estimating line proportions using simulated data, and to subsequently use this in actual 3-way crossbred broiler data to evaluate several other methods. RESULTS: Analysis of simulated data confirmed that line proportions computed from assigned breed-origin-of-alleles (BOA) provide a very accurate gold standard, even if the parental lines are closely related. Alternative investigated methods were linear regression of genotypes on line-specific allele frequencies, maximum likelihood estimation using the program ADMIXTURE, and the genomic relationship of crossbred animals with their maternal grandparents. The results from the simulated data showed that the genomic relationship with the maternal grandparent was most accurate, and least affected by closer relationships between the dam lines. Linear regression and ADMIXTURE performed similarly for unrelated lines, but their accuracy dropped considerably when the dam lines were more closely related. In almost all cases, estimates improved after adjusting them to ensure that the sum of dam line contributions within animals was equal to 0.5, and within dam line and across animals the average was equal to 0.25. Results from the broiler data were much more similar between methods. In both cases, stringent linkage disequilibrium pruning of genotype data led to a relatively low accuracy of predicted line proportions, due to the loss of too many single nucleotide polymorphisms. CONCLUSIONS: With relatively unrelated parental lines as typical in crosses in pigs and poultry, linear regression of crossbred genotypes on line-specific allele frequencies and ADMIXTURE are very competitive methods. Thus, linear regression may be the method of choice, as it does not require genotypes of grandparents, is computationally very efficient, and easily implemented and adapted for considering the specific nature of the crossbred animals analysed. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12711-022-00728-4. BioMed Central 2022-06-15 /pmc/articles/PMC9199202/ /pubmed/35705918 http://dx.doi.org/10.1186/s12711-022-00728-4 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Calus, Mario P. L. Henshall, John M. Hawken, Rachel Vandenplas, Jérémie Estimation of dam line composition of 3-way crossbred animals using genomic information |
title | Estimation of dam line composition of 3-way crossbred animals using genomic information |
title_full | Estimation of dam line composition of 3-way crossbred animals using genomic information |
title_fullStr | Estimation of dam line composition of 3-way crossbred animals using genomic information |
title_full_unstemmed | Estimation of dam line composition of 3-way crossbred animals using genomic information |
title_short | Estimation of dam line composition of 3-way crossbred animals using genomic information |
title_sort | estimation of dam line composition of 3-way crossbred animals using genomic information |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9199202/ https://www.ncbi.nlm.nih.gov/pubmed/35705918 http://dx.doi.org/10.1186/s12711-022-00728-4 |
work_keys_str_mv | AT calusmariopl estimationofdamlinecompositionof3waycrossbredanimalsusinggenomicinformation AT henshalljohnm estimationofdamlinecompositionof3waycrossbredanimalsusinggenomicinformation AT hawkenrachel estimationofdamlinecompositionof3waycrossbredanimalsusinggenomicinformation AT vandenplasjeremie estimationofdamlinecompositionof3waycrossbredanimalsusinggenomicinformation |