Cargando…

Using pooled data for genomic prediction in a bivariate framework with missing data

Pooling samples to derive group genotypes can enable the economically efficient use of commercial animals within genetic evaluations. To test a multivariate framework for genetic evaluations using pooled data, simulation was used to mimic a beef cattle population including two moderately heritable t...

Descripción completa

Detalles Bibliográficos
Autores principales: Baller, Johnna L., Kachman, Stephen D., Kuehn, Larry A., Spangler, Matthew L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9544112/
https://www.ncbi.nlm.nih.gov/pubmed/35698863
http://dx.doi.org/10.1111/jbg.12727
_version_ 1784804527158853632
author Baller, Johnna L.
Kachman, Stephen D.
Kuehn, Larry A.
Spangler, Matthew L.
author_facet Baller, Johnna L.
Kachman, Stephen D.
Kuehn, Larry A.
Spangler, Matthew L.
author_sort Baller, Johnna L.
collection PubMed
description Pooling samples to derive group genotypes can enable the economically efficient use of commercial animals within genetic evaluations. To test a multivariate framework for genetic evaluations using pooled data, simulation was used to mimic a beef cattle population including two moderately heritable traits with varying genetic correlations, genotypes and pedigree data. There were 15 generations (n = 32,000; random selection and mating), and the last generation was subjected to genotyping through pooling. Missing records were induced in two ways: (a) sequential culling and (b) random missing records. Gaps in genotyping were also explored whereby genotyping occurred through generation 13 or 14. Pools of 1, 20, 50 and 100 animals were constructed randomly or by minimizing phenotypic variation. The EBV was estimated using a bivariate single‐step genomic best linear unbiased prediction model. Pools of 20 animals constructed by minimizing phenotypic variation generally led to accuracies that were not different than using individual progeny data. Gaps in genotyping led to significantly different EBV accuracies (p < .05) for sires and dams born in the generation nearest the pools. Pooling of any size generally led to larger accuracies than no information from generation 15 regardless of the way missing records arose, the percentage of records available or the genetic correlation. Pooling to aid in the use of commercial data in genetic evaluations can be utilized in multivariate cases with varying relationships between the traits and in the presence of systematic and randomly missing phenotypes.
format Online
Article
Text
id pubmed-9544112
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-95441122022-10-14 Using pooled data for genomic prediction in a bivariate framework with missing data Baller, Johnna L. Kachman, Stephen D. Kuehn, Larry A. Spangler, Matthew L. J Anim Breed Genet Original Articles Pooling samples to derive group genotypes can enable the economically efficient use of commercial animals within genetic evaluations. To test a multivariate framework for genetic evaluations using pooled data, simulation was used to mimic a beef cattle population including two moderately heritable traits with varying genetic correlations, genotypes and pedigree data. There were 15 generations (n = 32,000; random selection and mating), and the last generation was subjected to genotyping through pooling. Missing records were induced in two ways: (a) sequential culling and (b) random missing records. Gaps in genotyping were also explored whereby genotyping occurred through generation 13 or 14. Pools of 1, 20, 50 and 100 animals were constructed randomly or by minimizing phenotypic variation. The EBV was estimated using a bivariate single‐step genomic best linear unbiased prediction model. Pools of 20 animals constructed by minimizing phenotypic variation generally led to accuracies that were not different than using individual progeny data. Gaps in genotyping led to significantly different EBV accuracies (p < .05) for sires and dams born in the generation nearest the pools. Pooling of any size generally led to larger accuracies than no information from generation 15 regardless of the way missing records arose, the percentage of records available or the genetic correlation. Pooling to aid in the use of commercial data in genetic evaluations can be utilized in multivariate cases with varying relationships between the traits and in the presence of systematic and randomly missing phenotypes. John Wiley and Sons Inc. 2022-06-14 2022-09 /pmc/articles/PMC9544112/ /pubmed/35698863 http://dx.doi.org/10.1111/jbg.12727 Text en © 2022 The Authors. Journal of Animal Breeding and Genetics published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle Original Articles
Baller, Johnna L.
Kachman, Stephen D.
Kuehn, Larry A.
Spangler, Matthew L.
Using pooled data for genomic prediction in a bivariate framework with missing data
title Using pooled data for genomic prediction in a bivariate framework with missing data
title_full Using pooled data for genomic prediction in a bivariate framework with missing data
title_fullStr Using pooled data for genomic prediction in a bivariate framework with missing data
title_full_unstemmed Using pooled data for genomic prediction in a bivariate framework with missing data
title_short Using pooled data for genomic prediction in a bivariate framework with missing data
title_sort using pooled data for genomic prediction in a bivariate framework with missing data
topic Original Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9544112/
https://www.ncbi.nlm.nih.gov/pubmed/35698863
http://dx.doi.org/10.1111/jbg.12727
work_keys_str_mv AT ballerjohnnal usingpooleddataforgenomicpredictioninabivariateframeworkwithmissingdata
AT kachmanstephend usingpooleddataforgenomicpredictioninabivariateframeworkwithmissingdata
AT kuehnlarrya usingpooleddataforgenomicpredictioninabivariateframeworkwithmissingdata
AT spanglermatthewl usingpooleddataforgenomicpredictioninabivariateframeworkwithmissingdata