Cargando…

Dimensionality of genomic information and performance of the Algorithm for Proven and Young for different livestock species

BACKGROUND: A genomic relationship matrix (GRM) can be inverted efficiently with the Algorithm for Proven and Young (APY) through recursion on a small number of core animals. The number of core animals is theoretically linked to effective population size (N (e)). In a simulation study, the optimal n...

Descripción completa

Detalles Bibliográficos
Autores principales: Pocrnic, Ivan, Lourenco, Daniela A. L., Masuda, Yutaka, Misztal, Ignacy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5088690/
https://www.ncbi.nlm.nih.gov/pubmed/27799053
http://dx.doi.org/10.1186/s12711-016-0261-6
_version_ 1782464147530711040
author Pocrnic, Ivan
Lourenco, Daniela A. L.
Masuda, Yutaka
Misztal, Ignacy
author_facet Pocrnic, Ivan
Lourenco, Daniela A. L.
Masuda, Yutaka
Misztal, Ignacy
author_sort Pocrnic, Ivan
collection PubMed
description BACKGROUND: A genomic relationship matrix (GRM) can be inverted efficiently with the Algorithm for Proven and Young (APY) through recursion on a small number of core animals. The number of core animals is theoretically linked to effective population size (N (e)). In a simulation study, the optimal number of core animals was equal to the number of largest eigenvalues of GRM that explained 98% of its variation. The purpose of this study was to find the optimal number of core animals and estimate N (e) for different species. METHODS: Datasets included phenotypes, pedigrees, and genotypes for populations of Holstein, Jersey, and Angus cattle, pigs, and broiler chickens. The number of genotyped animals varied from 15,000 for broiler chickens to 77,000 for Holsteins, and the number of single-nucleotide polymorphisms used for genomic prediction varied from 37,000 to 61,000. Eigenvalue decomposition of the GRM for each population determined numbers of largest eigenvalues corresponding to 90, 95, 98, and 99% of variation. RESULTS: The number of eigenvalues corresponding to 90% (98%) of variation was 4527 (14,026) for Holstein, 3325 (11,500) for Jersey, 3654 (10,605) for Angus, 1239 (4103) for pig, and 1655 (4171) for broiler chicken. Each trait in each species was analyzed using the APY inverse of the GRM with randomly selected core animals, and their number was equal to the number of largest eigenvalues. Realized accuracies peaked with the number of core animals corresponding to 98% of variation for Holstein and Jersey and closer to 99% for other breed/species. N (e) was estimated based on comparisons of eigenvalue decomposition in a simulation study. Assuming a genome length of 30 Morgan, N (e) was equal to 149 for Holsteins, 101 for Jerseys, 113 for Angus, 32 for pigs, and 44 for broilers. CONCLUSIONS: Eigenvalue profiles of GRM for common species are similar to those in simulation studies although they are affected by number of genotyped animals and genotyping quality. For all investigated species, the APY required less than 15,000 core animals. Realized accuracies were equal or greater with the APY inverse than with regular inversion. Eigenvalue analysis of GRM can provide a realistic estimate of N (e).
format Online
Article
Text
id pubmed-5088690
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-50886902016-11-07 Dimensionality of genomic information and performance of the Algorithm for Proven and Young for different livestock species Pocrnic, Ivan Lourenco, Daniela A. L. Masuda, Yutaka Misztal, Ignacy Genet Sel Evol Research Article BACKGROUND: A genomic relationship matrix (GRM) can be inverted efficiently with the Algorithm for Proven and Young (APY) through recursion on a small number of core animals. The number of core animals is theoretically linked to effective population size (N (e)). In a simulation study, the optimal number of core animals was equal to the number of largest eigenvalues of GRM that explained 98% of its variation. The purpose of this study was to find the optimal number of core animals and estimate N (e) for different species. METHODS: Datasets included phenotypes, pedigrees, and genotypes for populations of Holstein, Jersey, and Angus cattle, pigs, and broiler chickens. The number of genotyped animals varied from 15,000 for broiler chickens to 77,000 for Holsteins, and the number of single-nucleotide polymorphisms used for genomic prediction varied from 37,000 to 61,000. Eigenvalue decomposition of the GRM for each population determined numbers of largest eigenvalues corresponding to 90, 95, 98, and 99% of variation. RESULTS: The number of eigenvalues corresponding to 90% (98%) of variation was 4527 (14,026) for Holstein, 3325 (11,500) for Jersey, 3654 (10,605) for Angus, 1239 (4103) for pig, and 1655 (4171) for broiler chicken. Each trait in each species was analyzed using the APY inverse of the GRM with randomly selected core animals, and their number was equal to the number of largest eigenvalues. Realized accuracies peaked with the number of core animals corresponding to 98% of variation for Holstein and Jersey and closer to 99% for other breed/species. N (e) was estimated based on comparisons of eigenvalue decomposition in a simulation study. Assuming a genome length of 30 Morgan, N (e) was equal to 149 for Holsteins, 101 for Jerseys, 113 for Angus, 32 for pigs, and 44 for broilers. CONCLUSIONS: Eigenvalue profiles of GRM for common species are similar to those in simulation studies although they are affected by number of genotyped animals and genotyping quality. For all investigated species, the APY required less than 15,000 core animals. Realized accuracies were equal or greater with the APY inverse than with regular inversion. Eigenvalue analysis of GRM can provide a realistic estimate of N (e). BioMed Central 2016-10-31 /pmc/articles/PMC5088690/ /pubmed/27799053 http://dx.doi.org/10.1186/s12711-016-0261-6 Text en © The Author(s) 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Pocrnic, Ivan
Lourenco, Daniela A. L.
Masuda, Yutaka
Misztal, Ignacy
Dimensionality of genomic information and performance of the Algorithm for Proven and Young for different livestock species
title Dimensionality of genomic information and performance of the Algorithm for Proven and Young for different livestock species
title_full Dimensionality of genomic information and performance of the Algorithm for Proven and Young for different livestock species
title_fullStr Dimensionality of genomic information and performance of the Algorithm for Proven and Young for different livestock species
title_full_unstemmed Dimensionality of genomic information and performance of the Algorithm for Proven and Young for different livestock species
title_short Dimensionality of genomic information and performance of the Algorithm for Proven and Young for different livestock species
title_sort dimensionality of genomic information and performance of the algorithm for proven and young for different livestock species
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5088690/
https://www.ncbi.nlm.nih.gov/pubmed/27799053
http://dx.doi.org/10.1186/s12711-016-0261-6
work_keys_str_mv AT pocrnicivan dimensionalityofgenomicinformationandperformanceofthealgorithmforprovenandyoungfordifferentlivestockspecies
AT lourencodanielaal dimensionalityofgenomicinformationandperformanceofthealgorithmforprovenandyoungfordifferentlivestockspecies
AT masudayutaka dimensionalityofgenomicinformationandperformanceofthealgorithmforprovenandyoungfordifferentlivestockspecies
AT misztalignacy dimensionalityofgenomicinformationandperformanceofthealgorithmforprovenandyoungfordifferentlivestockspecies