Cargando…
Dimensionality of genomic information and performance of the Algorithm for Proven and Young for different livestock species
BACKGROUND: A genomic relationship matrix (GRM) can be inverted efficiently with the Algorithm for Proven and Young (APY) through recursion on a small number of core animals. The number of core animals is theoretically linked to effective population size (N (e)). In a simulation study, the optimal n...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5088690/ https://www.ncbi.nlm.nih.gov/pubmed/27799053 http://dx.doi.org/10.1186/s12711-016-0261-6 |
_version_ | 1782464147530711040 |
---|---|
author | Pocrnic, Ivan Lourenco, Daniela A. L. Masuda, Yutaka Misztal, Ignacy |
author_facet | Pocrnic, Ivan Lourenco, Daniela A. L. Masuda, Yutaka Misztal, Ignacy |
author_sort | Pocrnic, Ivan |
collection | PubMed |
description | BACKGROUND: A genomic relationship matrix (GRM) can be inverted efficiently with the Algorithm for Proven and Young (APY) through recursion on a small number of core animals. The number of core animals is theoretically linked to effective population size (N (e)). In a simulation study, the optimal number of core animals was equal to the number of largest eigenvalues of GRM that explained 98% of its variation. The purpose of this study was to find the optimal number of core animals and estimate N (e) for different species. METHODS: Datasets included phenotypes, pedigrees, and genotypes for populations of Holstein, Jersey, and Angus cattle, pigs, and broiler chickens. The number of genotyped animals varied from 15,000 for broiler chickens to 77,000 for Holsteins, and the number of single-nucleotide polymorphisms used for genomic prediction varied from 37,000 to 61,000. Eigenvalue decomposition of the GRM for each population determined numbers of largest eigenvalues corresponding to 90, 95, 98, and 99% of variation. RESULTS: The number of eigenvalues corresponding to 90% (98%) of variation was 4527 (14,026) for Holstein, 3325 (11,500) for Jersey, 3654 (10,605) for Angus, 1239 (4103) for pig, and 1655 (4171) for broiler chicken. Each trait in each species was analyzed using the APY inverse of the GRM with randomly selected core animals, and their number was equal to the number of largest eigenvalues. Realized accuracies peaked with the number of core animals corresponding to 98% of variation for Holstein and Jersey and closer to 99% for other breed/species. N (e) was estimated based on comparisons of eigenvalue decomposition in a simulation study. Assuming a genome length of 30 Morgan, N (e) was equal to 149 for Holsteins, 101 for Jerseys, 113 for Angus, 32 for pigs, and 44 for broilers. CONCLUSIONS: Eigenvalue profiles of GRM for common species are similar to those in simulation studies although they are affected by number of genotyped animals and genotyping quality. For all investigated species, the APY required less than 15,000 core animals. Realized accuracies were equal or greater with the APY inverse than with regular inversion. Eigenvalue analysis of GRM can provide a realistic estimate of N (e). |
format | Online Article Text |
id | pubmed-5088690 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-50886902016-11-07 Dimensionality of genomic information and performance of the Algorithm for Proven and Young for different livestock species Pocrnic, Ivan Lourenco, Daniela A. L. Masuda, Yutaka Misztal, Ignacy Genet Sel Evol Research Article BACKGROUND: A genomic relationship matrix (GRM) can be inverted efficiently with the Algorithm for Proven and Young (APY) through recursion on a small number of core animals. The number of core animals is theoretically linked to effective population size (N (e)). In a simulation study, the optimal number of core animals was equal to the number of largest eigenvalues of GRM that explained 98% of its variation. The purpose of this study was to find the optimal number of core animals and estimate N (e) for different species. METHODS: Datasets included phenotypes, pedigrees, and genotypes for populations of Holstein, Jersey, and Angus cattle, pigs, and broiler chickens. The number of genotyped animals varied from 15,000 for broiler chickens to 77,000 for Holsteins, and the number of single-nucleotide polymorphisms used for genomic prediction varied from 37,000 to 61,000. Eigenvalue decomposition of the GRM for each population determined numbers of largest eigenvalues corresponding to 90, 95, 98, and 99% of variation. RESULTS: The number of eigenvalues corresponding to 90% (98%) of variation was 4527 (14,026) for Holstein, 3325 (11,500) for Jersey, 3654 (10,605) for Angus, 1239 (4103) for pig, and 1655 (4171) for broiler chicken. Each trait in each species was analyzed using the APY inverse of the GRM with randomly selected core animals, and their number was equal to the number of largest eigenvalues. Realized accuracies peaked with the number of core animals corresponding to 98% of variation for Holstein and Jersey and closer to 99% for other breed/species. N (e) was estimated based on comparisons of eigenvalue decomposition in a simulation study. Assuming a genome length of 30 Morgan, N (e) was equal to 149 for Holsteins, 101 for Jerseys, 113 for Angus, 32 for pigs, and 44 for broilers. CONCLUSIONS: Eigenvalue profiles of GRM for common species are similar to those in simulation studies although they are affected by number of genotyped animals and genotyping quality. For all investigated species, the APY required less than 15,000 core animals. Realized accuracies were equal or greater with the APY inverse than with regular inversion. Eigenvalue analysis of GRM can provide a realistic estimate of N (e). BioMed Central 2016-10-31 /pmc/articles/PMC5088690/ /pubmed/27799053 http://dx.doi.org/10.1186/s12711-016-0261-6 Text en © The Author(s) 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Pocrnic, Ivan Lourenco, Daniela A. L. Masuda, Yutaka Misztal, Ignacy Dimensionality of genomic information and performance of the Algorithm for Proven and Young for different livestock species |
title | Dimensionality of genomic information and performance of the Algorithm for Proven and Young for different livestock species |
title_full | Dimensionality of genomic information and performance of the Algorithm for Proven and Young for different livestock species |
title_fullStr | Dimensionality of genomic information and performance of the Algorithm for Proven and Young for different livestock species |
title_full_unstemmed | Dimensionality of genomic information and performance of the Algorithm for Proven and Young for different livestock species |
title_short | Dimensionality of genomic information and performance of the Algorithm for Proven and Young for different livestock species |
title_sort | dimensionality of genomic information and performance of the algorithm for proven and young for different livestock species |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5088690/ https://www.ncbi.nlm.nih.gov/pubmed/27799053 http://dx.doi.org/10.1186/s12711-016-0261-6 |
work_keys_str_mv | AT pocrnicivan dimensionalityofgenomicinformationandperformanceofthealgorithmforprovenandyoungfordifferentlivestockspecies AT lourencodanielaal dimensionalityofgenomicinformationandperformanceofthealgorithmforprovenandyoungfordifferentlivestockspecies AT masudayutaka dimensionalityofgenomicinformationandperformanceofthealgorithmforprovenandyoungfordifferentlivestockspecies AT misztalignacy dimensionalityofgenomicinformationandperformanceofthealgorithmforprovenandyoungfordifferentlivestockspecies |