Cargando…
Accuracy of genomic BLUP when considering a genomic relationship matrix based on the number of the largest eigenvalues: a simulation study
BACKGROUND: The dimensionality of genomic information is limited by the number of independent chromosome segments (M(e)), which is a function of the effective population size. This dimensionality can be determined approximately by singular value decomposition of the gene content matrix, by eigenvalu...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6907194/ https://www.ncbi.nlm.nih.gov/pubmed/31830899 http://dx.doi.org/10.1186/s12711-019-0516-0 |
_version_ | 1783478500027531264 |
---|---|
author | Pocrnic, Ivan Lourenco, Daniela A. L. Masuda, Yutaka Misztal, Ignacy |
author_facet | Pocrnic, Ivan Lourenco, Daniela A. L. Masuda, Yutaka Misztal, Ignacy |
author_sort | Pocrnic, Ivan |
collection | PubMed |
description | BACKGROUND: The dimensionality of genomic information is limited by the number of independent chromosome segments (M(e)), which is a function of the effective population size. This dimensionality can be determined approximately by singular value decomposition of the gene content matrix, by eigenvalue decomposition of the genomic relationship matrix (GRM), or by the number of core animals in the algorithm for proven and young (APY) that maximizes the accuracy of genomic prediction. In the latter, core animals act as proxies to linear combinations of M(e). Field studies indicate that a moderate accuracy of genomic selection is achieved with a small dataset, but that further improvement of the accuracy requires much more data. When only one quarter of the optimal number of core animals are used in the APY algorithm, the accuracy of genomic selection is only slightly below the optimal value. This suggests that genomic selection works on clusters of M(e). RESULTS: The simulation included datasets with different population sizes and amounts of phenotypic information. Computations were done by genomic best linear unbiased prediction (GBLUP) with selected eigenvalues and corresponding eigenvectors of the GRM set to zero. About four eigenvalues in the GRM explained 10% of the genomic variation, and less than 2% of the total eigenvalues explained 50% of the genomic variation. With limited phenotypic information, the accuracy of GBLUP was close to the peak where most of the smallest eigenvalues were set to zero. With a large amount of phenotypic information, accuracy increased as smaller eigenvalues were added. CONCLUSIONS: A small amount of phenotypic data is sufficient to estimate only the effects of the largest eigenvalues and the associated eigenvectors that contain a large fraction of the genomic information, and a very large amount of data is required to estimate the remaining eigenvalues that account for a limited amount of genomic information. Core animals in the APY algorithm act as proxies of almost the same number of eigenvalues. By using an eigenvalues-based approach, it was possible to explain why the moderate accuracy of genomic selection based on small datasets only increases slowly as more data are added. |
format | Online Article Text |
id | pubmed-6907194 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-69071942019-12-20 Accuracy of genomic BLUP when considering a genomic relationship matrix based on the number of the largest eigenvalues: a simulation study Pocrnic, Ivan Lourenco, Daniela A. L. Masuda, Yutaka Misztal, Ignacy Genet Sel Evol Research Article BACKGROUND: The dimensionality of genomic information is limited by the number of independent chromosome segments (M(e)), which is a function of the effective population size. This dimensionality can be determined approximately by singular value decomposition of the gene content matrix, by eigenvalue decomposition of the genomic relationship matrix (GRM), or by the number of core animals in the algorithm for proven and young (APY) that maximizes the accuracy of genomic prediction. In the latter, core animals act as proxies to linear combinations of M(e). Field studies indicate that a moderate accuracy of genomic selection is achieved with a small dataset, but that further improvement of the accuracy requires much more data. When only one quarter of the optimal number of core animals are used in the APY algorithm, the accuracy of genomic selection is only slightly below the optimal value. This suggests that genomic selection works on clusters of M(e). RESULTS: The simulation included datasets with different population sizes and amounts of phenotypic information. Computations were done by genomic best linear unbiased prediction (GBLUP) with selected eigenvalues and corresponding eigenvectors of the GRM set to zero. About four eigenvalues in the GRM explained 10% of the genomic variation, and less than 2% of the total eigenvalues explained 50% of the genomic variation. With limited phenotypic information, the accuracy of GBLUP was close to the peak where most of the smallest eigenvalues were set to zero. With a large amount of phenotypic information, accuracy increased as smaller eigenvalues were added. CONCLUSIONS: A small amount of phenotypic data is sufficient to estimate only the effects of the largest eigenvalues and the associated eigenvectors that contain a large fraction of the genomic information, and a very large amount of data is required to estimate the remaining eigenvalues that account for a limited amount of genomic information. Core animals in the APY algorithm act as proxies of almost the same number of eigenvalues. By using an eigenvalues-based approach, it was possible to explain why the moderate accuracy of genomic selection based on small datasets only increases slowly as more data are added. BioMed Central 2019-12-12 /pmc/articles/PMC6907194/ /pubmed/31830899 http://dx.doi.org/10.1186/s12711-019-0516-0 Text en © The Author(s) 2019 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Pocrnic, Ivan Lourenco, Daniela A. L. Masuda, Yutaka Misztal, Ignacy Accuracy of genomic BLUP when considering a genomic relationship matrix based on the number of the largest eigenvalues: a simulation study |
title | Accuracy of genomic BLUP when considering a genomic relationship matrix based on the number of the largest eigenvalues: a simulation study |
title_full | Accuracy of genomic BLUP when considering a genomic relationship matrix based on the number of the largest eigenvalues: a simulation study |
title_fullStr | Accuracy of genomic BLUP when considering a genomic relationship matrix based on the number of the largest eigenvalues: a simulation study |
title_full_unstemmed | Accuracy of genomic BLUP when considering a genomic relationship matrix based on the number of the largest eigenvalues: a simulation study |
title_short | Accuracy of genomic BLUP when considering a genomic relationship matrix based on the number of the largest eigenvalues: a simulation study |
title_sort | accuracy of genomic blup when considering a genomic relationship matrix based on the number of the largest eigenvalues: a simulation study |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6907194/ https://www.ncbi.nlm.nih.gov/pubmed/31830899 http://dx.doi.org/10.1186/s12711-019-0516-0 |
work_keys_str_mv | AT pocrnicivan accuracyofgenomicblupwhenconsideringagenomicrelationshipmatrixbasedonthenumberofthelargesteigenvaluesasimulationstudy AT lourencodanielaal accuracyofgenomicblupwhenconsideringagenomicrelationshipmatrixbasedonthenumberofthelargesteigenvaluesasimulationstudy AT masudayutaka accuracyofgenomicblupwhenconsideringagenomicrelationshipmatrixbasedonthenumberofthelargesteigenvaluesasimulationstudy AT misztalignacy accuracyofgenomicblupwhenconsideringagenomicrelationshipmatrixbasedonthenumberofthelargesteigenvaluesasimulationstudy |