Cargando…
A deterministic equation to predict the accuracy of multi-population genomic prediction with multiple genomic relationship matrices
BACKGROUND: A multi-population genomic prediction (GP) model in which important pre-selected single nucleotide polymorphisms (SNPs) are differentially weighted (MPMG) has been shown to result in better prediction accuracy than a multi-population, single genomic relationship matrix ([Formula: see tex...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7189707/ https://www.ncbi.nlm.nih.gov/pubmed/32345213 http://dx.doi.org/10.1186/s12711-020-00540-y |
_version_ | 1783527554451243008 |
---|---|
author | Raymond, Biaty Wientjes, Yvonne C. J. Bouwman, Aniek C. Schrooten, Chris Veerkamp, Roel F. |
author_facet | Raymond, Biaty Wientjes, Yvonne C. J. Bouwman, Aniek C. Schrooten, Chris Veerkamp, Roel F. |
author_sort | Raymond, Biaty |
collection | PubMed |
description | BACKGROUND: A multi-population genomic prediction (GP) model in which important pre-selected single nucleotide polymorphisms (SNPs) are differentially weighted (MPMG) has been shown to result in better prediction accuracy than a multi-population, single genomic relationship matrix ([Formula: see text] ) GP model (MPSG) in which all SNPs are weighted equally. Our objective was to underpin theoretically the advantages and limits of the MPMG model over the MPSG model, by deriving and validating a deterministic prediction equation for its accuracy. METHODS: Using selection index theory, we derived an equation to predict the accuracy of estimated total genomic values of selection candidates from population [Formula: see text] ([Formula: see text] ), when individuals from two populations, [Formula: see text] and [Formula: see text] , are combined in the training population and two [Formula: see text] , made respectively from pre-selected and remaining SNPs, are fitted simultaneously in MPMG. We used simulations to validate the prediction equation in scenarios that differed in the level of genetic correlation between populations, heritability, and proportion of genetic variance explained by the pre-selected SNPs. Empirical accuracy of the MPMG model in each scenario was calculated and compared to the predicted accuracy from the equation. RESULTS: In general, the derived prediction equation resulted in accurate predictions of [Formula: see text] for the scenarios evaluated. Using the prediction equation, we showed that an important advantage of the MPMG model over the MPSG model is its ability to benefit from the small number of independent chromosome segments ([Formula: see text] ) due to the pre-selected SNPs, both within and across populations, whereas for the MPSG model, there is only a single value for [Formula: see text] , calculated based on all SNPs, which is very large. However, this advantage is dependent on the pre-selected SNPs that explain some proportion of the total genetic variance for the trait. CONCLUSIONS: We developed an equation that gives insight into why, and under which conditions the MPMG outperforms the MPSG model for GP. The equation can be used as a deterministic tool to assess the potential benefit of combining information from different populations, e.g., different breeds or lines for GP in livestock or plants, or different groups of people based on their ethnic background for prediction of disease risk scores. |
format | Online Article Text |
id | pubmed-7189707 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-71897072020-05-04 A deterministic equation to predict the accuracy of multi-population genomic prediction with multiple genomic relationship matrices Raymond, Biaty Wientjes, Yvonne C. J. Bouwman, Aniek C. Schrooten, Chris Veerkamp, Roel F. Genet Sel Evol Research Article BACKGROUND: A multi-population genomic prediction (GP) model in which important pre-selected single nucleotide polymorphisms (SNPs) are differentially weighted (MPMG) has been shown to result in better prediction accuracy than a multi-population, single genomic relationship matrix ([Formula: see text] ) GP model (MPSG) in which all SNPs are weighted equally. Our objective was to underpin theoretically the advantages and limits of the MPMG model over the MPSG model, by deriving and validating a deterministic prediction equation for its accuracy. METHODS: Using selection index theory, we derived an equation to predict the accuracy of estimated total genomic values of selection candidates from population [Formula: see text] ([Formula: see text] ), when individuals from two populations, [Formula: see text] and [Formula: see text] , are combined in the training population and two [Formula: see text] , made respectively from pre-selected and remaining SNPs, are fitted simultaneously in MPMG. We used simulations to validate the prediction equation in scenarios that differed in the level of genetic correlation between populations, heritability, and proportion of genetic variance explained by the pre-selected SNPs. Empirical accuracy of the MPMG model in each scenario was calculated and compared to the predicted accuracy from the equation. RESULTS: In general, the derived prediction equation resulted in accurate predictions of [Formula: see text] for the scenarios evaluated. Using the prediction equation, we showed that an important advantage of the MPMG model over the MPSG model is its ability to benefit from the small number of independent chromosome segments ([Formula: see text] ) due to the pre-selected SNPs, both within and across populations, whereas for the MPSG model, there is only a single value for [Formula: see text] , calculated based on all SNPs, which is very large. However, this advantage is dependent on the pre-selected SNPs that explain some proportion of the total genetic variance for the trait. CONCLUSIONS: We developed an equation that gives insight into why, and under which conditions the MPMG outperforms the MPSG model for GP. The equation can be used as a deterministic tool to assess the potential benefit of combining information from different populations, e.g., different breeds or lines for GP in livestock or plants, or different groups of people based on their ethnic background for prediction of disease risk scores. BioMed Central 2020-04-28 /pmc/articles/PMC7189707/ /pubmed/32345213 http://dx.doi.org/10.1186/s12711-020-00540-y Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Raymond, Biaty Wientjes, Yvonne C. J. Bouwman, Aniek C. Schrooten, Chris Veerkamp, Roel F. A deterministic equation to predict the accuracy of multi-population genomic prediction with multiple genomic relationship matrices |
title | A deterministic equation to predict the accuracy of multi-population genomic prediction with multiple genomic relationship matrices |
title_full | A deterministic equation to predict the accuracy of multi-population genomic prediction with multiple genomic relationship matrices |
title_fullStr | A deterministic equation to predict the accuracy of multi-population genomic prediction with multiple genomic relationship matrices |
title_full_unstemmed | A deterministic equation to predict the accuracy of multi-population genomic prediction with multiple genomic relationship matrices |
title_short | A deterministic equation to predict the accuracy of multi-population genomic prediction with multiple genomic relationship matrices |
title_sort | deterministic equation to predict the accuracy of multi-population genomic prediction with multiple genomic relationship matrices |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7189707/ https://www.ncbi.nlm.nih.gov/pubmed/32345213 http://dx.doi.org/10.1186/s12711-020-00540-y |
work_keys_str_mv | AT raymondbiaty adeterministicequationtopredicttheaccuracyofmultipopulationgenomicpredictionwithmultiplegenomicrelationshipmatrices AT wientjesyvonnecj adeterministicequationtopredicttheaccuracyofmultipopulationgenomicpredictionwithmultiplegenomicrelationshipmatrices AT bouwmananiekc adeterministicequationtopredicttheaccuracyofmultipopulationgenomicpredictionwithmultiplegenomicrelationshipmatrices AT schrootenchris adeterministicequationtopredicttheaccuracyofmultipopulationgenomicpredictionwithmultiplegenomicrelationshipmatrices AT veerkamproelf adeterministicequationtopredicttheaccuracyofmultipopulationgenomicpredictionwithmultiplegenomicrelationshipmatrices AT raymondbiaty deterministicequationtopredicttheaccuracyofmultipopulationgenomicpredictionwithmultiplegenomicrelationshipmatrices AT wientjesyvonnecj deterministicequationtopredicttheaccuracyofmultipopulationgenomicpredictionwithmultiplegenomicrelationshipmatrices AT bouwmananiekc deterministicequationtopredicttheaccuracyofmultipopulationgenomicpredictionwithmultiplegenomicrelationshipmatrices AT schrootenchris deterministicequationtopredicttheaccuracyofmultipopulationgenomicpredictionwithmultiplegenomicrelationshipmatrices AT veerkamproelf deterministicequationtopredicttheaccuracyofmultipopulationgenomicpredictionwithmultiplegenomicrelationshipmatrices |