Cargando…

A deterministic equation to predict the accuracy of multi-population genomic prediction with multiple genomic relationship matrices

BACKGROUND: A multi-population genomic prediction (GP) model in which important pre-selected single nucleotide polymorphisms (SNPs) are differentially weighted (MPMG) has been shown to result in better prediction accuracy than a multi-population, single genomic relationship matrix ([Formula: see tex...

Descripción completa

Detalles Bibliográficos
Autores principales: Raymond, Biaty, Wientjes, Yvonne C. J., Bouwman, Aniek C., Schrooten, Chris, Veerkamp, Roel F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7189707/
https://www.ncbi.nlm.nih.gov/pubmed/32345213
http://dx.doi.org/10.1186/s12711-020-00540-y
_version_ 1783527554451243008
author Raymond, Biaty
Wientjes, Yvonne C. J.
Bouwman, Aniek C.
Schrooten, Chris
Veerkamp, Roel F.
author_facet Raymond, Biaty
Wientjes, Yvonne C. J.
Bouwman, Aniek C.
Schrooten, Chris
Veerkamp, Roel F.
author_sort Raymond, Biaty
collection PubMed
description BACKGROUND: A multi-population genomic prediction (GP) model in which important pre-selected single nucleotide polymorphisms (SNPs) are differentially weighted (MPMG) has been shown to result in better prediction accuracy than a multi-population, single genomic relationship matrix ([Formula: see text] ) GP model (MPSG) in which all SNPs are weighted equally. Our objective was to underpin theoretically the advantages and limits of the MPMG model over the MPSG model, by deriving and validating a deterministic prediction equation for its accuracy. METHODS: Using selection index theory, we derived an equation to predict the accuracy of estimated total genomic values of selection candidates from population [Formula: see text] ([Formula: see text] ), when individuals from two populations, [Formula: see text] and [Formula: see text] , are combined in the training population and two [Formula: see text] , made respectively from pre-selected and remaining SNPs, are fitted simultaneously in MPMG. We used simulations to validate the prediction equation in scenarios that differed in the level of genetic correlation between populations, heritability, and proportion of genetic variance explained by the pre-selected SNPs. Empirical accuracy of the MPMG model in each scenario was calculated and compared to the predicted accuracy from the equation. RESULTS: In general, the derived prediction equation resulted in accurate predictions of [Formula: see text] for the scenarios evaluated. Using the prediction equation, we showed that an important advantage of the MPMG model over the MPSG model is its ability to benefit from the small number of independent chromosome segments ([Formula: see text] ) due to the pre-selected SNPs, both within and across populations, whereas for the MPSG model, there is only a single value for [Formula: see text] , calculated based on all SNPs, which is very large. However, this advantage is dependent on the pre-selected SNPs that explain some proportion of the total genetic variance for the trait. CONCLUSIONS: We developed an equation that gives insight into why, and under which conditions the MPMG outperforms the MPSG model for GP. The equation can be used as a deterministic tool to assess the potential benefit of combining information from different populations, e.g., different breeds or lines for GP in livestock or plants, or different groups of people based on their ethnic background for prediction of disease risk scores.
format Online
Article
Text
id pubmed-7189707
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-71897072020-05-04 A deterministic equation to predict the accuracy of multi-population genomic prediction with multiple genomic relationship matrices Raymond, Biaty Wientjes, Yvonne C. J. Bouwman, Aniek C. Schrooten, Chris Veerkamp, Roel F. Genet Sel Evol Research Article BACKGROUND: A multi-population genomic prediction (GP) model in which important pre-selected single nucleotide polymorphisms (SNPs) are differentially weighted (MPMG) has been shown to result in better prediction accuracy than a multi-population, single genomic relationship matrix ([Formula: see text] ) GP model (MPSG) in which all SNPs are weighted equally. Our objective was to underpin theoretically the advantages and limits of the MPMG model over the MPSG model, by deriving and validating a deterministic prediction equation for its accuracy. METHODS: Using selection index theory, we derived an equation to predict the accuracy of estimated total genomic values of selection candidates from population [Formula: see text] ([Formula: see text] ), when individuals from two populations, [Formula: see text] and [Formula: see text] , are combined in the training population and two [Formula: see text] , made respectively from pre-selected and remaining SNPs, are fitted simultaneously in MPMG. We used simulations to validate the prediction equation in scenarios that differed in the level of genetic correlation between populations, heritability, and proportion of genetic variance explained by the pre-selected SNPs. Empirical accuracy of the MPMG model in each scenario was calculated and compared to the predicted accuracy from the equation. RESULTS: In general, the derived prediction equation resulted in accurate predictions of [Formula: see text] for the scenarios evaluated. Using the prediction equation, we showed that an important advantage of the MPMG model over the MPSG model is its ability to benefit from the small number of independent chromosome segments ([Formula: see text] ) due to the pre-selected SNPs, both within and across populations, whereas for the MPSG model, there is only a single value for [Formula: see text] , calculated based on all SNPs, which is very large. However, this advantage is dependent on the pre-selected SNPs that explain some proportion of the total genetic variance for the trait. CONCLUSIONS: We developed an equation that gives insight into why, and under which conditions the MPMG outperforms the MPSG model for GP. The equation can be used as a deterministic tool to assess the potential benefit of combining information from different populations, e.g., different breeds or lines for GP in livestock or plants, or different groups of people based on their ethnic background for prediction of disease risk scores. BioMed Central 2020-04-28 /pmc/articles/PMC7189707/ /pubmed/32345213 http://dx.doi.org/10.1186/s12711-020-00540-y Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Raymond, Biaty
Wientjes, Yvonne C. J.
Bouwman, Aniek C.
Schrooten, Chris
Veerkamp, Roel F.
A deterministic equation to predict the accuracy of multi-population genomic prediction with multiple genomic relationship matrices
title A deterministic equation to predict the accuracy of multi-population genomic prediction with multiple genomic relationship matrices
title_full A deterministic equation to predict the accuracy of multi-population genomic prediction with multiple genomic relationship matrices
title_fullStr A deterministic equation to predict the accuracy of multi-population genomic prediction with multiple genomic relationship matrices
title_full_unstemmed A deterministic equation to predict the accuracy of multi-population genomic prediction with multiple genomic relationship matrices
title_short A deterministic equation to predict the accuracy of multi-population genomic prediction with multiple genomic relationship matrices
title_sort deterministic equation to predict the accuracy of multi-population genomic prediction with multiple genomic relationship matrices
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7189707/
https://www.ncbi.nlm.nih.gov/pubmed/32345213
http://dx.doi.org/10.1186/s12711-020-00540-y
work_keys_str_mv AT raymondbiaty adeterministicequationtopredicttheaccuracyofmultipopulationgenomicpredictionwithmultiplegenomicrelationshipmatrices
AT wientjesyvonnecj adeterministicequationtopredicttheaccuracyofmultipopulationgenomicpredictionwithmultiplegenomicrelationshipmatrices
AT bouwmananiekc adeterministicequationtopredicttheaccuracyofmultipopulationgenomicpredictionwithmultiplegenomicrelationshipmatrices
AT schrootenchris adeterministicequationtopredicttheaccuracyofmultipopulationgenomicpredictionwithmultiplegenomicrelationshipmatrices
AT veerkamproelf adeterministicequationtopredicttheaccuracyofmultipopulationgenomicpredictionwithmultiplegenomicrelationshipmatrices
AT raymondbiaty deterministicequationtopredicttheaccuracyofmultipopulationgenomicpredictionwithmultiplegenomicrelationshipmatrices
AT wientjesyvonnecj deterministicequationtopredicttheaccuracyofmultipopulationgenomicpredictionwithmultiplegenomicrelationshipmatrices
AT bouwmananiekc deterministicequationtopredicttheaccuracyofmultipopulationgenomicpredictionwithmultiplegenomicrelationshipmatrices
AT schrootenchris deterministicequationtopredicttheaccuracyofmultipopulationgenomicpredictionwithmultiplegenomicrelationshipmatrices
AT veerkamproelf deterministicequationtopredicttheaccuracyofmultipopulationgenomicpredictionwithmultiplegenomicrelationshipmatrices