Cargando…

Approximated prediction of genomic selection accuracy when reference and candidate populations are related

BACKGROUND: Genomic selection is still to be evaluated and optimized in many species. Mathematical modeling of selection schemes prior to their implementation is a classical and useful tool for that purpose. These models include formalization of a number of entities including the precision of the es...

Descripción completa

Detalles Bibliográficos
Autor principal: Elsen, Jean-Michel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4778372/
https://www.ncbi.nlm.nih.gov/pubmed/26940536
http://dx.doi.org/10.1186/s12711-016-0183-3
_version_ 1782419455598395392
author Elsen, Jean-Michel
author_facet Elsen, Jean-Michel
author_sort Elsen, Jean-Michel
collection PubMed
description BACKGROUND: Genomic selection is still to be evaluated and optimized in many species. Mathematical modeling of selection schemes prior to their implementation is a classical and useful tool for that purpose. These models include formalization of a number of entities including the precision of the estimated breeding value. To model genomic selection schemes, equations that predict this reliability as a function of factors such as the size of the reference population, its diversity, its genetic distance from the group of selection candidates genotyped, number of markers and strength of linkage disequilibrium are needed. The present paper aims at exploring new approximations of this reliability. RESULTS: Two alternative approximations are proposed for the estimation of the reliability of genomic estimated breeding values (GEBV) in the case of non-independence between candidate and reference populations. Both were derived from the Taylor series heuristic approach suggested by Goddard in 2009. A numerical exploration of their properties showed that the series were not equivalent in terms of convergence to the exact reliability, that the approximations may overestimate the precision of GEBV and that they converged towards their theoretical expectations. Formulae derived for these approximations were simple to handle in the case of independent markers. A few parameters that describe the markers’ genotypic variability (allele frequencies, linkage disequilibrium) can be estimated from genomic data corresponding to the population of interest or after making assumptions about their distribution. When markers are not in linkage equilibrium, replacing the real number of markers and QTL by the “effective number of independent loci”, as proposed earlier is a practical solution. In this paper, we considered an alternative, i.e. an “equivalent number of independent loci” which would give a GEBV reliability for unrelated individuals by considering a sub-set of independent markers that is identical to the reliability obtained by considering the full set of markers. CONCLUSIONS: This paper is a further step towards the development of deterministic models that describe breeding plans based on the use of genomic information. Such deterministic models carry low computational burden, which allows design optimization through intensive numerical exploration. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12711-016-0183-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4778372
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-47783722016-03-05 Approximated prediction of genomic selection accuracy when reference and candidate populations are related Elsen, Jean-Michel Genet Sel Evol Research Article BACKGROUND: Genomic selection is still to be evaluated and optimized in many species. Mathematical modeling of selection schemes prior to their implementation is a classical and useful tool for that purpose. These models include formalization of a number of entities including the precision of the estimated breeding value. To model genomic selection schemes, equations that predict this reliability as a function of factors such as the size of the reference population, its diversity, its genetic distance from the group of selection candidates genotyped, number of markers and strength of linkage disequilibrium are needed. The present paper aims at exploring new approximations of this reliability. RESULTS: Two alternative approximations are proposed for the estimation of the reliability of genomic estimated breeding values (GEBV) in the case of non-independence between candidate and reference populations. Both were derived from the Taylor series heuristic approach suggested by Goddard in 2009. A numerical exploration of their properties showed that the series were not equivalent in terms of convergence to the exact reliability, that the approximations may overestimate the precision of GEBV and that they converged towards their theoretical expectations. Formulae derived for these approximations were simple to handle in the case of independent markers. A few parameters that describe the markers’ genotypic variability (allele frequencies, linkage disequilibrium) can be estimated from genomic data corresponding to the population of interest or after making assumptions about their distribution. When markers are not in linkage equilibrium, replacing the real number of markers and QTL by the “effective number of independent loci”, as proposed earlier is a practical solution. In this paper, we considered an alternative, i.e. an “equivalent number of independent loci” which would give a GEBV reliability for unrelated individuals by considering a sub-set of independent markers that is identical to the reliability obtained by considering the full set of markers. CONCLUSIONS: This paper is a further step towards the development of deterministic models that describe breeding plans based on the use of genomic information. Such deterministic models carry low computational burden, which allows design optimization through intensive numerical exploration. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12711-016-0183-3) contains supplementary material, which is available to authorized users. BioMed Central 2016-03-03 /pmc/articles/PMC4778372/ /pubmed/26940536 http://dx.doi.org/10.1186/s12711-016-0183-3 Text en © Elsen. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Elsen, Jean-Michel
Approximated prediction of genomic selection accuracy when reference and candidate populations are related
title Approximated prediction of genomic selection accuracy when reference and candidate populations are related
title_full Approximated prediction of genomic selection accuracy when reference and candidate populations are related
title_fullStr Approximated prediction of genomic selection accuracy when reference and candidate populations are related
title_full_unstemmed Approximated prediction of genomic selection accuracy when reference and candidate populations are related
title_short Approximated prediction of genomic selection accuracy when reference and candidate populations are related
title_sort approximated prediction of genomic selection accuracy when reference and candidate populations are related
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4778372/
https://www.ncbi.nlm.nih.gov/pubmed/26940536
http://dx.doi.org/10.1186/s12711-016-0183-3
work_keys_str_mv AT elsenjeanmichel approximatedpredictionofgenomicselectionaccuracywhenreferenceandcandidatepopulationsarerelated