Cargando…

Genomic breeding value prediction using three Bayesian methods and application to reduced density marker panels

BACKGROUND: Bayesian approaches for predicting genomic breeding values (GEBV) have been proposed that allow for different variances for individual markers resulting in a shrinkage procedure that uses prior information to coerce negligible effects towards zero. These approaches have generally assumed...

Descripción completa

Detalles Bibliográficos
Autores principales: Cleveland, Matthew A, Forni, Selma, Deeb, Nader, Maltecca, Christian
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2857848/
https://www.ncbi.nlm.nih.gov/pubmed/20380760
http://dx.doi.org/10.1186/1753-6561-4-S1-S6
_version_ 1782180351650562048
author Cleveland, Matthew A
Forni, Selma
Deeb, Nader
Maltecca, Christian
author_facet Cleveland, Matthew A
Forni, Selma
Deeb, Nader
Maltecca, Christian
author_sort Cleveland, Matthew A
collection PubMed
description BACKGROUND: Bayesian approaches for predicting genomic breeding values (GEBV) have been proposed that allow for different variances for individual markers resulting in a shrinkage procedure that uses prior information to coerce negligible effects towards zero. These approaches have generally assumed application to high-density genotype data on all individuals, which may not be the case in practice. In this study, three approaches were compared for their predictive power in computing GEBV when training at high SNP marker density and predicting at high or low densities: the well- known Bayes-A, a generalization of Bayes-A where scale and degrees of freedom are estimated from the data (Student-t) and a Bayesian implementation of the Lasso method. Twelve scenarios were evaluated for predicting GEBV using low-density marker subsets, including selection of SNP based on genome spacing or size of additive effect and the inclusion of unknown genotype information in the form of genotype probabilities from pedigree and genotyped ancestors. RESULTS: The GEBV accuracy (calculated as correlation between GEBV and traditional breeding values) was highest for Lasso, followed by Student-t and then Bayes-A. When comparing GEBV to true breeding values, Student-t was most accurate, though differences were small. In general the shrinkage applied by the Lasso approach was less conservative than Bayes-A or Student-t, indicating that Lasso may be more sensitive to QTL with small effects. In the reduced-density marker subsets the ranking of the methods was generally consistent. Overall, low-density, evenly-spaced SNPs did a poor job of predicting GEBV, but SNPs selected based on additive effect size yielded accuracies similar to those at high density, even when coverage was low. The inclusion of genotype probabilities to the evenly-spaced subsets showed promising increases in accuracy and may be more useful in cases where many QTL of small effect are expected. CONCLUSIONS: In this dataset the Student-t approach slightly outperformed the other methods when predicting GEBV at both high and low density, but the Lasso method may have particular advantages in situations where many small QTL are expected. When markers were selected at low density based on genome spacing, the inclusion of genotype probabilities increased GEBV accuracy which would allow a single low- density marker panel to be used across traits.
format Text
id pubmed-2857848
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28578482010-04-22 Genomic breeding value prediction using three Bayesian methods and application to reduced density marker panels Cleveland, Matthew A Forni, Selma Deeb, Nader Maltecca, Christian BMC Proc Proceedings BACKGROUND: Bayesian approaches for predicting genomic breeding values (GEBV) have been proposed that allow for different variances for individual markers resulting in a shrinkage procedure that uses prior information to coerce negligible effects towards zero. These approaches have generally assumed application to high-density genotype data on all individuals, which may not be the case in practice. In this study, three approaches were compared for their predictive power in computing GEBV when training at high SNP marker density and predicting at high or low densities: the well- known Bayes-A, a generalization of Bayes-A where scale and degrees of freedom are estimated from the data (Student-t) and a Bayesian implementation of the Lasso method. Twelve scenarios were evaluated for predicting GEBV using low-density marker subsets, including selection of SNP based on genome spacing or size of additive effect and the inclusion of unknown genotype information in the form of genotype probabilities from pedigree and genotyped ancestors. RESULTS: The GEBV accuracy (calculated as correlation between GEBV and traditional breeding values) was highest for Lasso, followed by Student-t and then Bayes-A. When comparing GEBV to true breeding values, Student-t was most accurate, though differences were small. In general the shrinkage applied by the Lasso approach was less conservative than Bayes-A or Student-t, indicating that Lasso may be more sensitive to QTL with small effects. In the reduced-density marker subsets the ranking of the methods was generally consistent. Overall, low-density, evenly-spaced SNPs did a poor job of predicting GEBV, but SNPs selected based on additive effect size yielded accuracies similar to those at high density, even when coverage was low. The inclusion of genotype probabilities to the evenly-spaced subsets showed promising increases in accuracy and may be more useful in cases where many QTL of small effect are expected. CONCLUSIONS: In this dataset the Student-t approach slightly outperformed the other methods when predicting GEBV at both high and low density, but the Lasso method may have particular advantages in situations where many small QTL are expected. When markers were selected at low density based on genome spacing, the inclusion of genotype probabilities increased GEBV accuracy which would allow a single low- density marker panel to be used across traits. BioMed Central 2010-03-31 /pmc/articles/PMC2857848/ /pubmed/20380760 http://dx.doi.org/10.1186/1753-6561-4-S1-S6 Text en Copyright ©2010 Cleveland et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Cleveland, Matthew A
Forni, Selma
Deeb, Nader
Maltecca, Christian
Genomic breeding value prediction using three Bayesian methods and application to reduced density marker panels
title Genomic breeding value prediction using three Bayesian methods and application to reduced density marker panels
title_full Genomic breeding value prediction using three Bayesian methods and application to reduced density marker panels
title_fullStr Genomic breeding value prediction using three Bayesian methods and application to reduced density marker panels
title_full_unstemmed Genomic breeding value prediction using three Bayesian methods and application to reduced density marker panels
title_short Genomic breeding value prediction using three Bayesian methods and application to reduced density marker panels
title_sort genomic breeding value prediction using three bayesian methods and application to reduced density marker panels
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2857848/
https://www.ncbi.nlm.nih.gov/pubmed/20380760
http://dx.doi.org/10.1186/1753-6561-4-S1-S6
work_keys_str_mv AT clevelandmatthewa genomicbreedingvaluepredictionusingthreebayesianmethodsandapplicationtoreduceddensitymarkerpanels
AT forniselma genomicbreedingvaluepredictionusingthreebayesianmethodsandapplicationtoreduceddensitymarkerpanels
AT deebnader genomicbreedingvaluepredictionusingthreebayesianmethodsandapplicationtoreduceddensitymarkerpanels
AT malteccachristian genomicbreedingvaluepredictionusingthreebayesianmethodsandapplicationtoreduceddensitymarkerpanels