Cargando…
Genomic breeding value prediction using three Bayesian methods and application to reduced density marker panels
BACKGROUND: Bayesian approaches for predicting genomic breeding values (GEBV) have been proposed that allow for different variances for individual markers resulting in a shrinkage procedure that uses prior information to coerce negligible effects towards zero. These approaches have generally assumed...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2857848/ https://www.ncbi.nlm.nih.gov/pubmed/20380760 http://dx.doi.org/10.1186/1753-6561-4-S1-S6 |
_version_ | 1782180351650562048 |
---|---|
author | Cleveland, Matthew A Forni, Selma Deeb, Nader Maltecca, Christian |
author_facet | Cleveland, Matthew A Forni, Selma Deeb, Nader Maltecca, Christian |
author_sort | Cleveland, Matthew A |
collection | PubMed |
description | BACKGROUND: Bayesian approaches for predicting genomic breeding values (GEBV) have been proposed that allow for different variances for individual markers resulting in a shrinkage procedure that uses prior information to coerce negligible effects towards zero. These approaches have generally assumed application to high-density genotype data on all individuals, which may not be the case in practice. In this study, three approaches were compared for their predictive power in computing GEBV when training at high SNP marker density and predicting at high or low densities: the well- known Bayes-A, a generalization of Bayes-A where scale and degrees of freedom are estimated from the data (Student-t) and a Bayesian implementation of the Lasso method. Twelve scenarios were evaluated for predicting GEBV using low-density marker subsets, including selection of SNP based on genome spacing or size of additive effect and the inclusion of unknown genotype information in the form of genotype probabilities from pedigree and genotyped ancestors. RESULTS: The GEBV accuracy (calculated as correlation between GEBV and traditional breeding values) was highest for Lasso, followed by Student-t and then Bayes-A. When comparing GEBV to true breeding values, Student-t was most accurate, though differences were small. In general the shrinkage applied by the Lasso approach was less conservative than Bayes-A or Student-t, indicating that Lasso may be more sensitive to QTL with small effects. In the reduced-density marker subsets the ranking of the methods was generally consistent. Overall, low-density, evenly-spaced SNPs did a poor job of predicting GEBV, but SNPs selected based on additive effect size yielded accuracies similar to those at high density, even when coverage was low. The inclusion of genotype probabilities to the evenly-spaced subsets showed promising increases in accuracy and may be more useful in cases where many QTL of small effect are expected. CONCLUSIONS: In this dataset the Student-t approach slightly outperformed the other methods when predicting GEBV at both high and low density, but the Lasso method may have particular advantages in situations where many small QTL are expected. When markers were selected at low density based on genome spacing, the inclusion of genotype probabilities increased GEBV accuracy which would allow a single low- density marker panel to be used across traits. |
format | Text |
id | pubmed-2857848 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-28578482010-04-22 Genomic breeding value prediction using three Bayesian methods and application to reduced density marker panels Cleveland, Matthew A Forni, Selma Deeb, Nader Maltecca, Christian BMC Proc Proceedings BACKGROUND: Bayesian approaches for predicting genomic breeding values (GEBV) have been proposed that allow for different variances for individual markers resulting in a shrinkage procedure that uses prior information to coerce negligible effects towards zero. These approaches have generally assumed application to high-density genotype data on all individuals, which may not be the case in practice. In this study, three approaches were compared for their predictive power in computing GEBV when training at high SNP marker density and predicting at high or low densities: the well- known Bayes-A, a generalization of Bayes-A where scale and degrees of freedom are estimated from the data (Student-t) and a Bayesian implementation of the Lasso method. Twelve scenarios were evaluated for predicting GEBV using low-density marker subsets, including selection of SNP based on genome spacing or size of additive effect and the inclusion of unknown genotype information in the form of genotype probabilities from pedigree and genotyped ancestors. RESULTS: The GEBV accuracy (calculated as correlation between GEBV and traditional breeding values) was highest for Lasso, followed by Student-t and then Bayes-A. When comparing GEBV to true breeding values, Student-t was most accurate, though differences were small. In general the shrinkage applied by the Lasso approach was less conservative than Bayes-A or Student-t, indicating that Lasso may be more sensitive to QTL with small effects. In the reduced-density marker subsets the ranking of the methods was generally consistent. Overall, low-density, evenly-spaced SNPs did a poor job of predicting GEBV, but SNPs selected based on additive effect size yielded accuracies similar to those at high density, even when coverage was low. The inclusion of genotype probabilities to the evenly-spaced subsets showed promising increases in accuracy and may be more useful in cases where many QTL of small effect are expected. CONCLUSIONS: In this dataset the Student-t approach slightly outperformed the other methods when predicting GEBV at both high and low density, but the Lasso method may have particular advantages in situations where many small QTL are expected. When markers were selected at low density based on genome spacing, the inclusion of genotype probabilities increased GEBV accuracy which would allow a single low- density marker panel to be used across traits. BioMed Central 2010-03-31 /pmc/articles/PMC2857848/ /pubmed/20380760 http://dx.doi.org/10.1186/1753-6561-4-S1-S6 Text en Copyright ©2010 Cleveland et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Cleveland, Matthew A Forni, Selma Deeb, Nader Maltecca, Christian Genomic breeding value prediction using three Bayesian methods and application to reduced density marker panels |
title | Genomic breeding value prediction using three Bayesian methods and application to reduced density marker panels |
title_full | Genomic breeding value prediction using three Bayesian methods and application to reduced density marker panels |
title_fullStr | Genomic breeding value prediction using three Bayesian methods and application to reduced density marker panels |
title_full_unstemmed | Genomic breeding value prediction using three Bayesian methods and application to reduced density marker panels |
title_short | Genomic breeding value prediction using three Bayesian methods and application to reduced density marker panels |
title_sort | genomic breeding value prediction using three bayesian methods and application to reduced density marker panels |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2857848/ https://www.ncbi.nlm.nih.gov/pubmed/20380760 http://dx.doi.org/10.1186/1753-6561-4-S1-S6 |
work_keys_str_mv | AT clevelandmatthewa genomicbreedingvaluepredictionusingthreebayesianmethodsandapplicationtoreduceddensitymarkerpanels AT forniselma genomicbreedingvaluepredictionusingthreebayesianmethodsandapplicationtoreduceddensitymarkerpanels AT deebnader genomicbreedingvaluepredictionusingthreebayesianmethodsandapplicationtoreduceddensitymarkerpanels AT malteccachristian genomicbreedingvaluepredictionusingthreebayesianmethodsandapplicationtoreduceddensitymarkerpanels |