Cargando…

Estimation of genomic breeding values using the Horseshoe prior

BACKGROUND: A method for estimating genomic breeding values (GEBV) based on the Horseshoe prior was introduced and used on the analysis of the 16(th )QTLMAS workshop dataset, which resembles three milk production traits. The method was compared with five commonly used methods: Bayes A, Bayes B, Baye...

Descripción completa

Detalles Bibliográficos
Autor principal: Pong-Wong, Ricardo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4195408/
https://www.ncbi.nlm.nih.gov/pubmed/25519520
http://dx.doi.org/10.1186/1753-6561-8-S5-S6
_version_ 1782339309697761280
author Pong-Wong, Ricardo
author_facet Pong-Wong, Ricardo
author_sort Pong-Wong, Ricardo
collection PubMed
description BACKGROUND: A method for estimating genomic breeding values (GEBV) based on the Horseshoe prior was introduced and used on the analysis of the 16(th )QTLMAS workshop dataset, which resembles three milk production traits. The method was compared with five commonly used methods: Bayes A, Bayes B, Bayes C, Bayesian Lasso and GLUP. METHODS: The main difference between the methods is the prior distribution assumed during the estimation of the SNP effects. The distribution of the Bayesian Lasso is a Laplace distribution; for Bayes A is a Student-t; for Bayes B and Bayes C is a spike and slab prior combining a proportion of SNP without effect and a proportion with effect distributed as a Student-t or Gaussian for Bayes B and C, respectively; for GBLUP is similar to a ridge regression. The distribution for the Horseshoe prior behaves like log(1+1/β(2)) (up to a constant). It has an infinite spike at zero and heavy tail that decay by β(-2 )(slower than the Laplace or the Student-t). The implementation of all methods (except GBLUP) was done using a MCMC approach, where the relevant parameters defining the prior distributions were jointly estimated from the data. The GBLUP was done using ASREML. RESULTS: The accuracy for all methods ranged from 0.74 to 0.83, representing an improvement of 44% to 78% over the traditional BLUP evaluation. GEBV with the highest accuracy were obtained with Bayes A, Bayes B and the Horseshoe prior. The Horseshoe tended to select smaller number of SNP and assigning them larger effects, while strongly shrinking the remaining SNP to have an effect closer to zero. CONCLUSIONS: The Horseshoe prior showed a different shrinkage pattern than the other methods. While for this specific dataset, this has little impact on the accuracy of the GEBV, it may prove a good property to discriminate true effect from noise, and thereby, improve overall prediction under different scenarios.
format Online
Article
Text
id pubmed-4195408
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41954082014-11-05 Estimation of genomic breeding values using the Horseshoe prior Pong-Wong, Ricardo BMC Proc Proceedings BACKGROUND: A method for estimating genomic breeding values (GEBV) based on the Horseshoe prior was introduced and used on the analysis of the 16(th )QTLMAS workshop dataset, which resembles three milk production traits. The method was compared with five commonly used methods: Bayes A, Bayes B, Bayes C, Bayesian Lasso and GLUP. METHODS: The main difference between the methods is the prior distribution assumed during the estimation of the SNP effects. The distribution of the Bayesian Lasso is a Laplace distribution; for Bayes A is a Student-t; for Bayes B and Bayes C is a spike and slab prior combining a proportion of SNP without effect and a proportion with effect distributed as a Student-t or Gaussian for Bayes B and C, respectively; for GBLUP is similar to a ridge regression. The distribution for the Horseshoe prior behaves like log(1+1/β(2)) (up to a constant). It has an infinite spike at zero and heavy tail that decay by β(-2 )(slower than the Laplace or the Student-t). The implementation of all methods (except GBLUP) was done using a MCMC approach, where the relevant parameters defining the prior distributions were jointly estimated from the data. The GBLUP was done using ASREML. RESULTS: The accuracy for all methods ranged from 0.74 to 0.83, representing an improvement of 44% to 78% over the traditional BLUP evaluation. GEBV with the highest accuracy were obtained with Bayes A, Bayes B and the Horseshoe prior. The Horseshoe tended to select smaller number of SNP and assigning them larger effects, while strongly shrinking the remaining SNP to have an effect closer to zero. CONCLUSIONS: The Horseshoe prior showed a different shrinkage pattern than the other methods. While for this specific dataset, this has little impact on the accuracy of the GEBV, it may prove a good property to discriminate true effect from noise, and thereby, improve overall prediction under different scenarios. BioMed Central 2014-10-07 /pmc/articles/PMC4195408/ /pubmed/25519520 http://dx.doi.org/10.1186/1753-6561-8-S5-S6 Text en Copyright © 2014 Pong-Wong; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Proceedings
Pong-Wong, Ricardo
Estimation of genomic breeding values using the Horseshoe prior
title Estimation of genomic breeding values using the Horseshoe prior
title_full Estimation of genomic breeding values using the Horseshoe prior
title_fullStr Estimation of genomic breeding values using the Horseshoe prior
title_full_unstemmed Estimation of genomic breeding values using the Horseshoe prior
title_short Estimation of genomic breeding values using the Horseshoe prior
title_sort estimation of genomic breeding values using the horseshoe prior
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4195408/
https://www.ncbi.nlm.nih.gov/pubmed/25519520
http://dx.doi.org/10.1186/1753-6561-8-S5-S6
work_keys_str_mv AT pongwongricardo estimationofgenomicbreedingvaluesusingthehorseshoeprior