Cargando…

A Bayesian Genomic Regression Model with Skew Normal Random Errors

Genomic selection (GS) has become a tool for selecting candidates in plant and animal breeding programs. In the case of quantitative traits, it is common to assume that the distribution of the response variable can be approximated by a normal distribution. However, it is known that the selection pro...

Descripción completa

Detalles Bibliográficos
Autores principales: Pérez-Rodríguez, Paulino, Acosta-Pech, Rocío, Pérez-Elizalde, Sergio, Cruz, Ciro Velasco, Espinosa, Javier Suárez, Crossa, José
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5940167/
https://www.ncbi.nlm.nih.gov/pubmed/29588381
http://dx.doi.org/10.1534/g3.117.300406
_version_ 1783321061196038144
author Pérez-Rodríguez, Paulino
Acosta-Pech, Rocío
Pérez-Elizalde, Sergio
Cruz, Ciro Velasco
Espinosa, Javier Suárez
Crossa, José
author_facet Pérez-Rodríguez, Paulino
Acosta-Pech, Rocío
Pérez-Elizalde, Sergio
Cruz, Ciro Velasco
Espinosa, Javier Suárez
Crossa, José
author_sort Pérez-Rodríguez, Paulino
collection PubMed
description Genomic selection (GS) has become a tool for selecting candidates in plant and animal breeding programs. In the case of quantitative traits, it is common to assume that the distribution of the response variable can be approximated by a normal distribution. However, it is known that the selection process leads to skewed distributions. There is vast statistical literature on skewed distributions, but the skew normal distribution is of particular interest in this research. This distribution includes a third parameter that drives the skewness, so that it generalizes the normal distribution. We propose an extension of the Bayesian whole-genome regression to skew normal distribution data in the context of GS applications, where usually the number of predictors vastly exceeds the sample size. However, it can also be applied when the number of predictors is smaller than the sample size. We used a stochastic representation of a skew normal random variable, which allows the implementation of standard Markov Chain Monte Carlo (MCMC) techniques to efficiently fit the proposed model. The predictive ability and goodness of fit of the proposed model were evaluated using simulated and real data, and the results were compared to those obtained by the Bayesian Ridge Regression model. Results indicate that the proposed model has a better fit and is as good as the conventional Bayesian Ridge Regression model for prediction, based on the DIC criterion and cross-validation, respectively. A computing program coded in the R statistical package and C programming language to fit the proposed model is available as supplementary material.
format Online
Article
Text
id pubmed-5940167
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-59401672018-05-10 A Bayesian Genomic Regression Model with Skew Normal Random Errors Pérez-Rodríguez, Paulino Acosta-Pech, Rocío Pérez-Elizalde, Sergio Cruz, Ciro Velasco Espinosa, Javier Suárez Crossa, José G3 (Bethesda) Genomic Selection Genomic selection (GS) has become a tool for selecting candidates in plant and animal breeding programs. In the case of quantitative traits, it is common to assume that the distribution of the response variable can be approximated by a normal distribution. However, it is known that the selection process leads to skewed distributions. There is vast statistical literature on skewed distributions, but the skew normal distribution is of particular interest in this research. This distribution includes a third parameter that drives the skewness, so that it generalizes the normal distribution. We propose an extension of the Bayesian whole-genome regression to skew normal distribution data in the context of GS applications, where usually the number of predictors vastly exceeds the sample size. However, it can also be applied when the number of predictors is smaller than the sample size. We used a stochastic representation of a skew normal random variable, which allows the implementation of standard Markov Chain Monte Carlo (MCMC) techniques to efficiently fit the proposed model. The predictive ability and goodness of fit of the proposed model were evaluated using simulated and real data, and the results were compared to those obtained by the Bayesian Ridge Regression model. Results indicate that the proposed model has a better fit and is as good as the conventional Bayesian Ridge Regression model for prediction, based on the DIC criterion and cross-validation, respectively. A computing program coded in the R statistical package and C programming language to fit the proposed model is available as supplementary material. Genetics Society of America 2018-03-27 /pmc/articles/PMC5940167/ /pubmed/29588381 http://dx.doi.org/10.1534/g3.117.300406 Text en Copyright © 2018 Pérez-Rodríguez et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Genomic Selection
Pérez-Rodríguez, Paulino
Acosta-Pech, Rocío
Pérez-Elizalde, Sergio
Cruz, Ciro Velasco
Espinosa, Javier Suárez
Crossa, José
A Bayesian Genomic Regression Model with Skew Normal Random Errors
title A Bayesian Genomic Regression Model with Skew Normal Random Errors
title_full A Bayesian Genomic Regression Model with Skew Normal Random Errors
title_fullStr A Bayesian Genomic Regression Model with Skew Normal Random Errors
title_full_unstemmed A Bayesian Genomic Regression Model with Skew Normal Random Errors
title_short A Bayesian Genomic Regression Model with Skew Normal Random Errors
title_sort bayesian genomic regression model with skew normal random errors
topic Genomic Selection
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5940167/
https://www.ncbi.nlm.nih.gov/pubmed/29588381
http://dx.doi.org/10.1534/g3.117.300406
work_keys_str_mv AT perezrodriguezpaulino abayesiangenomicregressionmodelwithskewnormalrandomerrors
AT acostapechrocio abayesiangenomicregressionmodelwithskewnormalrandomerrors
AT perezelizaldesergio abayesiangenomicregressionmodelwithskewnormalrandomerrors
AT cruzcirovelasco abayesiangenomicregressionmodelwithskewnormalrandomerrors
AT espinosajaviersuarez abayesiangenomicregressionmodelwithskewnormalrandomerrors
AT crossajose abayesiangenomicregressionmodelwithskewnormalrandomerrors
AT perezrodriguezpaulino bayesiangenomicregressionmodelwithskewnormalrandomerrors
AT acostapechrocio bayesiangenomicregressionmodelwithskewnormalrandomerrors
AT perezelizaldesergio bayesiangenomicregressionmodelwithskewnormalrandomerrors
AT cruzcirovelasco bayesiangenomicregressionmodelwithskewnormalrandomerrors
AT espinosajaviersuarez bayesiangenomicregressionmodelwithskewnormalrandomerrors
AT crossajose bayesiangenomicregressionmodelwithskewnormalrandomerrors