Cargando…

An R Package for Bayesian Analysis of Multi-environment and Multi-trait Multi-environment Data for Genome-Based Prediction

Evidence that genomic selection (GS) is a technology that is revolutionizing plant breeding continues to grow. However, it is very well documented that its success strongly depends on statistical models, which are used by GS to perform predictions of candidate genotypes that were not phenotyped. Bec...

Descripción completa

Detalles Bibliográficos
Autores principales: Montesinos-López, Osval A., Montesinos-López, Abelardo, Luna-Vázquez, Francisco Javier, Toledo, Fernando H., Pérez-Rodríguez, Paulino, Lillemo, Morten, Crossa, José
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6505148/
https://www.ncbi.nlm.nih.gov/pubmed/30819822
http://dx.doi.org/10.1534/g3.119.400126
_version_ 1783416700406857728
author Montesinos-López, Osval A.
Montesinos-López, Abelardo
Luna-Vázquez, Francisco Javier
Toledo, Fernando H.
Pérez-Rodríguez, Paulino
Lillemo, Morten
Crossa, José
author_facet Montesinos-López, Osval A.
Montesinos-López, Abelardo
Luna-Vázquez, Francisco Javier
Toledo, Fernando H.
Pérez-Rodríguez, Paulino
Lillemo, Morten
Crossa, José
author_sort Montesinos-López, Osval A.
collection PubMed
description Evidence that genomic selection (GS) is a technology that is revolutionizing plant breeding continues to grow. However, it is very well documented that its success strongly depends on statistical models, which are used by GS to perform predictions of candidate genotypes that were not phenotyped. Because there is no universally better model for prediction and models for each type of response variable are needed (continuous, binary, ordinal, count, etc.), an active area of research aims to develop statistical models for the prediction of univariate and multivariate traits in GS. However, most of the models developed so far are for univariate and continuous (Gaussian) traits. Therefore, to overcome the lack of multivariate statistical models for genome-based prediction by improving the original version of the BMTME, we propose an improved Bayesian multi-trait and multi-environment (BMTME) R package for analyzing breeding data with multiple traits and multiple environments. We also introduce Bayesian multi-output regressor stacking (BMORS) functions that are considerably efficient in terms of computational resources. The package allows parameter estimation and evaluates the prediction performance of multi-trait and multi-environment data in a reliable, efficient and user-friendly way. We illustrate the use of the BMTME with real toy datasets to show all the facilities that the software offers the user. However, for large datasets, the BME() and BMTME() functions of the BMTME R package are very intense in terms of computing time; on the other hand, less intensive computing is required with BMORS functions BMORS() and BMORS_Env() that are also included in the BMTME package.
format Online
Article
Text
id pubmed-6505148
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-65051482019-05-21 An R Package for Bayesian Analysis of Multi-environment and Multi-trait Multi-environment Data for Genome-Based Prediction Montesinos-López, Osval A. Montesinos-López, Abelardo Luna-Vázquez, Francisco Javier Toledo, Fernando H. Pérez-Rodríguez, Paulino Lillemo, Morten Crossa, José G3 (Bethesda) Software and Data Resources Evidence that genomic selection (GS) is a technology that is revolutionizing plant breeding continues to grow. However, it is very well documented that its success strongly depends on statistical models, which are used by GS to perform predictions of candidate genotypes that were not phenotyped. Because there is no universally better model for prediction and models for each type of response variable are needed (continuous, binary, ordinal, count, etc.), an active area of research aims to develop statistical models for the prediction of univariate and multivariate traits in GS. However, most of the models developed so far are for univariate and continuous (Gaussian) traits. Therefore, to overcome the lack of multivariate statistical models for genome-based prediction by improving the original version of the BMTME, we propose an improved Bayesian multi-trait and multi-environment (BMTME) R package for analyzing breeding data with multiple traits and multiple environments. We also introduce Bayesian multi-output regressor stacking (BMORS) functions that are considerably efficient in terms of computational resources. The package allows parameter estimation and evaluates the prediction performance of multi-trait and multi-environment data in a reliable, efficient and user-friendly way. We illustrate the use of the BMTME with real toy datasets to show all the facilities that the software offers the user. However, for large datasets, the BME() and BMTME() functions of the BMTME R package are very intense in terms of computing time; on the other hand, less intensive computing is required with BMORS functions BMORS() and BMORS_Env() that are also included in the BMTME package. Genetics Society of America 2019-02-28 /pmc/articles/PMC6505148/ /pubmed/30819822 http://dx.doi.org/10.1534/g3.119.400126 Text en Copyright © 2019 Montesinos-López et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software and Data Resources
Montesinos-López, Osval A.
Montesinos-López, Abelardo
Luna-Vázquez, Francisco Javier
Toledo, Fernando H.
Pérez-Rodríguez, Paulino
Lillemo, Morten
Crossa, José
An R Package for Bayesian Analysis of Multi-environment and Multi-trait Multi-environment Data for Genome-Based Prediction
title An R Package for Bayesian Analysis of Multi-environment and Multi-trait Multi-environment Data for Genome-Based Prediction
title_full An R Package for Bayesian Analysis of Multi-environment and Multi-trait Multi-environment Data for Genome-Based Prediction
title_fullStr An R Package for Bayesian Analysis of Multi-environment and Multi-trait Multi-environment Data for Genome-Based Prediction
title_full_unstemmed An R Package for Bayesian Analysis of Multi-environment and Multi-trait Multi-environment Data for Genome-Based Prediction
title_short An R Package for Bayesian Analysis of Multi-environment and Multi-trait Multi-environment Data for Genome-Based Prediction
title_sort r package for bayesian analysis of multi-environment and multi-trait multi-environment data for genome-based prediction
topic Software and Data Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6505148/
https://www.ncbi.nlm.nih.gov/pubmed/30819822
http://dx.doi.org/10.1534/g3.119.400126
work_keys_str_mv AT montesinoslopezosvala anrpackageforbayesiananalysisofmultienvironmentandmultitraitmultienvironmentdataforgenomebasedprediction
AT montesinoslopezabelardo anrpackageforbayesiananalysisofmultienvironmentandmultitraitmultienvironmentdataforgenomebasedprediction
AT lunavazquezfranciscojavier anrpackageforbayesiananalysisofmultienvironmentandmultitraitmultienvironmentdataforgenomebasedprediction
AT toledofernandoh anrpackageforbayesiananalysisofmultienvironmentandmultitraitmultienvironmentdataforgenomebasedprediction
AT perezrodriguezpaulino anrpackageforbayesiananalysisofmultienvironmentandmultitraitmultienvironmentdataforgenomebasedprediction
AT lillemomorten anrpackageforbayesiananalysisofmultienvironmentandmultitraitmultienvironmentdataforgenomebasedprediction
AT crossajose anrpackageforbayesiananalysisofmultienvironmentandmultitraitmultienvironmentdataforgenomebasedprediction
AT montesinoslopezosvala rpackageforbayesiananalysisofmultienvironmentandmultitraitmultienvironmentdataforgenomebasedprediction
AT montesinoslopezabelardo rpackageforbayesiananalysisofmultienvironmentandmultitraitmultienvironmentdataforgenomebasedprediction
AT lunavazquezfranciscojavier rpackageforbayesiananalysisofmultienvironmentandmultitraitmultienvironmentdataforgenomebasedprediction
AT toledofernandoh rpackageforbayesiananalysisofmultienvironmentandmultitraitmultienvironmentdataforgenomebasedprediction
AT perezrodriguezpaulino rpackageforbayesiananalysisofmultienvironmentandmultitraitmultienvironmentdataforgenomebasedprediction
AT lillemomorten rpackageforbayesiananalysisofmultienvironmentandmultitraitmultienvironmentdataforgenomebasedprediction
AT crossajose rpackageforbayesiananalysisofmultienvironmentandmultitraitmultienvironmentdataforgenomebasedprediction