Cargando…

Genome-Wide Regression and Prediction with the BGLR Statistical Package

Many modern genomic data analyses require implementing regressions where the number of parameters (p, e.g., the number of marker effects) exceeds sample size (n). Implementing these large-p-with-small-n regressions poses several statistical and computational challenges, some of which can be confront...

Descripción completa

Detalles Bibliográficos
Autores principales: Pérez, Paulino, de los Campos, Gustavo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4196607/
https://www.ncbi.nlm.nih.gov/pubmed/25009151
http://dx.doi.org/10.1534/genetics.114.164442
_version_ 1782339507269402624
author Pérez, Paulino
de los Campos, Gustavo
author_facet Pérez, Paulino
de los Campos, Gustavo
author_sort Pérez, Paulino
collection PubMed
description Many modern genomic data analyses require implementing regressions where the number of parameters (p, e.g., the number of marker effects) exceeds sample size (n). Implementing these large-p-with-small-n regressions poses several statistical and computational challenges, some of which can be confronted using Bayesian methods. This approach allows integrating various parametric and nonparametric shrinkage and variable selection procedures in a unified and consistent manner. The BGLR R-package implements a large collection of Bayesian regression models, including parametric variable selection and shrinkage methods and semiparametric procedures (Bayesian reproducing kernel Hilbert spaces regressions, RKHS). The software was originally developed for genomic applications; however, the methods implemented are useful for many nongenomic applications as well. The response can be continuous (censored or not) or categorical (either binary or ordinal). The algorithm is based on a Gibbs sampler with scalar updates and the implementation takes advantage of efficient compiled C and Fortran routines. In this article we describe the methods implemented in BGLR, present examples of the use of the package, and discuss practical issues emerging in real-data analysis.
format Online
Article
Text
id pubmed-4196607
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-41966072014-10-17 Genome-Wide Regression and Prediction with the BGLR Statistical Package Pérez, Paulino de los Campos, Gustavo Genetics Investigations Many modern genomic data analyses require implementing regressions where the number of parameters (p, e.g., the number of marker effects) exceeds sample size (n). Implementing these large-p-with-small-n regressions poses several statistical and computational challenges, some of which can be confronted using Bayesian methods. This approach allows integrating various parametric and nonparametric shrinkage and variable selection procedures in a unified and consistent manner. The BGLR R-package implements a large collection of Bayesian regression models, including parametric variable selection and shrinkage methods and semiparametric procedures (Bayesian reproducing kernel Hilbert spaces regressions, RKHS). The software was originally developed for genomic applications; however, the methods implemented are useful for many nongenomic applications as well. The response can be continuous (censored or not) or categorical (either binary or ordinal). The algorithm is based on a Gibbs sampler with scalar updates and the implementation takes advantage of efficient compiled C and Fortran routines. In this article we describe the methods implemented in BGLR, present examples of the use of the package, and discuss practical issues emerging in real-data analysis. Genetics Society of America 2014-10 2014-07-09 /pmc/articles/PMC4196607/ /pubmed/25009151 http://dx.doi.org/10.1534/genetics.114.164442 Text en Copyright © 2014 by the Genetics Society of America Available freely online through the author-supported open access option.
spellingShingle Investigations
Pérez, Paulino
de los Campos, Gustavo
Genome-Wide Regression and Prediction with the BGLR Statistical Package
title Genome-Wide Regression and Prediction with the BGLR Statistical Package
title_full Genome-Wide Regression and Prediction with the BGLR Statistical Package
title_fullStr Genome-Wide Regression and Prediction with the BGLR Statistical Package
title_full_unstemmed Genome-Wide Regression and Prediction with the BGLR Statistical Package
title_short Genome-Wide Regression and Prediction with the BGLR Statistical Package
title_sort genome-wide regression and prediction with the bglr statistical package
topic Investigations
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4196607/
https://www.ncbi.nlm.nih.gov/pubmed/25009151
http://dx.doi.org/10.1534/genetics.114.164442
work_keys_str_mv AT perezpaulino genomewideregressionandpredictionwiththebglrstatisticalpackage
AT deloscamposgustavo genomewideregressionandpredictionwiththebglrstatisticalpackage