Cargando…
Genome-Wide Regression and Prediction with the BGLR Statistical Package
Many modern genomic data analyses require implementing regressions where the number of parameters (p, e.g., the number of marker effects) exceeds sample size (n). Implementing these large-p-with-small-n regressions poses several statistical and computational challenges, some of which can be confront...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Genetics Society of America
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4196607/ https://www.ncbi.nlm.nih.gov/pubmed/25009151 http://dx.doi.org/10.1534/genetics.114.164442 |
_version_ | 1782339507269402624 |
---|---|
author | Pérez, Paulino de los Campos, Gustavo |
author_facet | Pérez, Paulino de los Campos, Gustavo |
author_sort | Pérez, Paulino |
collection | PubMed |
description | Many modern genomic data analyses require implementing regressions where the number of parameters (p, e.g., the number of marker effects) exceeds sample size (n). Implementing these large-p-with-small-n regressions poses several statistical and computational challenges, some of which can be confronted using Bayesian methods. This approach allows integrating various parametric and nonparametric shrinkage and variable selection procedures in a unified and consistent manner. The BGLR R-package implements a large collection of Bayesian regression models, including parametric variable selection and shrinkage methods and semiparametric procedures (Bayesian reproducing kernel Hilbert spaces regressions, RKHS). The software was originally developed for genomic applications; however, the methods implemented are useful for many nongenomic applications as well. The response can be continuous (censored or not) or categorical (either binary or ordinal). The algorithm is based on a Gibbs sampler with scalar updates and the implementation takes advantage of efficient compiled C and Fortran routines. In this article we describe the methods implemented in BGLR, present examples of the use of the package, and discuss practical issues emerging in real-data analysis. |
format | Online Article Text |
id | pubmed-4196607 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Genetics Society of America |
record_format | MEDLINE/PubMed |
spelling | pubmed-41966072014-10-17 Genome-Wide Regression and Prediction with the BGLR Statistical Package Pérez, Paulino de los Campos, Gustavo Genetics Investigations Many modern genomic data analyses require implementing regressions where the number of parameters (p, e.g., the number of marker effects) exceeds sample size (n). Implementing these large-p-with-small-n regressions poses several statistical and computational challenges, some of which can be confronted using Bayesian methods. This approach allows integrating various parametric and nonparametric shrinkage and variable selection procedures in a unified and consistent manner. The BGLR R-package implements a large collection of Bayesian regression models, including parametric variable selection and shrinkage methods and semiparametric procedures (Bayesian reproducing kernel Hilbert spaces regressions, RKHS). The software was originally developed for genomic applications; however, the methods implemented are useful for many nongenomic applications as well. The response can be continuous (censored or not) or categorical (either binary or ordinal). The algorithm is based on a Gibbs sampler with scalar updates and the implementation takes advantage of efficient compiled C and Fortran routines. In this article we describe the methods implemented in BGLR, present examples of the use of the package, and discuss practical issues emerging in real-data analysis. Genetics Society of America 2014-10 2014-07-09 /pmc/articles/PMC4196607/ /pubmed/25009151 http://dx.doi.org/10.1534/genetics.114.164442 Text en Copyright © 2014 by the Genetics Society of America Available freely online through the author-supported open access option. |
spellingShingle | Investigations Pérez, Paulino de los Campos, Gustavo Genome-Wide Regression and Prediction with the BGLR Statistical Package |
title | Genome-Wide Regression and Prediction with the BGLR Statistical Package |
title_full | Genome-Wide Regression and Prediction with the BGLR Statistical Package |
title_fullStr | Genome-Wide Regression and Prediction with the BGLR Statistical Package |
title_full_unstemmed | Genome-Wide Regression and Prediction with the BGLR Statistical Package |
title_short | Genome-Wide Regression and Prediction with the BGLR Statistical Package |
title_sort | genome-wide regression and prediction with the bglr statistical package |
topic | Investigations |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4196607/ https://www.ncbi.nlm.nih.gov/pubmed/25009151 http://dx.doi.org/10.1534/genetics.114.164442 |
work_keys_str_mv | AT perezpaulino genomewideregressionandpredictionwiththebglrstatisticalpackage AT deloscamposgustavo genomewideregressionandpredictionwiththebglrstatisticalpackage |