Cargando…

Bayesian LASSO, Scale Space and Decision Making in Association Genetics

BACKGROUND: LASSO is a penalized regression method that facilitates model fitting in situations where there are as many, or even more explanatory variables than observations, and only a few variables are relevant in explaining the data. We focus on the Bayesian version of LASSO and consider four pro...

Descripción completa

Detalles Bibliográficos
Autores principales: Pasanen, Leena, Holmström, Lasse, Sillanpää, Mikko J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4391919/
https://www.ncbi.nlm.nih.gov/pubmed/25856391
http://dx.doi.org/10.1371/journal.pone.0120017
_version_ 1782365896951463936
author Pasanen, Leena
Holmström, Lasse
Sillanpää, Mikko J.
author_facet Pasanen, Leena
Holmström, Lasse
Sillanpää, Mikko J.
author_sort Pasanen, Leena
collection PubMed
description BACKGROUND: LASSO is a penalized regression method that facilitates model fitting in situations where there are as many, or even more explanatory variables than observations, and only a few variables are relevant in explaining the data. We focus on the Bayesian version of LASSO and consider four problems that need special attention: (i) controlling false positives, (ii) multiple comparisons, (iii) collinearity among explanatory variables, and (iv) the choice of the tuning parameter that controls the amount of shrinkage and the sparsity of the estimates. The particular application considered is association genetics, where LASSO regression can be used to find links between chromosome locations and phenotypic traits in a biological organism. However, the proposed techniques are relevant also in other contexts where LASSO is used for variable selection. RESULTS: We separate the true associations from false positives using the posterior distribution of the effects (regression coefficients) provided by Bayesian LASSO. We propose to solve the multiple comparisons problem by using simultaneous inference based on the joint posterior distribution of the effects. Bayesian LASSO also tends to distribute an effect among collinear variables, making detection of an association difficult. We propose to solve this problem by considering not only individual effects but also their functionals (i.e. sums and differences). Finally, whereas in Bayesian LASSO the tuning parameter is often regarded as a random variable, we adopt a scale space view and consider a whole range of fixed tuning parameters, instead. The effect estimates and the associated inference are considered for all tuning parameters in the selected range and the results are visualized with color maps that provide useful insights into data and the association problem considered. The methods are illustrated using two sets of artificial data and one real data set, all representing typical settings in association genetics.
format Online
Article
Text
id pubmed-4391919
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-43919192015-04-21 Bayesian LASSO, Scale Space and Decision Making in Association Genetics Pasanen, Leena Holmström, Lasse Sillanpää, Mikko J. PLoS One Research Article BACKGROUND: LASSO is a penalized regression method that facilitates model fitting in situations where there are as many, or even more explanatory variables than observations, and only a few variables are relevant in explaining the data. We focus on the Bayesian version of LASSO and consider four problems that need special attention: (i) controlling false positives, (ii) multiple comparisons, (iii) collinearity among explanatory variables, and (iv) the choice of the tuning parameter that controls the amount of shrinkage and the sparsity of the estimates. The particular application considered is association genetics, where LASSO regression can be used to find links between chromosome locations and phenotypic traits in a biological organism. However, the proposed techniques are relevant also in other contexts where LASSO is used for variable selection. RESULTS: We separate the true associations from false positives using the posterior distribution of the effects (regression coefficients) provided by Bayesian LASSO. We propose to solve the multiple comparisons problem by using simultaneous inference based on the joint posterior distribution of the effects. Bayesian LASSO also tends to distribute an effect among collinear variables, making detection of an association difficult. We propose to solve this problem by considering not only individual effects but also their functionals (i.e. sums and differences). Finally, whereas in Bayesian LASSO the tuning parameter is often regarded as a random variable, we adopt a scale space view and consider a whole range of fixed tuning parameters, instead. The effect estimates and the associated inference are considered for all tuning parameters in the selected range and the results are visualized with color maps that provide useful insights into data and the association problem considered. The methods are illustrated using two sets of artificial data and one real data set, all representing typical settings in association genetics. Public Library of Science 2015-04-09 /pmc/articles/PMC4391919/ /pubmed/25856391 http://dx.doi.org/10.1371/journal.pone.0120017 Text en © 2015 Pasanen et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Pasanen, Leena
Holmström, Lasse
Sillanpää, Mikko J.
Bayesian LASSO, Scale Space and Decision Making in Association Genetics
title Bayesian LASSO, Scale Space and Decision Making in Association Genetics
title_full Bayesian LASSO, Scale Space and Decision Making in Association Genetics
title_fullStr Bayesian LASSO, Scale Space and Decision Making in Association Genetics
title_full_unstemmed Bayesian LASSO, Scale Space and Decision Making in Association Genetics
title_short Bayesian LASSO, Scale Space and Decision Making in Association Genetics
title_sort bayesian lasso, scale space and decision making in association genetics
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4391919/
https://www.ncbi.nlm.nih.gov/pubmed/25856391
http://dx.doi.org/10.1371/journal.pone.0120017
work_keys_str_mv AT pasanenleena bayesianlassoscalespaceanddecisionmakinginassociationgenetics
AT holmstromlasse bayesianlassoscalespaceanddecisionmakinginassociationgenetics
AT sillanpaamikkoj bayesianlassoscalespaceanddecisionmakinginassociationgenetics