Cargando…

A tutorial on Bayesian multi-model linear regression with BAS and JASP

Linear regression analyses commonly involve two consecutive stages of statistical inquiry. In the first stage, a single ‘best’ model is defined by a specific selection of relevant predictors; in the second stage, the regression coefficients of the winning model are used for prediction and for infere...

Descripción completa

Detalles Bibliográficos
Autores principales: Bergh, Don van den, Clyde, Merlise A., Gupta, Akash R. Komarlu Narendra, de Jong, Tim, Gronau, Quentin F., Marsman, Maarten, Ly, Alexander, Wagenmakers, Eric-Jan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8613115/
https://www.ncbi.nlm.nih.gov/pubmed/33835394
http://dx.doi.org/10.3758/s13428-021-01552-2
_version_ 1784603570067210240
author Bergh, Don van den
Clyde, Merlise A.
Gupta, Akash R. Komarlu Narendra
de Jong, Tim
Gronau, Quentin F.
Marsman, Maarten
Ly, Alexander
Wagenmakers, Eric-Jan
author_facet Bergh, Don van den
Clyde, Merlise A.
Gupta, Akash R. Komarlu Narendra
de Jong, Tim
Gronau, Quentin F.
Marsman, Maarten
Ly, Alexander
Wagenmakers, Eric-Jan
author_sort Bergh, Don van den
collection PubMed
description Linear regression analyses commonly involve two consecutive stages of statistical inquiry. In the first stage, a single ‘best’ model is defined by a specific selection of relevant predictors; in the second stage, the regression coefficients of the winning model are used for prediction and for inference concerning the importance of the predictors. However, such second-stage inference ignores the model uncertainty from the first stage, resulting in overconfident parameter estimates that generalize poorly. These drawbacks can be overcome by model averaging, a technique that retains all models for inference, weighting each model’s contribution by its posterior probability. Although conceptually straightforward, model averaging is rarely used in applied research, possibly due to the lack of easily accessible software. To bridge the gap between theory and practice, we provide a tutorial on linear regression using Bayesian model averaging in JASP, based on the BAS package in R. Firstly, we provide theoretical background on linear regression, Bayesian inference, and Bayesian model averaging. Secondly, we demonstrate the method on an example data set from the World Happiness Report. Lastly, we discuss limitations of model averaging and directions for dealing with violations of model assumptions.
format Online
Article
Text
id pubmed-8613115
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-86131152021-12-10 A tutorial on Bayesian multi-model linear regression with BAS and JASP Bergh, Don van den Clyde, Merlise A. Gupta, Akash R. Komarlu Narendra de Jong, Tim Gronau, Quentin F. Marsman, Maarten Ly, Alexander Wagenmakers, Eric-Jan Behav Res Methods Article Linear regression analyses commonly involve two consecutive stages of statistical inquiry. In the first stage, a single ‘best’ model is defined by a specific selection of relevant predictors; in the second stage, the regression coefficients of the winning model are used for prediction and for inference concerning the importance of the predictors. However, such second-stage inference ignores the model uncertainty from the first stage, resulting in overconfident parameter estimates that generalize poorly. These drawbacks can be overcome by model averaging, a technique that retains all models for inference, weighting each model’s contribution by its posterior probability. Although conceptually straightforward, model averaging is rarely used in applied research, possibly due to the lack of easily accessible software. To bridge the gap between theory and practice, we provide a tutorial on linear regression using Bayesian model averaging in JASP, based on the BAS package in R. Firstly, we provide theoretical background on linear regression, Bayesian inference, and Bayesian model averaging. Secondly, we demonstrate the method on an example data set from the World Happiness Report. Lastly, we discuss limitations of model averaging and directions for dealing with violations of model assumptions. Springer US 2021-04-09 2021 /pmc/articles/PMC8613115/ /pubmed/33835394 http://dx.doi.org/10.3758/s13428-021-01552-2 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Bergh, Don van den
Clyde, Merlise A.
Gupta, Akash R. Komarlu Narendra
de Jong, Tim
Gronau, Quentin F.
Marsman, Maarten
Ly, Alexander
Wagenmakers, Eric-Jan
A tutorial on Bayesian multi-model linear regression with BAS and JASP
title A tutorial on Bayesian multi-model linear regression with BAS and JASP
title_full A tutorial on Bayesian multi-model linear regression with BAS and JASP
title_fullStr A tutorial on Bayesian multi-model linear regression with BAS and JASP
title_full_unstemmed A tutorial on Bayesian multi-model linear regression with BAS and JASP
title_short A tutorial on Bayesian multi-model linear regression with BAS and JASP
title_sort tutorial on bayesian multi-model linear regression with bas and jasp
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8613115/
https://www.ncbi.nlm.nih.gov/pubmed/33835394
http://dx.doi.org/10.3758/s13428-021-01552-2
work_keys_str_mv AT berghdonvanden atutorialonbayesianmultimodellinearregressionwithbasandjasp
AT clydemerlisea atutorialonbayesianmultimodellinearregressionwithbasandjasp
AT guptaakashrkomarlunarendra atutorialonbayesianmultimodellinearregressionwithbasandjasp
AT dejongtim atutorialonbayesianmultimodellinearregressionwithbasandjasp
AT gronauquentinf atutorialonbayesianmultimodellinearregressionwithbasandjasp
AT marsmanmaarten atutorialonbayesianmultimodellinearregressionwithbasandjasp
AT lyalexander atutorialonbayesianmultimodellinearregressionwithbasandjasp
AT wagenmakersericjan atutorialonbayesianmultimodellinearregressionwithbasandjasp
AT berghdonvanden tutorialonbayesianmultimodellinearregressionwithbasandjasp
AT clydemerlisea tutorialonbayesianmultimodellinearregressionwithbasandjasp
AT guptaakashrkomarlunarendra tutorialonbayesianmultimodellinearregressionwithbasandjasp
AT dejongtim tutorialonbayesianmultimodellinearregressionwithbasandjasp
AT gronauquentinf tutorialonbayesianmultimodellinearregressionwithbasandjasp
AT marsmanmaarten tutorialonbayesianmultimodellinearregressionwithbasandjasp
AT lyalexander tutorialonbayesianmultimodellinearregressionwithbasandjasp
AT wagenmakersericjan tutorialonbayesianmultimodellinearregressionwithbasandjasp