Cargando…
The value of Bayesian predictive projection for variable selection: an example of selecting lifestyle predictors of young adult well-being
BACKGROUND: Variable selection is an important issue in many fields such as public health and psychology. Researchers often gather data on many variables of interest and then are faced with two challenging goals: building an accurate model with few predictors, and making probabilistic statements (in...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8033696/ https://www.ncbi.nlm.nih.gov/pubmed/33836714 http://dx.doi.org/10.1186/s12889-021-10690-3 |
_version_ | 1783676448429572096 |
---|---|
author | Bartonicek, A. Wickham, S. R. Pat, N. Conner, T. S. |
author_facet | Bartonicek, A. Wickham, S. R. Pat, N. Conner, T. S. |
author_sort | Bartonicek, A. |
collection | PubMed |
description | BACKGROUND: Variable selection is an important issue in many fields such as public health and psychology. Researchers often gather data on many variables of interest and then are faced with two challenging goals: building an accurate model with few predictors, and making probabilistic statements (inference) about this model. Unfortunately, it is currently difficult to attain these goals with the two most popular methods for variable selection methods: stepwise selection and LASSO. The aim of the present study was to demonstrate the use predictive projection feature selection – a novel Bayesian variable selection method that delivers both predictive power and inference. We apply predictive projection to a sample of New Zealand young adults, use it to build a compact model for predicting well-being, and compare it to other variable selection methods. METHODS: The sample consisted of 791 young adults (ages 18 to 25, 71.7% female) living in Dunedin, New Zealand who had taken part in the Daily Life Study in 2013–2014. Participants completed a 13-day online daily diary assessment of their well-being and a range of lifestyle variables (e.g., sleep, physical activity, diet variables). The participants’ diary data was averaged across days and analyzed cross-sectionally to identify predictors of average flourishing. Predictive projection was used to select as few predictors as necessary to approximate the predictive accuracy of a reference model with all 28 predictors. Predictive projection was also compared to other variable selection methods, including stepwise selection and LASSO. RESULTS: Three predictors were sufficient to approximate the predictions of the reference model: higher sleep quality, less trouble concentrating, and more servings of fruit. The performance of the projected submodel generalized well. Compared to other variable selection methods, predictive projection produced models with either matching or slightly worse performance; however, this performance was achieved with much fewer predictors. CONCLUSION: Predictive projection was used to efficiently arrive at a compact model with good predictive accuracy. The predictors selected into the submodel – felt refreshed after waking up, had less trouble concentrating, and ate more servings of fruit – were all theoretically meaningful. Our findings showcase the utility of predictive projection in a practical variable selection problem. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12889-021-10690-3. |
format | Online Article Text |
id | pubmed-8033696 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-80336962021-04-09 The value of Bayesian predictive projection for variable selection: an example of selecting lifestyle predictors of young adult well-being Bartonicek, A. Wickham, S. R. Pat, N. Conner, T. S. BMC Public Health Research Article BACKGROUND: Variable selection is an important issue in many fields such as public health and psychology. Researchers often gather data on many variables of interest and then are faced with two challenging goals: building an accurate model with few predictors, and making probabilistic statements (inference) about this model. Unfortunately, it is currently difficult to attain these goals with the two most popular methods for variable selection methods: stepwise selection and LASSO. The aim of the present study was to demonstrate the use predictive projection feature selection – a novel Bayesian variable selection method that delivers both predictive power and inference. We apply predictive projection to a sample of New Zealand young adults, use it to build a compact model for predicting well-being, and compare it to other variable selection methods. METHODS: The sample consisted of 791 young adults (ages 18 to 25, 71.7% female) living in Dunedin, New Zealand who had taken part in the Daily Life Study in 2013–2014. Participants completed a 13-day online daily diary assessment of their well-being and a range of lifestyle variables (e.g., sleep, physical activity, diet variables). The participants’ diary data was averaged across days and analyzed cross-sectionally to identify predictors of average flourishing. Predictive projection was used to select as few predictors as necessary to approximate the predictive accuracy of a reference model with all 28 predictors. Predictive projection was also compared to other variable selection methods, including stepwise selection and LASSO. RESULTS: Three predictors were sufficient to approximate the predictions of the reference model: higher sleep quality, less trouble concentrating, and more servings of fruit. The performance of the projected submodel generalized well. Compared to other variable selection methods, predictive projection produced models with either matching or slightly worse performance; however, this performance was achieved with much fewer predictors. CONCLUSION: Predictive projection was used to efficiently arrive at a compact model with good predictive accuracy. The predictors selected into the submodel – felt refreshed after waking up, had less trouble concentrating, and ate more servings of fruit – were all theoretically meaningful. Our findings showcase the utility of predictive projection in a practical variable selection problem. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12889-021-10690-3. BioMed Central 2021-04-09 /pmc/articles/PMC8033696/ /pubmed/33836714 http://dx.doi.org/10.1186/s12889-021-10690-3 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Bartonicek, A. Wickham, S. R. Pat, N. Conner, T. S. The value of Bayesian predictive projection for variable selection: an example of selecting lifestyle predictors of young adult well-being |
title | The value of Bayesian predictive projection for variable selection: an example of selecting lifestyle predictors of young adult well-being |
title_full | The value of Bayesian predictive projection for variable selection: an example of selecting lifestyle predictors of young adult well-being |
title_fullStr | The value of Bayesian predictive projection for variable selection: an example of selecting lifestyle predictors of young adult well-being |
title_full_unstemmed | The value of Bayesian predictive projection for variable selection: an example of selecting lifestyle predictors of young adult well-being |
title_short | The value of Bayesian predictive projection for variable selection: an example of selecting lifestyle predictors of young adult well-being |
title_sort | value of bayesian predictive projection for variable selection: an example of selecting lifestyle predictors of young adult well-being |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8033696/ https://www.ncbi.nlm.nih.gov/pubmed/33836714 http://dx.doi.org/10.1186/s12889-021-10690-3 |
work_keys_str_mv | AT bartoniceka thevalueofbayesianpredictiveprojectionforvariableselectionanexampleofselectinglifestylepredictorsofyoungadultwellbeing AT wickhamsr thevalueofbayesianpredictiveprojectionforvariableselectionanexampleofselectinglifestylepredictorsofyoungadultwellbeing AT patn thevalueofbayesianpredictiveprojectionforvariableselectionanexampleofselectinglifestylepredictorsofyoungadultwellbeing AT connerts thevalueofbayesianpredictiveprojectionforvariableselectionanexampleofselectinglifestylepredictorsofyoungadultwellbeing AT bartoniceka valueofbayesianpredictiveprojectionforvariableselectionanexampleofselectinglifestylepredictorsofyoungadultwellbeing AT wickhamsr valueofbayesianpredictiveprojectionforvariableselectionanexampleofselectinglifestylepredictorsofyoungadultwellbeing AT patn valueofbayesianpredictiveprojectionforvariableselectionanexampleofselectinglifestylepredictorsofyoungadultwellbeing AT connerts valueofbayesianpredictiveprojectionforvariableselectionanexampleofselectinglifestylepredictorsofyoungadultwellbeing |