Cargando…

Adaptive penalization in high-dimensional regression and classification with external covariates using variational Bayes

Penalization schemes like Lasso or ridge regression are routinely used to regress a response of interest on a high-dimensional set of potential predictors. Despite being decisive, the question of the relative strength of penalization is often glossed over and only implicitly determined by the scale...

Descripción completa

Detalles Bibliográficos
Autores principales: Velten, Britta, Huber, Wolfgang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8036004/
https://www.ncbi.nlm.nih.gov/pubmed/31596468
http://dx.doi.org/10.1093/biostatistics/kxz034
_version_ 1783676814514716672
author Velten, Britta
Huber, Wolfgang
author_facet Velten, Britta
Huber, Wolfgang
author_sort Velten, Britta
collection PubMed
description Penalization schemes like Lasso or ridge regression are routinely used to regress a response of interest on a high-dimensional set of potential predictors. Despite being decisive, the question of the relative strength of penalization is often glossed over and only implicitly determined by the scale of individual predictors. At the same time, additional information on the predictors is available in many applications but left unused. Here, we propose to make use of such external covariates to adapt the penalization in a data-driven manner. We present a method that differentially penalizes feature groups defined by the covariates and adapts the relative strength of penalization to the information content of each group. Using techniques from the Bayesian tool-set our procedure combines shrinkage with feature selection and provides a scalable optimization scheme. We demonstrate in simulations that the method accurately recovers the true effect sizes and sparsity patterns per feature group. Furthermore, it leads to an improved prediction performance in situations where the groups have strong differences in dynamic range. In applications to data from high-throughput biology, the method enables re-weighting the importance of feature groups from different assays. Overall, using available covariates extends the range of applications of penalized regression, improves model interpretability and can improve prediction performance.
format Online
Article
Text
id pubmed-8036004
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-80360042021-04-14 Adaptive penalization in high-dimensional regression and classification with external covariates using variational Bayes Velten, Britta Huber, Wolfgang Biostatistics Articles Penalization schemes like Lasso or ridge regression are routinely used to regress a response of interest on a high-dimensional set of potential predictors. Despite being decisive, the question of the relative strength of penalization is often glossed over and only implicitly determined by the scale of individual predictors. At the same time, additional information on the predictors is available in many applications but left unused. Here, we propose to make use of such external covariates to adapt the penalization in a data-driven manner. We present a method that differentially penalizes feature groups defined by the covariates and adapts the relative strength of penalization to the information content of each group. Using techniques from the Bayesian tool-set our procedure combines shrinkage with feature selection and provides a scalable optimization scheme. We demonstrate in simulations that the method accurately recovers the true effect sizes and sparsity patterns per feature group. Furthermore, it leads to an improved prediction performance in situations where the groups have strong differences in dynamic range. In applications to data from high-throughput biology, the method enables re-weighting the importance of feature groups from different assays. Overall, using available covariates extends the range of applications of penalized regression, improves model interpretability and can improve prediction performance. Oxford University Press 2019-10-09 /pmc/articles/PMC8036004/ /pubmed/31596468 http://dx.doi.org/10.1093/biostatistics/kxz034 Text en © The Author 2019. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
Velten, Britta
Huber, Wolfgang
Adaptive penalization in high-dimensional regression and classification with external covariates using variational Bayes
title Adaptive penalization in high-dimensional regression and classification with external covariates using variational Bayes
title_full Adaptive penalization in high-dimensional regression and classification with external covariates using variational Bayes
title_fullStr Adaptive penalization in high-dimensional regression and classification with external covariates using variational Bayes
title_full_unstemmed Adaptive penalization in high-dimensional regression and classification with external covariates using variational Bayes
title_short Adaptive penalization in high-dimensional regression and classification with external covariates using variational Bayes
title_sort adaptive penalization in high-dimensional regression and classification with external covariates using variational bayes
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8036004/
https://www.ncbi.nlm.nih.gov/pubmed/31596468
http://dx.doi.org/10.1093/biostatistics/kxz034
work_keys_str_mv AT veltenbritta adaptivepenalizationinhighdimensionalregressionandclassificationwithexternalcovariatesusingvariationalbayes
AT huberwolfgang adaptivepenalizationinhighdimensionalregressionandclassificationwithexternalcovariatesusingvariationalbayes