Cargando…

Learning from a lot: Empirical Bayes for high‐dimensional model‐based prediction

Empirical Bayes is a versatile approach to “learn from a lot” in two ways: first, from a large number of variables and, second, from a potentially large amount of prior information, for example, stored in public repositories. We review applications of a variety of empirical Bayes methods to several...

Descripción completa

Detalles Bibliográficos
Autores principales:	van de Wiel, Mark A., Te Beest, Dennis E., Münch, Magnus M.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	John Wiley and Sons Inc. 2018
Materias:	Original Articles
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6472625/ https://www.ncbi.nlm.nih.gov/pubmed/31007342 http://dx.doi.org/10.1111/sjos.12335

_version_	1783412277942157312
author	van de Wiel, Mark A. Te Beest, Dennis E. Münch, Magnus M.
author_facet	van de Wiel, Mark A. Te Beest, Dennis E. Münch, Magnus M.
author_sort	van de Wiel, Mark A.
collection	PubMed
description	Empirical Bayes is a versatile approach to “learn from a lot” in two ways: first, from a large number of variables and, second, from a potentially large amount of prior information, for example, stored in public repositories. We review applications of a variety of empirical Bayes methods to several well‐known model‐based prediction methods, including penalized regression, linear discriminant analysis, and Bayesian models with sparse or dense priors. We discuss “formal” empirical Bayes methods that maximize the marginal likelihood but also more informal approaches based on other data summaries. We contrast empirical Bayes to cross‐validation and full Bayes and discuss hybrid approaches. To study the relation between the quality of an empirical Bayes estimator and p, the number of variables, we consider a simple empirical Bayes estimator in a linear model setting. We argue that empirical Bayes is particularly useful when the prior contains multiple parameters, which model a priori information on variables termed “co‐data”. In particular, we present two novel examples that allow for co‐data: first, a Bayesian spike‐and‐slab setting that facilitates inclusion of multiple co‐data sources and types and, second, a hybrid empirical Bayes–full Bayes ridge regression approach for estimation of the posterior predictive interval.
format	Online Article Text
id	pubmed-6472625
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	John Wiley and Sons Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-64726252019-04-19 Learning from a lot: Empirical Bayes for high‐dimensional model‐based prediction van de Wiel, Mark A. Te Beest, Dennis E. Münch, Magnus M. Scand Stat Theory Appl Original Articles Empirical Bayes is a versatile approach to “learn from a lot” in two ways: first, from a large number of variables and, second, from a potentially large amount of prior information, for example, stored in public repositories. We review applications of a variety of empirical Bayes methods to several well‐known model‐based prediction methods, including penalized regression, linear discriminant analysis, and Bayesian models with sparse or dense priors. We discuss “formal” empirical Bayes methods that maximize the marginal likelihood but also more informal approaches based on other data summaries. We contrast empirical Bayes to cross‐validation and full Bayes and discuss hybrid approaches. To study the relation between the quality of an empirical Bayes estimator and p, the number of variables, we consider a simple empirical Bayes estimator in a linear model setting. We argue that empirical Bayes is particularly useful when the prior contains multiple parameters, which model a priori information on variables termed “co‐data”. In particular, we present two novel examples that allow for co‐data: first, a Bayesian spike‐and‐slab setting that facilitates inclusion of multiple co‐data sources and types and, second, a hybrid empirical Bayes–full Bayes ridge regression approach for estimation of the posterior predictive interval. John Wiley and Sons Inc. 2018-06-01 2019-03 /pmc/articles/PMC6472625/ /pubmed/31007342 http://dx.doi.org/10.1111/sjos.12335 Text en © 2018 The Authors Scandinavian Journal of Statistics published by John Wiley & Sons Ltd on behalf of The Board of the Foundation of the Scandinavian Journal of Statistics This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Original Articles van de Wiel, Mark A. Te Beest, Dennis E. Münch, Magnus M. Learning from a lot: Empirical Bayes for high‐dimensional model‐based prediction
title	Learning from a lot: Empirical Bayes for high‐dimensional model‐based prediction
title_full	Learning from a lot: Empirical Bayes for high‐dimensional model‐based prediction
title_fullStr	Learning from a lot: Empirical Bayes for high‐dimensional model‐based prediction
title_full_unstemmed	Learning from a lot: Empirical Bayes for high‐dimensional model‐based prediction
title_short	Learning from a lot: Empirical Bayes for high‐dimensional model‐based prediction
title_sort	learning from a lot: empirical bayes for high‐dimensional model‐based prediction
topic	Original Articles
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6472625/ https://www.ncbi.nlm.nih.gov/pubmed/31007342 http://dx.doi.org/10.1111/sjos.12335
work_keys_str_mv	AT vandewielmarka learningfromalotempiricalbayesforhighdimensionalmodelbasedprediction AT tebeestdennise learningfromalotempiricalbayesforhighdimensionalmodelbasedprediction AT munchmagnusm learningfromalotempiricalbayesforhighdimensionalmodelbasedprediction

Learning from a lot: Empirical Bayes for high‐dimensional model‐based prediction

Ejemplares similares