Cargando…

Bayesian(3) Active Learning for the Gaussian Process Emulator Using Information Theory

Gaussian process emulators (GPE) are a machine learning approach that replicates computational demanding models using training runs of that model. Constructing such a surrogate is very challenging and, in the context of Bayesian inference, the training runs should be well invested. The current paper...

Descripción completa

Detalles Bibliográficos
Autores principales:	Oladyshkin, Sergey, Mohammadi, Farid, Kroeker, Ilja, Nowak, Wolfgang
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7517511/ https://www.ncbi.nlm.nih.gov/pubmed/33286660 http://dx.doi.org/10.3390/e22080890

_version_	1783587242994827264
author	Oladyshkin, Sergey Mohammadi, Farid Kroeker, Ilja Nowak, Wolfgang
author_facet	Oladyshkin, Sergey Mohammadi, Farid Kroeker, Ilja Nowak, Wolfgang
author_sort	Oladyshkin, Sergey
collection	PubMed
description	Gaussian process emulators (GPE) are a machine learning approach that replicates computational demanding models using training runs of that model. Constructing such a surrogate is very challenging and, in the context of Bayesian inference, the training runs should be well invested. The current paper offers a fully Bayesian view on GPEs for Bayesian inference accompanied by Bayesian active learning (BAL). We introduce three BAL strategies that adaptively identify training sets for the GPE using information-theoretic arguments. The first strategy relies on Bayesian model evidence that indicates the GPE’s quality of matching the measurement data, the second strategy is based on relative entropy that indicates the relative information gain for the GPE, and the third is founded on information entropy that indicates the missing information in the GPE. We illustrate the performance of our three strategies using analytical- and carbon-dioxide benchmarks. The paper shows evidence of convergence against a reference solution and demonstrates quantification of post-calibration uncertainty by comparing the introduced three strategies. We conclude that Bayesian model evidence-based and relative entropy-based strategies outperform the entropy-based strategy because the latter can be misleading during the BAL. The relative entropy-based strategy demonstrates superior performance to the Bayesian model evidence-based strategy.
format	Online Article Text
id	pubmed-7517511
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-75175112020-11-09 Bayesian(3) Active Learning for the Gaussian Process Emulator Using Information Theory Oladyshkin, Sergey Mohammadi, Farid Kroeker, Ilja Nowak, Wolfgang Entropy (Basel) Article Gaussian process emulators (GPE) are a machine learning approach that replicates computational demanding models using training runs of that model. Constructing such a surrogate is very challenging and, in the context of Bayesian inference, the training runs should be well invested. The current paper offers a fully Bayesian view on GPEs for Bayesian inference accompanied by Bayesian active learning (BAL). We introduce three BAL strategies that adaptively identify training sets for the GPE using information-theoretic arguments. The first strategy relies on Bayesian model evidence that indicates the GPE’s quality of matching the measurement data, the second strategy is based on relative entropy that indicates the relative information gain for the GPE, and the third is founded on information entropy that indicates the missing information in the GPE. We illustrate the performance of our three strategies using analytical- and carbon-dioxide benchmarks. The paper shows evidence of convergence against a reference solution and demonstrates quantification of post-calibration uncertainty by comparing the introduced three strategies. We conclude that Bayesian model evidence-based and relative entropy-based strategies outperform the entropy-based strategy because the latter can be misleading during the BAL. The relative entropy-based strategy demonstrates superior performance to the Bayesian model evidence-based strategy. MDPI 2020-08-13 /pmc/articles/PMC7517511/ /pubmed/33286660 http://dx.doi.org/10.3390/e22080890 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Oladyshkin, Sergey Mohammadi, Farid Kroeker, Ilja Nowak, Wolfgang Bayesian(3) Active Learning for the Gaussian Process Emulator Using Information Theory
title	Bayesian(3) Active Learning for the Gaussian Process Emulator Using Information Theory
title_full	Bayesian(3) Active Learning for the Gaussian Process Emulator Using Information Theory
title_fullStr	Bayesian(3) Active Learning for the Gaussian Process Emulator Using Information Theory
title_full_unstemmed	Bayesian(3) Active Learning for the Gaussian Process Emulator Using Information Theory
title_short	Bayesian(3) Active Learning for the Gaussian Process Emulator Using Information Theory
title_sort	bayesian(3) active learning for the gaussian process emulator using information theory
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7517511/ https://www.ncbi.nlm.nih.gov/pubmed/33286660 http://dx.doi.org/10.3390/e22080890
work_keys_str_mv	AT oladyshkinsergey bayesian3activelearningforthegaussianprocessemulatorusinginformationtheory AT mohammadifarid bayesian3activelearningforthegaussianprocessemulatorusinginformationtheory AT kroekerilja bayesian3activelearningforthegaussianprocessemulatorusinginformationtheory AT nowakwolfgang bayesian3activelearningforthegaussianprocessemulatorusinginformationtheory

Bayesian(3) Active Learning for the Gaussian Process Emulator Using Information Theory

Ejemplares similares