Cargando…

Bayesian(3) Active Learning for the Gaussian Process Emulator Using Information Theory

Gaussian process emulators (GPE) are a machine learning approach that replicates computational demanding models using training runs of that model. Constructing such a surrogate is very challenging and, in the context of Bayesian inference, the training runs should be well invested. The current paper...

Descripción completa

Detalles Bibliográficos
Autores principales: Oladyshkin, Sergey, Mohammadi, Farid, Kroeker, Ilja, Nowak, Wolfgang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7517511/
https://www.ncbi.nlm.nih.gov/pubmed/33286660
http://dx.doi.org/10.3390/e22080890
_version_ 1783587242994827264
author Oladyshkin, Sergey
Mohammadi, Farid
Kroeker, Ilja
Nowak, Wolfgang
author_facet Oladyshkin, Sergey
Mohammadi, Farid
Kroeker, Ilja
Nowak, Wolfgang
author_sort Oladyshkin, Sergey
collection PubMed
description Gaussian process emulators (GPE) are a machine learning approach that replicates computational demanding models using training runs of that model. Constructing such a surrogate is very challenging and, in the context of Bayesian inference, the training runs should be well invested. The current paper offers a fully Bayesian view on GPEs for Bayesian inference accompanied by Bayesian active learning (BAL). We introduce three BAL strategies that adaptively identify training sets for the GPE using information-theoretic arguments. The first strategy relies on Bayesian model evidence that indicates the GPE’s quality of matching the measurement data, the second strategy is based on relative entropy that indicates the relative information gain for the GPE, and the third is founded on information entropy that indicates the missing information in the GPE. We illustrate the performance of our three strategies using analytical- and carbon-dioxide benchmarks. The paper shows evidence of convergence against a reference solution and demonstrates quantification of post-calibration uncertainty by comparing the introduced three strategies. We conclude that Bayesian model evidence-based and relative entropy-based strategies outperform the entropy-based strategy because the latter can be misleading during the BAL. The relative entropy-based strategy demonstrates superior performance to the Bayesian model evidence-based strategy.
format Online
Article
Text
id pubmed-7517511
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75175112020-11-09 Bayesian(3) Active Learning for the Gaussian Process Emulator Using Information Theory Oladyshkin, Sergey Mohammadi, Farid Kroeker, Ilja Nowak, Wolfgang Entropy (Basel) Article Gaussian process emulators (GPE) are a machine learning approach that replicates computational demanding models using training runs of that model. Constructing such a surrogate is very challenging and, in the context of Bayesian inference, the training runs should be well invested. The current paper offers a fully Bayesian view on GPEs for Bayesian inference accompanied by Bayesian active learning (BAL). We introduce three BAL strategies that adaptively identify training sets for the GPE using information-theoretic arguments. The first strategy relies on Bayesian model evidence that indicates the GPE’s quality of matching the measurement data, the second strategy is based on relative entropy that indicates the relative information gain for the GPE, and the third is founded on information entropy that indicates the missing information in the GPE. We illustrate the performance of our three strategies using analytical- and carbon-dioxide benchmarks. The paper shows evidence of convergence against a reference solution and demonstrates quantification of post-calibration uncertainty by comparing the introduced three strategies. We conclude that Bayesian model evidence-based and relative entropy-based strategies outperform the entropy-based strategy because the latter can be misleading during the BAL. The relative entropy-based strategy demonstrates superior performance to the Bayesian model evidence-based strategy. MDPI 2020-08-13 /pmc/articles/PMC7517511/ /pubmed/33286660 http://dx.doi.org/10.3390/e22080890 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Oladyshkin, Sergey
Mohammadi, Farid
Kroeker, Ilja
Nowak, Wolfgang
Bayesian(3) Active Learning for the Gaussian Process Emulator Using Information Theory
title Bayesian(3) Active Learning for the Gaussian Process Emulator Using Information Theory
title_full Bayesian(3) Active Learning for the Gaussian Process Emulator Using Information Theory
title_fullStr Bayesian(3) Active Learning for the Gaussian Process Emulator Using Information Theory
title_full_unstemmed Bayesian(3) Active Learning for the Gaussian Process Emulator Using Information Theory
title_short Bayesian(3) Active Learning for the Gaussian Process Emulator Using Information Theory
title_sort bayesian(3) active learning for the gaussian process emulator using information theory
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7517511/
https://www.ncbi.nlm.nih.gov/pubmed/33286660
http://dx.doi.org/10.3390/e22080890
work_keys_str_mv AT oladyshkinsergey bayesian3activelearningforthegaussianprocessemulatorusinginformationtheory
AT mohammadifarid bayesian3activelearningforthegaussianprocessemulatorusinginformationtheory
AT kroekerilja bayesian3activelearningforthegaussianprocessemulatorusinginformationtheory
AT nowakwolfgang bayesian3activelearningforthegaussianprocessemulatorusinginformationtheory