The Perils of Misspecified Priors and Optional Stopping in Multi-Armed Bandits
The connection between optimal stopping times of American Options and multi-armed bandits is the subject of active research. This article investigates the effects of optional stopping in a particular class of multi-armed bandit experiments, which randomly allocates observations to arms proportional...
Main Author: | Loecher, Markus |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Frontiers Media S.A. 2021 |
Subjects: | Artificial Intelligence |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8299077/ https://www.ncbi.nlm.nih.gov/pubmed/34308342 http://dx.doi.org/10.3389/frai.2021.715690 |
_version_ | 1783726192332898304 |
---|---|
author | Loecher, Markus |
author_facet | Loecher, Markus |
author_sort | Loecher, Markus |
collection | PubMed |
description | The connection between optimal stopping times of American Options and multi-armed bandits is the subject of active research. This article investigates the effects of optional stopping in a particular class of multi-armed bandit experiments, which randomly allocates observations to arms proportional to the Bayesian posterior probability that each arm is optimal (Thompson sampling). The interplay between optional stopping and prior mismatch is examined. We propose a novel partitioning of regret into peri/post testing. We further show a strong dependence of the parameters of interest on the assumed prior probability density. |
format | Online Article Text |
id | pubmed-8299077 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-8299077 2021-07-24 The Perils of Misspecified Priors and Optional Stopping in Multi-Armed Bandits Loecher, Markus Front Artif Intell Artificial Intelligence The connection between optimal stopping times of American Options and multi-armed bandits is the subject of active research. This article investigates the effects of optional stopping in a particular class of multi-armed bandit experiments, which randomly allocates observations to arms proportional to the Bayesian posterior probability that each arm is optimal (Thompson sampling). The interplay between optional stopping and prior mismatch is examined. We propose a novel partitioning of regret into peri/post testing. We further show a strong dependence of the parameters of interest on the assumed prior probability density. Frontiers Media S.A. 2021-07-09 /pmc/articles/PMC8299077/ /pubmed/34308342 http://dx.doi.org/10.3389/frai.2021.715690 Text en Copyright © 2021 Loecher. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
title | The Perils of Misspecified Priors and Optional Stopping in Multi-Armed Bandits |
topic | Artificial Intelligence |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8299077/ https://www.ncbi.nlm.nih.gov/pubmed/34308342 http://dx.doi.org/10.3389/frai.2021.715690 |
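
The description above summarizes Thompson sampling as allocating observations to arms in proportion to the posterior probability that each arm is optimal, combined with optional stopping. The following is a minimal sketch of that idea, not the article's own code. It assumes Bernoulli rewards with a Beta prior on each arm; the function names, the 0.95 stopping threshold, the check interval, and the Monte Carlo draw count are all hypothetical choices made for illustration.

```python
import random


def thompson_step(successes, failures, prior_a=1.0, prior_b=1.0, rng=random):
    """Draw one sample per arm from its Beta posterior; return the index of the largest draw.

    Picking the argmax of posterior draws selects each arm with probability
    equal to the posterior probability that it is the best arm.
    """
    draws = [
        rng.betavariate(prior_a + s, prior_b + f)
        for s, f in zip(successes, failures)
    ]
    return max(range(len(draws)), key=draws.__getitem__)


def prob_optimal(successes, failures, prior_a=1.0, prior_b=1.0,
                 n_draws=2000, rng=random):
    """Monte Carlo estimate of the posterior probability that each arm is optimal."""
    wins = [0] * len(successes)
    for _ in range(n_draws):
        wins[thompson_step(successes, failures, prior_a, prior_b, rng)] += 1
    return [w / n_draws for w in wins]


def run_bandit(true_rates, max_steps=2000, stop_threshold=0.95,
               check_every=50, prior_a=1.0, prior_b=1.0, seed=0):
    """Run Thompson sampling with an (assumed) optional stopping rule.

    The experiment stops early once some arm's estimated probability of being
    optimal reaches stop_threshold. prior_a and prior_b set the assumed Beta
    prior, so changing them is one way to explore prior mismatch.
    """
    rng = random.Random(seed)
    k = len(true_rates)
    successes, failures = [0] * k, [0] * k
    for t in range(1, max_steps + 1):
        arm = thompson_step(successes, failures, prior_a, prior_b, rng)
        if rng.random() < true_rates[arm]:
            successes[arm] += 1
        else:
            failures[arm] += 1
        # Peek at the posterior only every check_every steps.
        if t % check_every == 0:
            p = prob_optimal(successes, failures, prior_a, prior_b, rng=rng)
            if max(p) >= stop_threshold:
                return t, p, successes, failures
    p = prob_optimal(successes, failures, prior_a, prior_b, rng=rng)
    return max_steps, p, successes, failures


if __name__ == "__main__":
    steps, p, s, f = run_bandit([0.10, 0.12], prior_a=1.0, prior_b=1.0)
    print("stopped after", steps, "steps; P(optimal) per arm:", p)
```

Rerunning `run_bandit` with different `prior_a` and `prior_b` values (for example a prior concentrated far from the true rates) and comparing the stopping times is one simple way to probe the dependence on the assumed prior that the description mentions.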