Cargando…
On Gap-Based Lower Bounding Techniques for Best-Arm Identification
In this paper, we consider techniques for establishing lower bounds on the number of arm pulls for best-arm identification in the multi-armed bandit problem. While a recent divergence-based approach was shown to provide improvements over an older gap-based approach, we show that the latter can be re...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7517353/ https://www.ncbi.nlm.nih.gov/pubmed/33286559 http://dx.doi.org/10.3390/e22070788 |