An Analysis of the Value of Information When Exploring Stochastic, Discrete Multi-Armed Bandits
In this paper, we propose an information-theoretic exploration strategy for stochastic, discrete multi-armed bandits that achieves optimal regret. Our strategy is based on the value of information criterion. This criterion measures the trade-off between policy information and obtainable rewards. Hig...
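The criterion described in the abstract trades off how much information a policy carries against the rewards it can obtain. As a rough illustration only, and not the authors' exact formulation, the sketch below simulates a Bernoulli bandit with a soft-max policy whose inverse-temperature parameter plays the role of that trade-off: low values keep the policy near uniform (low policy information, more exploration), while high values concentrate it on the empirically best arm (high policy information, more exploitation). The environment, parameter names, and update rule are assumptions made for illustration.

```python
# Illustrative sketch only: a soft-max ("Boltzmann") exploration policy for a
# Bernoulli multi-armed bandit. The inverse temperature `beta` stands in for the
# information/reward trade-off discussed in the abstract; it is NOT the paper's
# exact value-of-information criterion.
import numpy as np

def softmax_bandit(true_means, beta=5.0, horizon=10_000, seed=0):
    rng = np.random.default_rng(seed)
    k = len(true_means)
    counts = np.zeros(k)          # pulls per arm
    estimates = np.zeros(k)       # empirical mean reward per arm
    total_reward = 0.0

    for _ in range(horizon):
        # Soft-max policy: the probability of each arm grows exponentially with
        # its estimated value; beta controls how peaked (informative) it is.
        logits = beta * estimates
        logits -= logits.max()                          # numerical stability
        probs = np.exp(logits) / np.exp(logits).sum()

        arm = rng.choice(k, p=probs)
        reward = float(rng.random() < true_means[arm])  # Bernoulli reward

        # Incremental update of the empirical mean for the chosen arm.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    regret = horizon * max(true_means) - total_reward
    return total_reward, regret

if __name__ == "__main__":
    # Low beta -> near-uniform (low-information) policy; high beta -> greedy.
    for beta in (0.5, 5.0, 50.0):
        _, regret = softmax_bandit([0.2, 0.5, 0.7], beta=beta)
        print(f"beta={beta:>5}: empirical regret ~ {regret:.1f}")
```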
| Main Authors: | Sledge, Isaac J.; Príncipe, José C. |
|---|---|
| Format: | Online Article Text |
| Language: | English |
| Published: | MDPI, 2018 |
| Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7512671/ · https://www.ncbi.nlm.nih.gov/pubmed/33265246 · http://dx.doi.org/10.3390/e20030155 |
Similar Items
- Introduction to multi-armed bandits
  by: Slivkins, Aleksandrs
  Published: (2019)
- Multi-Armed Bandits in Brain-Computer Interfaces
  by: Heskebeck, Frida, et al.
  Published: (2022)
- Some performance considerations when using multi-armed bandit algorithms in the presence of missing data
  by: Chen, Xijin, et al.
  Published: (2022)
- The Perils of Misspecified Priors and Optional Stopping in Multi-Armed Bandits
  by: Loecher, Markus
  Published: (2021)
- Arm order recognition in multi-armed bandit problem with laser chaos time series
  by: Narisawa, Naoki, et al.
  Published: (2021)