Picture for Benjamin Howson

Benjamin Howson

QuACK: A Multipurpose Queuing Algorithm for Cooperative $k$-Armed Bandits

Add code
Oct 31, 2024
Viaarxiv icon

DISCO: An End-to-End Bandit Framework for Personalised Discount Allocation

Add code
Jun 11, 2024
Viaarxiv icon

Delayed Feedback in Generalised Linear Bandits Revisited

Add code
Jul 25, 2022
Figure 1 for Delayed Feedback in Generalised Linear Bandits Revisited
Figure 2 for Delayed Feedback in Generalised Linear Bandits Revisited
Viaarxiv icon

Delayed Feedback in Episodic Reinforcement Learning

Add code
Nov 15, 2021
Figure 1 for Delayed Feedback in Episodic Reinforcement Learning
Viaarxiv icon