Picture for Ciara Pike-Burke

Ciara Pike-Burke

QuACK: A Multipurpose Queuing Algorithm for Cooperative $k$-Armed Bandits

Add code
Oct 31, 2024
Viaarxiv icon

Sample-Efficiency in Multi-Batch Reinforcement Learning: The Need for Dimension-Dependent Adaptivity

Add code
Oct 02, 2023
Viaarxiv icon

Trading-Off Payments and Accuracy in Online Classification with Paid Stochastic Experts

Add code
Jul 03, 2023
Viaarxiv icon

Optimal Convergence Rate for Exact Policy Mirror Descent in Discounted Markov Decision Processes

Add code
Feb 22, 2023
Figure 1 for Optimal Convergence Rate for Exact Policy Mirror Descent in Discounted Markov Decision Processes
Figure 2 for Optimal Convergence Rate for Exact Policy Mirror Descent in Discounted Markov Decision Processes
Viaarxiv icon

Delayed Feedback in Kernel Bandits

Add code
Feb 01, 2023
Viaarxiv icon

Delayed Feedback in Generalised Linear Bandits Revisited

Add code
Jul 25, 2022
Figure 1 for Delayed Feedback in Generalised Linear Bandits Revisited
Figure 2 for Delayed Feedback in Generalised Linear Bandits Revisited
Viaarxiv icon

Bandit problems with fidelity rewards

Add code
Nov 25, 2021
Figure 1 for Bandit problems with fidelity rewards
Figure 2 for Bandit problems with fidelity rewards
Figure 3 for Bandit problems with fidelity rewards
Figure 4 for Bandit problems with fidelity rewards
Viaarxiv icon

Delayed Feedback in Episodic Reinforcement Learning

Add code
Nov 15, 2021
Figure 1 for Delayed Feedback in Episodic Reinforcement Learning
Viaarxiv icon

Local Differentially Private Regret Minimization in Reinforcement Learning

Add code
Oct 15, 2020
Figure 1 for Local Differentially Private Regret Minimization in Reinforcement Learning
Figure 2 for Local Differentially Private Regret Minimization in Reinforcement Learning
Figure 3 for Local Differentially Private Regret Minimization in Reinforcement Learning
Figure 4 for Local Differentially Private Regret Minimization in Reinforcement Learning
Viaarxiv icon

A Unifying View of Optimism in Episodic Reinforcement Learning

Add code
Jul 03, 2020
Figure 1 for A Unifying View of Optimism in Episodic Reinforcement Learning
Viaarxiv icon