Picture for Kevin Jamieson

Kevin Jamieson

Batched Stochastic Linear Bandits with 1-Bit Communication Constraints

Add code
May 29, 2026
Viaarxiv icon

Near-Optimal Regret in Adversarial Kernel Bandits

Add code
May 26, 2026
Viaarxiv icon

Optimal Posterior Sampling for Policy Identification in Tabular Markov Decision Processes

Add code
May 05, 2026
Viaarxiv icon

Minimax Optimal Strategy for Delayed Observations in Online Reinforcement Learning

Add code
Mar 03, 2026
Viaarxiv icon

Revisiting the Bertrand Paradox via Equilibrium Analysis of No-regret Learners

Add code
Feb 25, 2026
Viaarxiv icon

Efficient Uncoupled Learning Dynamics with $\tilde{O}\!\left(T^{-1/4}\right)$ Last-Iterate Convergence in Bilinear Saddle-Point Problems over Convex Sets under Bandit Feedback

Add code
Feb 24, 2026
Viaarxiv icon

Improved Regret Bounds for Linear Bandits with Heavy-Tailed Rewards

Add code
Jun 05, 2025
Figure 1 for Improved Regret Bounds for Linear Bandits with Heavy-Tailed Rewards
Figure 2 for Improved Regret Bounds for Linear Bandits with Heavy-Tailed Rewards
Figure 3 for Improved Regret Bounds for Linear Bandits with Heavy-Tailed Rewards
Viaarxiv icon

Learning to Incentivize in Repeated Principal-Agent Problems with Adversarial Agent Arrivals

Add code
May 29, 2025
Viaarxiv icon

Stow: Robotic Packing of Items into Fabric Pods

Add code
May 07, 2025
Figure 1 for Stow: Robotic Packing of Items into Fabric Pods
Figure 2 for Stow: Robotic Packing of Items into Fabric Pods
Figure 3 for Stow: Robotic Packing of Items into Fabric Pods
Figure 4 for Stow: Robotic Packing of Items into Fabric Pods
Viaarxiv icon

Efficient Near-Optimal Algorithm for Online Shortest Paths in Directed Acyclic Graphs with Bandit Feedback Against Adaptive Adversaries

Add code
Apr 01, 2025
Viaarxiv icon