Tor Lattimore

Refined Detection for Gumbel Watermarking

Mar 31, 2026
Via arXiv

A Lyapunov Analysis of Softmax Policy Gradient for Stochastic Bandits

Mar 27, 2026
Via arXiv

A Diffusion Analysis of Policy Gradient for Stochastic Bandits

Mar 10, 2026
Via arXiv

Online Newton Method for Bandit Convex Optimisation

Jun 10, 2024
Via arXiv

Bandit Convex Optimisation

Feb 09, 2024
Via arXiv

Probabilistic Inference in Reinforcement Learning Done Right

Nov 22, 2023
Via arXiv

Context-lumpable stochastic bandits

Jun 22, 2023
Via arXiv

Sequential Best-Arm Identification with Application to Brain-Computer Interface

May 17, 2023
Via arXiv

A Second-Order Method for Stochastic Bandit Convex Optimisation

Feb 10, 2023
Via arXiv

Leveraging Demonstrations to Improve Online Learning: Quality Matters

Feb 08, 2023
Via arXiv