Picture for Tor Lattimore

Tor Lattimore

Online Newton Method for Bandit Convex Optimisation

Add code
Jun 10, 2024
Viaarxiv icon

Bandit Convex Optimisation

Add code
Feb 09, 2024
Viaarxiv icon

Probabilistic Inference in Reinforcement Learning Done Right

Add code
Nov 22, 2023
Viaarxiv icon

Context-lumpable stochastic bandits

Add code
Jun 22, 2023
Viaarxiv icon

Sequential Best-Arm Identification with Application to Brain-Computer Interface

Add code
May 17, 2023
Viaarxiv icon

A Second-Order Method for Stochastic Bandit Convex Optimisation

Add code
Feb 10, 2023
Viaarxiv icon

Leveraging Demonstrations to Improve Online Learning: Quality Matters

Add code
Feb 08, 2023
Viaarxiv icon

Linear Partial Monitoring for Sequential Decision-Making: Algorithms, Regret Bounds and Applications

Add code
Feb 07, 2023
Viaarxiv icon

Regret Bounds for Information-Directed Reinforcement Learning

Add code
Jun 09, 2022
Viaarxiv icon

Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost

Add code
May 26, 2022
Figure 1 for Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost
Figure 2 for Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost
Viaarxiv icon