Picture for Dirk van der Hoeven

Dirk van der Hoeven

Online Newton Method for Bandit Convex Optimisation

Add code
Jun 10, 2024
Viaarxiv icon

High-Probability Risk Bounds via Sequential Predictors

Add code
Aug 15, 2023
Viaarxiv icon

Trading-Off Payments and Accuracy in Online Classification with Paid Stochastic Experts

Add code
Jul 03, 2023
Viaarxiv icon

Delayed Bandits: When Do Intermediate Observations Help?

Add code
May 30, 2023
Viaarxiv icon

A Unified Analysis of Nonstochastic Delayed Feedback for Combinatorial Semi-Bandits, Linear Bandits, and MDPs

Add code
May 15, 2023
Viaarxiv icon

Learning on the Edge: Online Learning with Stochastic Feedback Graphs

Add code
Oct 09, 2022
Viaarxiv icon

A Regret-Variance Trade-Off in Online Learning

Add code
Jun 06, 2022
Viaarxiv icon

A Near-Optimal Best-of-Both-Worlds Algorithm for Online Learning with Feedback Graphs

Add code
Jun 01, 2022
Viaarxiv icon

Nonstochastic Bandits and Experts with Arm-Dependent Delays

Add code
Nov 02, 2021
Viaarxiv icon

Beyond Bandit Feedback in Online Multiclass Classification

Add code
Jun 07, 2021
Figure 1 for Beyond Bandit Feedback in Online Multiclass Classification
Figure 2 for Beyond Bandit Feedback in Online Multiclass Classification
Figure 3 for Beyond Bandit Feedback in Online Multiclass Classification
Figure 4 for Beyond Bandit Feedback in Online Multiclass Classification
Viaarxiv icon