Picture for Julian Zimmert

Julian Zimmert

Beating Adversarial Low-Rank MDPs with Unknown Transition and Bandit Feedback

Add code
Nov 11, 2024
Viaarxiv icon

Incentive-compatible Bandits: Importance Weighting No More

Add code
May 10, 2024
Viaarxiv icon

Optimal cross-learning for contextual bandits with unknown context distributions

Add code
Jan 03, 2024
Viaarxiv icon

Towards Optimal Regret in Adversarial Linear MDPs with Bandit Feedback

Add code
Oct 17, 2023
Viaarxiv icon

Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual Bandits

Add code
Sep 02, 2023
Viaarxiv icon

An Improved Best-of-both-worlds Algorithm for Bandits with Delayed Feedback

Add code
Aug 21, 2023
Viaarxiv icon

A Blackbox Approach to Best of Both Worlds in Bandits and Beyond

Add code
Feb 20, 2023
Viaarxiv icon

Best of Both Worlds Policy Optimization

Add code
Feb 18, 2023
Viaarxiv icon

Refined Regret for Adversarial MDPs with Linear Function Approximation

Add code
Jan 30, 2023
Viaarxiv icon

A Unified Algorithm for Stochastic Path Problems

Add code
Oct 17, 2022
Figure 1 for A Unified Algorithm for Stochastic Path Problems
Figure 2 for A Unified Algorithm for Stochastic Path Problems
Viaarxiv icon