Picture for Sang Bin Moon

Sang Bin Moon

Optimistic Regret Bounds for Online Learning in Adversarial Markov Decision Processes

Add code
May 03, 2024
Viaarxiv icon