Picture for Itai Shufaro

Itai Shufaro

On Bits and Bandits: Quantifying the Regret-Information Trade-off

Add code
May 26, 2024
Viaarxiv icon

On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes

Add code
Mar 11, 2024
Viaarxiv icon