Emmeran Johnson

Sample-Efficiency in Multi-Batch Reinforcement Learning: The Need for Dimension-Dependent Adaptivity

Oct 02, 2023
Via arXiv

Optimal Convergence Rate for Exact Policy Mirror Descent in Discounted Markov Decision Processes

Feb 22, 2023
Via arXiv