Picture for Miroslav Štrupl

Miroslav Štrupl

Upside-Down Reinforcement Learning Can Diverge in Stochastic Environments With Episodic Resets

Add code
May 13, 2022
Figure 1 for Upside-Down Reinforcement Learning Can Diverge in Stochastic Environments With Episodic Resets
Viaarxiv icon

Reward-Weighted Regression Converges to a Global Optimum

Add code
Jul 19, 2021
Figure 1 for Reward-Weighted Regression Converges to a Global Optimum
Figure 2 for Reward-Weighted Regression Converges to a Global Optimum
Viaarxiv icon