Picture for Miroslav Štrupl

Miroslav Štrupl

On the Convergence and Stability of Upside-Down Reinforcement Learning, Goal-Conditioned Supervised Learning, and Online Decision Transformers

Add code
Feb 08, 2025
Viaarxiv icon

Upside-Down Reinforcement Learning Can Diverge in Stochastic Environments With Episodic Resets

Add code
May 13, 2022
Figure 1 for Upside-Down Reinforcement Learning Can Diverge in Stochastic Environments With Episodic Resets
Viaarxiv icon

Reward-Weighted Regression Converges to a Global Optimum

Add code
Jul 19, 2021
Figure 1 for Reward-Weighted Regression Converges to a Global Optimum
Figure 2 for Reward-Weighted Regression Converges to a Global Optimum
Viaarxiv icon