Claas A Voelcker

Can we hop in general? A discussion of benchmark selection and design using the Hopper environment

Oct 11, 2024
Via arXiv

MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL

Oct 11, 2024
Via arXiv

$λ$-AC: Learning latent decision-aware models for reinforcement learning in continuous state-spaces

Jun 30, 2023
Via arXiv