Carlo Romeo

SPEQ: Stabilization Phases for Efficient Q-Learning in High Update-To-Data Ratio Reinforcement Learning

Jan 15, 2025
Via arXiv

Offline Reinforcement Learning with Imputed Rewards

Jul 15, 2024
Via arXiv