Yaocheng Zhang

In-Dataset Trajectory Return Regularization for Offline Preference-based Reinforcement Learning

Dec 12, 2024
Via arXiv