Picture for Tianchen Zhu

Tianchen Zhu

Beyond Human Preferences: Exploring Reinforcement Learning Trajectory Evaluation and Improvement through LLMs

Add code
Jun 28, 2024
Viaarxiv icon