Picture for Kaihe Xu

Kaihe Xu

Reinforcement Learning with Token-level Feedback for Controllable Text Generation

Add code
Mar 18, 2024
Viaarxiv icon