Picture for Xuejing Feng

Xuejing Feng

Rewarding What Matters: Step-by-Step Reinforcement Learning for Task-Oriented Dialogue

Add code
Jun 20, 2024
Viaarxiv icon