Picture for Xuejing Feng

Xuejing Feng

Rewarding What Matters: Step-by-Step Reinforcement Learning for Task-Oriented Dialogue

Add code
Jun 20, 2024
Figure 1 for Rewarding What Matters: Step-by-Step Reinforcement Learning for Task-Oriented Dialogue
Figure 2 for Rewarding What Matters: Step-by-Step Reinforcement Learning for Task-Oriented Dialogue
Figure 3 for Rewarding What Matters: Step-by-Step Reinforcement Learning for Task-Oriented Dialogue
Figure 4 for Rewarding What Matters: Step-by-Step Reinforcement Learning for Task-Oriented Dialogue
Viaarxiv icon