Picture for Yangyang Zhao

Yangyang Zhao

Rescue Conversations from Dead-ends: Efficient Exploration for Task-oriented Dialogue Policy Optimization

Add code
May 05, 2023
Viaarxiv icon

Automatic Curriculum Learning With Over-repetition Penalty for Dialogue Policy Learning

Add code
Dec 28, 2020
Figure 1 for Automatic Curriculum Learning With Over-repetition Penalty for Dialogue Policy Learning
Figure 2 for Automatic Curriculum Learning With Over-repetition Penalty for Dialogue Policy Learning
Figure 3 for Automatic Curriculum Learning With Over-repetition Penalty for Dialogue Policy Learning
Figure 4 for Automatic Curriculum Learning With Over-repetition Penalty for Dialogue Policy Learning
Viaarxiv icon