Picture for Noah Jones

Noah Jones

Human-centric Dialog Training via Offline Reinforcement Learning

Add code
Oct 12, 2020
Figure 1 for Human-centric Dialog Training via Offline Reinforcement Learning
Figure 2 for Human-centric Dialog Training via Offline Reinforcement Learning
Figure 3 for Human-centric Dialog Training via Offline Reinforcement Learning
Figure 4 for Human-centric Dialog Training via Offline Reinforcement Learning
Viaarxiv icon

Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog

Add code
Jul 08, 2019
Figure 1 for Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog
Figure 2 for Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog
Figure 3 for Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog
Figure 4 for Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog
Viaarxiv icon

Approximating Interactive Human Evaluation with Self-Play for Open-Domain Dialog Systems

Add code
Jun 21, 2019
Figure 1 for Approximating Interactive Human Evaluation with Self-Play for Open-Domain Dialog Systems
Figure 2 for Approximating Interactive Human Evaluation with Self-Play for Open-Domain Dialog Systems
Figure 3 for Approximating Interactive Human Evaluation with Self-Play for Open-Domain Dialog Systems
Figure 4 for Approximating Interactive Human Evaluation with Self-Play for Open-Domain Dialog Systems
Viaarxiv icon