Harrison Lee

RLPF: Reinforcement Learning from Prediction Feedback for User Summarization with LLMs

Sep 06, 2024

RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback

Sep 01, 2023

Conversational Recommendation as Retrieval: A Simple, Strong Baseline

May 23, 2023

AnyTOD: A Programmable Task-Oriented Dialog System

Dec 20, 2022

Show, Don't Tell: Demonstrations Outperform Descriptions for Schema-Guided Task-Oriented Dialogue

Apr 08, 2022

Description-Driven Task-Oriented Dialog Modeling

Jan 21, 2022

SGD-X: A Benchmark for Robust Generalization in Schema-Guided Dialogue Systems

Oct 13, 2021