Picture for Zeqiu Wu

Zeqiu Wu

Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback

Add code
Jun 13, 2024
Viaarxiv icon

Training Language Models to Generate Text with Citations via Fine-grained Rewards

Add code
Feb 06, 2024
Viaarxiv icon

Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

Add code
Oct 17, 2023
Viaarxiv icon

DIALGEN: Collaborative Human-LM Generated Dialogues for Improved Understanding of Human-Human Conversations

Add code
Jul 13, 2023
Figure 1 for DIALGEN: Collaborative Human-LM Generated Dialogues for Improved Understanding of Human-Human Conversations
Figure 2 for DIALGEN: Collaborative Human-LM Generated Dialogues for Improved Understanding of Human-Human Conversations
Figure 3 for DIALGEN: Collaborative Human-LM Generated Dialogues for Improved Understanding of Human-Human Conversations
Figure 4 for DIALGEN: Collaborative Human-LM Generated Dialogues for Improved Understanding of Human-Human Conversations
Viaarxiv icon

Fine-Grained Human Feedback Gives Better Rewards for Language Model Training

Add code
Jun 02, 2023
Viaarxiv icon

INSCIT: Information-Seeking Conversations with Mixed-Initiative Interactions

Add code
Jul 02, 2022
Figure 1 for INSCIT: Information-Seeking Conversations with Mixed-Initiative Interactions
Figure 2 for INSCIT: Information-Seeking Conversations with Mixed-Initiative Interactions
Figure 3 for INSCIT: Information-Seeking Conversations with Mixed-Initiative Interactions
Figure 4 for INSCIT: Information-Seeking Conversations with Mixed-Initiative Interactions
Viaarxiv icon

CONQRR: Conversational Query Rewriting for Retrieval with Reinforcement Learning

Add code
Dec 16, 2021
Figure 1 for CONQRR: Conversational Query Rewriting for Retrieval with Reinforcement Learning
Figure 2 for CONQRR: Conversational Query Rewriting for Retrieval with Reinforcement Learning
Figure 3 for CONQRR: Conversational Query Rewriting for Retrieval with Reinforcement Learning
Figure 4 for CONQRR: Conversational Query Rewriting for Retrieval with Reinforcement Learning
Viaarxiv icon

DIALKI: Knowledge Identification in Conversational Systems through Dialogue-Document Contextualization

Add code
Sep 10, 2021
Figure 1 for DIALKI: Knowledge Identification in Conversational Systems through Dialogue-Document Contextualization
Figure 2 for DIALKI: Knowledge Identification in Conversational Systems through Dialogue-Document Contextualization
Figure 3 for DIALKI: Knowledge Identification in Conversational Systems through Dialogue-Document Contextualization
Figure 4 for DIALKI: Knowledge Identification in Conversational Systems through Dialogue-Document Contextualization
Viaarxiv icon

Automatic Document Sketching: Generating Drafts from Analogous Texts

Add code
Jun 14, 2021
Figure 1 for Automatic Document Sketching: Generating Drafts from Analogous Texts
Figure 2 for Automatic Document Sketching: Generating Drafts from Analogous Texts
Figure 3 for Automatic Document Sketching: Generating Drafts from Analogous Texts
Figure 4 for Automatic Document Sketching: Generating Drafts from Analogous Texts
Viaarxiv icon

Extracting Summary Knowledge Graphs from Long Documents

Add code
Sep 19, 2020
Figure 1 for Extracting Summary Knowledge Graphs from Long Documents
Figure 2 for Extracting Summary Knowledge Graphs from Long Documents
Figure 3 for Extracting Summary Knowledge Graphs from Long Documents
Figure 4 for Extracting Summary Knowledge Graphs from Long Documents
Viaarxiv icon