Chak Tou Leong

Direct Preference Optimization Using Sparse Feature-Level Constraints

Nov 12, 2024
Via arXiv

Subtle Errors Matter: Preference Learning via Error-injected Self-editing

Oct 09, 2024
Via arXiv

Deeper Insights Without Updates: The Power of In-Context Learning Over Fine-Tuning

Oct 07, 2024
Via arXiv

E2CL: Exploration-based Error Correction Learning for Embodied Agents

Sep 05, 2024
Via arXiv

Evolving to be Your Soulmate: Personalized Dialogue Agents with Dynamically Adapted Personas

Jun 20, 2024
Via arXiv

No Two Devils Alike: Unveiling Distinct Mechanisms of Fine-tuning Attacks

May 25, 2024
Via arXiv

Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue

Feb 10, 2024
Via arXiv

Mitigating Unhelpfulness in Emotional Support Conversations with Multifaceted AI Feedback

Jan 11, 2024
Via arXiv

COOPER: Coordinating Specialized Agents towards a Complex Dialogue Goal

Dec 19, 2023
Via arXiv

Self-Detoxifying Language Models via Toxification Reversal

Oct 14, 2023
Via arXiv