Picture for Chak Tou Leong

Chak Tou Leong

Why Safeguarded Ships Run Aground? Aligned Large Language Models' Safety Mechanisms Tend to Be Anchored in The Template Region

Add code
Feb 19, 2025
Viaarxiv icon

TokenSkip: Controllable Chain-of-Thought Compression in LLMs

Add code
Feb 17, 2025
Viaarxiv icon

Imitate Before Detect: Aligning Machine Stylistic Preference for Machine-Revised Text Detection

Add code
Dec 22, 2024
Viaarxiv icon

Direct Preference Optimization Using Sparse Feature-Level Constraints

Add code
Nov 12, 2024
Figure 1 for Direct Preference Optimization Using Sparse Feature-Level Constraints
Figure 2 for Direct Preference Optimization Using Sparse Feature-Level Constraints
Figure 3 for Direct Preference Optimization Using Sparse Feature-Level Constraints
Figure 4 for Direct Preference Optimization Using Sparse Feature-Level Constraints
Viaarxiv icon

Subtle Errors Matter: Preference Learning via Error-injected Self-editing

Add code
Oct 09, 2024
Figure 1 for Subtle Errors Matter: Preference Learning via Error-injected Self-editing
Figure 2 for Subtle Errors Matter: Preference Learning via Error-injected Self-editing
Figure 3 for Subtle Errors Matter: Preference Learning via Error-injected Self-editing
Figure 4 for Subtle Errors Matter: Preference Learning via Error-injected Self-editing
Viaarxiv icon

Deeper Insights Without Updates: The Power of In-Context Learning Over Fine-Tuning

Add code
Oct 07, 2024
Figure 1 for Deeper Insights Without Updates: The Power of In-Context Learning Over Fine-Tuning
Figure 2 for Deeper Insights Without Updates: The Power of In-Context Learning Over Fine-Tuning
Figure 3 for Deeper Insights Without Updates: The Power of In-Context Learning Over Fine-Tuning
Figure 4 for Deeper Insights Without Updates: The Power of In-Context Learning Over Fine-Tuning
Viaarxiv icon

E2CL: Exploration-based Error Correction Learning for Embodied Agents

Add code
Sep 05, 2024
Viaarxiv icon

Evolving to be Your Soulmate: Personalized Dialogue Agents with Dynamically Adapted Personas

Add code
Jun 20, 2024
Figure 1 for Evolving to be Your Soulmate: Personalized Dialogue Agents with Dynamically Adapted Personas
Figure 2 for Evolving to be Your Soulmate: Personalized Dialogue Agents with Dynamically Adapted Personas
Figure 3 for Evolving to be Your Soulmate: Personalized Dialogue Agents with Dynamically Adapted Personas
Figure 4 for Evolving to be Your Soulmate: Personalized Dialogue Agents with Dynamically Adapted Personas
Viaarxiv icon

No Two Devils Alike: Unveiling Distinct Mechanisms of Fine-tuning Attacks

Add code
May 25, 2024
Viaarxiv icon

Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue

Add code
Feb 10, 2024
Viaarxiv icon