Chung-En Sun

ThinkEdit: Interpretable Weight Editing to Mitigate Overly Short Thinking in Reasoning Models

Mar 27, 2025
Via arXiv

Effective Skill Unlearning through Intervention and Abstention

Mar 27, 2025
Via arXiv

Interpretable Generative Models through Post-hoc Concept Bottlenecks

Mar 25, 2025
Via arXiv

Concept Bottleneck Large Language Models

Dec 11, 2024
Via arXiv

Iterative Self-Tuning LLMs for Enhanced Jailbreaking Capabilities

Oct 24, 2024
Via arXiv

Crafting Large Language Models for Enhanced Interpretability

Jul 05, 2024
Via arXiv

Breaking the Barrier: Enhanced Utility and Robustness in Smoothed DRL Agents

Jun 26, 2024
Via arXiv

NTIRE 2020 Challenge on NonHomogeneous Dehazing

May 07, 2020
Via arXiv