Picture for Huanqian Wang

Huanqian Wang

Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing

Add code
Jul 11, 2024
Figure 1 for Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing
Figure 2 for Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing
Figure 3 for Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing
Figure 4 for Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing
Viaarxiv icon

Leveraging Reward Consistency for Interpretable Feature Discovery in Reinforcement Learning

Add code
Sep 04, 2023
Viaarxiv icon