Picture for Lijie Hu

Lijie Hu

Dissecting Misalignment of Multimodal Large Language Models via Influence Function

Add code
Nov 18, 2024
Viaarxiv icon

SMoA: Improving Multi-agent Large Language Models with Sparse Mixture-of-Agents

Add code
Nov 05, 2024
Viaarxiv icon

Towards Multi-dimensional Explanation Alignment for Medical Classification

Add code
Oct 28, 2024
Viaarxiv icon

Private Language Models via Truncated Laplacian Mechanism

Add code
Oct 10, 2024
Figure 1 for Private Language Models via Truncated Laplacian Mechanism
Figure 2 for Private Language Models via Truncated Laplacian Mechanism
Figure 3 for Private Language Models via Truncated Laplacian Mechanism
Figure 4 for Private Language Models via Truncated Laplacian Mechanism
Viaarxiv icon

Dissecting Fine-Tuning Unlearning in Large Language Models

Add code
Oct 09, 2024
Viaarxiv icon

Faithful Interpretation for Graph Neural Networks

Add code
Oct 09, 2024
Viaarxiv icon

Locate-then-edit for Multi-hop Factual Recall under Knowledge Editing

Add code
Oct 08, 2024
Viaarxiv icon

CAT: Concept-level backdoor ATtacks for Concept Bottleneck Models

Add code
Oct 07, 2024
Viaarxiv icon

What makes your model a low-empathy or warmth person: Exploring the Origins of Personality in LLMs

Add code
Oct 07, 2024
Figure 1 for What makes your model a low-empathy or warmth person: Exploring the Origins of Personality in LLMs
Figure 2 for What makes your model a low-empathy or warmth person: Exploring the Origins of Personality in LLMs
Figure 3 for What makes your model a low-empathy or warmth person: Exploring the Origins of Personality in LLMs
Figure 4 for What makes your model a low-empathy or warmth person: Exploring the Origins of Personality in LLMs
Viaarxiv icon

Understanding Reasoning in Chain-of-Thought from the Hopfieldian View

Add code
Oct 04, 2024
Viaarxiv icon