Picture for Baolei Zhang

Baolei Zhang

Prompt-Guided Internal States for Hallucination Detection of Large Language Models

Add code
Nov 07, 2024
Viaarxiv icon

BadActs: A Universal Backdoor Defense in the Activation Space

Add code
May 18, 2024
Figure 1 for BadActs: A Universal Backdoor Defense in the Activation Space
Figure 2 for BadActs: A Universal Backdoor Defense in the Activation Space
Figure 3 for BadActs: A Universal Backdoor Defense in the Activation Space
Figure 4 for BadActs: A Universal Backdoor Defense in the Activation Space
Viaarxiv icon