Baolei Zhang

Prompt-Guided Internal States for Hallucination Detection of Large Language Models

Nov 07, 2024
Via arXiv

BadActs: A Universal Backdoor Defense in the Activation Space

May 18, 2024
Via arXiv