Xiaoyu Xu

Weak-To-Strong Backdoor Attacks for LLMs with Contrastive Knowledge Distillation

Sep 26, 2024
Via arXiv