Yinxing Xue

Self and Cross-Model Distillation for LLMs: Effective Methods for Refusal Pattern Alignment

Jun 17, 2024
Via arXiv

A Cross-Language Investigation into Jailbreak Attacks in Large Language Models

Jan 30, 2024
Via arXiv