KDA: A Knowledge-Distilled Attacker for Generating Diverse Prompts to Jailbreak LLMs

Feb 05, 2025


