Picture for Junda Zhu

Junda Zhu

Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking

Add code
Feb 18, 2025
Viaarxiv icon

DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak

Add code
Dec 23, 2024
Figure 1 for DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak
Figure 2 for DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak
Figure 3 for DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak
Figure 4 for DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak
Viaarxiv icon

ATM: Adversarial Tuning Multi-agent System Makes a Robust Retrieval-Augmented Generator

Add code
May 28, 2024
Viaarxiv icon

A Survey of Neural Network Robustness Assessment in Image Recognition

Add code
Apr 15, 2024
Viaarxiv icon