Picture for Yilei Jiang

Yilei Jiang

HiddenDetect: Detecting Jailbreak Attacks against Large Vision-Language Models via Monitoring Hidden States

Add code
Feb 21, 2025
Viaarxiv icon

Equilibrate RLHF: Towards Balancing Helpfulness-Safety Trade-off in Large Language Models

Add code
Feb 17, 2025
Viaarxiv icon

RapGuard: Safeguarding Multimodal Large Language Models via Rationale-aware Defensive Prompting

Add code
Dec 25, 2024
Viaarxiv icon

DebiasDiff: Debiasing Text-to-image Diffusion Models with Self-discovering Latent Attribute Directions

Add code
Dec 25, 2024
Viaarxiv icon

Event-Customized Image Generation

Add code
Oct 03, 2024
Figure 1 for Event-Customized Image Generation
Figure 2 for Event-Customized Image Generation
Figure 3 for Event-Customized Image Generation
Figure 4 for Event-Customized Image Generation
Viaarxiv icon