Xiaojun Jia

Know Thy Enemy: Securing LLMs Against Prompt Injection via Diverse Data Synthesis and Instruction-Level Chain-of-Thought Learning

Jan 08, 2026

GAMBIT: A Gamified Jailbreak Framework for Multimodal Large Language Models

Jan 06, 2026

Casting a SPELL: Sentence Pairing Exploration for LLM Limitation-breaking

Dec 24, 2025

Odysseus: Jailbreaking Commercial Multimodal LLM-integrated Systems via Dual Steganography

Dec 23, 2025

Shedding Light on VLN Robustness: A Black-box Framework for Indoor Lighting-based Adversarial Attack

Nov 17, 2025

Beyond Pixels: Semantic-aware Typographic Attack for Geo-Privacy Protection

Nov 16, 2025

LLM Jailbreak Detection for (Almost) Free!

Sep 18, 2025

PhysPatch: A Physically Realizable and Transferable Adversarial Patch Attack for Multimodal Large Language Models-based Autonomous Driving Systems

Aug 07, 2025

The Emotional Baby Is Truly Deadly: Does your Multimodal Large Reasoning Model Have Emotional Flattery towards Humans?

Aug 06, 2025

3D Gaussian Splatting Driven Multi-View Robust Physical Adversarial Camouflage Generation

Jul 02, 2025