Picture for Yunhan Zhao

Yunhan Zhao

DSA: Dynamic Step Allocation for Fast Autoregressive Video Generation

Add code
Jun 03, 2026
Viaarxiv icon

Chain of Risk: Safety Failures in Large Reasoning Models and Mitigation via Adaptive Multi-Principle Steering

Add code
May 07, 2026
Viaarxiv icon

HazardArena: Evaluating Semantic Safety in Vision-Language-Action Models

Add code
Apr 14, 2026
Viaarxiv icon

Generating a Paracosm for Training-Free Zero-Shot Composed Image Retrieval

Add code
Feb 03, 2026
Viaarxiv icon

A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5

Add code
Jan 16, 2026
Viaarxiv icon

AttackVLA: Benchmarking Adversarial and Backdoor Attacks on Vision-Language-Action Models

Add code
Nov 15, 2025
Figure 1 for AttackVLA: Benchmarking Adversarial and Backdoor Attacks on Vision-Language-Action Models
Figure 2 for AttackVLA: Benchmarking Adversarial and Backdoor Attacks on Vision-Language-Action Models
Figure 3 for AttackVLA: Benchmarking Adversarial and Backdoor Attacks on Vision-Language-Action Models
Figure 4 for AttackVLA: Benchmarking Adversarial and Backdoor Attacks on Vision-Language-Action Models
Viaarxiv icon

Defense-to-Attack: Bypassing Weak Defenses Enables Stronger Jailbreaks in Vision-Language Models

Add code
Sep 16, 2025
Viaarxiv icon

BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks

Add code
Oct 28, 2024
Figure 1 for BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks
Figure 2 for BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks
Figure 3 for BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks
Figure 4 for BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks
Viaarxiv icon

BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks on Large Language Models

Add code
Aug 23, 2024
Figure 1 for BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks on Large Language Models
Figure 2 for BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks on Large Language Models
Figure 3 for BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks on Large Language Models
Figure 4 for BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks on Large Language Models
Viaarxiv icon

Anomaly Detection of Tabular Data Using LLMs

Add code
Jun 24, 2024
Viaarxiv icon