Picture for Yuanhe Zhang

Yuanhe Zhang

From Helpfulness to Toxic Proactivity: Diagnosing Behavioral Misalignment in LLM Agents

Add code
Feb 04, 2026
Viaarxiv icon

Statistical Learning Theory in Lean 4: Empirical Processes from Scratch

Add code
Feb 02, 2026
Viaarxiv icon

SEE: Signal Embedding Energy for Quantifying Noise Interference in Large Audio Language Models

Add code
Jan 12, 2026
Viaarxiv icon

RECALLED: An Unbounded Resource Consumption Attack on Large Vision-Language Models

Add code
Jul 24, 2025
Viaarxiv icon

$PD^3F$: A Pluggable and Dynamic DoS-Defense Framework Against Resource Consumption Attacks Targeting Large Language Models

Add code
May 24, 2025
Viaarxiv icon

LIFEBench: Evaluating Length Instruction Following in Large Language Models

Add code
May 22, 2025
Viaarxiv icon

CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models

Add code
Feb 20, 2025
Viaarxiv icon

DemonAgent: Dynamically Encrypted Multi-Backdoor Implantation Attack on LLM-based Agent

Add code
Feb 18, 2025
Viaarxiv icon

One-step full gradient suffices for low-rank fine-tuning, provably and efficiently

Add code
Feb 03, 2025
Figure 1 for One-step full gradient suffices for low-rank fine-tuning, provably and efficiently
Figure 2 for One-step full gradient suffices for low-rank fine-tuning, provably and efficiently
Figure 3 for One-step full gradient suffices for low-rank fine-tuning, provably and efficiently
Figure 4 for One-step full gradient suffices for low-rank fine-tuning, provably and efficiently
Viaarxiv icon

Crabs: Consuming Resrouce via Auto-generation for LLM-DoS Attack under Black-box Settings

Add code
Dec 18, 2024
Figure 1 for Crabs: Consuming Resrouce via Auto-generation for LLM-DoS Attack under Black-box Settings
Figure 2 for Crabs: Consuming Resrouce via Auto-generation for LLM-DoS Attack under Black-box Settings
Figure 3 for Crabs: Consuming Resrouce via Auto-generation for LLM-DoS Attack under Black-box Settings
Figure 4 for Crabs: Consuming Resrouce via Auto-generation for LLM-DoS Attack under Black-box Settings
Viaarxiv icon