Picture for Nenghai Yu

Nenghai Yu

Provably Secure Agent Guardrail

Add code
May 28, 2026
Viaarxiv icon

Advancing Aesthetic Image Generation via Composition Transfer

Add code
May 06, 2026
Viaarxiv icon

When Agents Look the Same: Quantifying Distillation-Induced Similarity in Tool-Use Behaviors

Add code
Apr 23, 2026
Viaarxiv icon

VLMShield: Efficient and Robust Defense of Vision-Language Models against Malicious Prompts

Add code
Apr 07, 2026
Viaarxiv icon

Learning to Focus and Precise Cropping: A Reinforcement Learning Framework with Information Gaps and Grounding Loss for MLLMs

Add code
Mar 29, 2026
Viaarxiv icon

State-Dependent Safety Failures in Multi-Turn Language Model Interaction

Add code
Mar 15, 2026
Viaarxiv icon

SWIFT: Sliding Window Reconstruction for Few-Shot Training-Free Generated Video Attribution

Add code
Mar 09, 2026
Viaarxiv icon

HiAR: Efficient Autoregressive Long Video Generation via Hierarchical Denoising

Add code
Mar 09, 2026
Viaarxiv icon

Rethinking Multi-Condition DiTs: Eliminating Redundant Attention via Position-Alignment and Keyword-Scoping

Add code
Feb 06, 2026
Viaarxiv icon

Character as a Latent Variable in Large Language Models: A Mechanistic Account of Emergent Misalignment and Conditional Safety Failures

Add code
Jan 30, 2026
Viaarxiv icon