Picture for Jiaheng Zhang

Jiaheng Zhang

STEP: Detecting Audio Backdoor Attacks via Stability-based Trigger Exposure Profiling

Add code
Mar 18, 2026
Viaarxiv icon

Gym-V: A Unified Vision Environment System for Agentic Vision Research

Add code
Mar 17, 2026
Viaarxiv icon

Efficient Self-Evaluation for Diffusion Language Models via Sequence Regeneration

Add code
Mar 03, 2026
Viaarxiv icon

IMMACULATE: A Practical LLM Auditing Framework via Verifiable Computation

Add code
Feb 26, 2026
Viaarxiv icon

KLong: Training LLM Agent for Extremely Long-horizon Tasks

Add code
Feb 19, 2026
Viaarxiv icon

On Representation Redundancy in Large-Scale Instruction Tuning Data Selection

Add code
Feb 14, 2026
Viaarxiv icon

MemPot: Defending Against Memory Extraction Attack with Optimized Honeypots

Add code
Feb 07, 2026
Viaarxiv icon

Secure Code Generation via Online Reinforcement Learning with Vulnerability Reward Model

Add code
Feb 07, 2026
Viaarxiv icon

Reliable and Responsible Foundation Models: A Comprehensive Survey

Add code
Feb 04, 2026
Viaarxiv icon

Beyond Hard Masks: Progressive Token Evolution for Diffusion Language Models

Add code
Jan 12, 2026
Viaarxiv icon